castanyes blaves

Random ramblings about some random stuff, and things; but more stuff than things -- all in a mesmerizing and kaleidoscopic soapbox-like flow of words.

5/01/2009

 

NCBI SRA blastn service

Never easier before to check your sequence against the NCBI Short Read Archive database:

NCBI SRA BLAST

First thoughts:
  • Transcriptome coverage is hugely biased to the 3' end (or 5' depending on library preparation). A lot more than I suspected.
  • Would be great to do queries for phylogenetic subclades: e.g. my human sequence against all SRA data for primates.
  • A lot of the 454 data has homopolymer issues, mostly TTT[...]TTTs but also some others:
Query  465  GGGCCTTGACAAAGTGTAAACCGCATGGATGGGCTTCCCC-AAGGATTTATTGACATTGC  523<br /><font color="#ff0000"><b>Sbjct</b></font>  249  ........................................<font color="#ff0000"><b>C</b></font>...................  190<br /><br /></pre><ul><li>Some of these (unless they are real variations) get picked up as mismatches, some as indels:<pre>Query  1    CGGCAAGGTATGTGCGTGATTTTGGGCCCACGTGTATTTCCATTAATTTT-AAGCCGTAA  59<br /><font color="#ff0000"><b>Sbjct</b></font>  224  ..................................................<font color="#ff0000"><b>T</b></font>.........  165<br /><br />Query  60   TTGTCGTTTTTGGCGGTTTCGAGTTGAACTGCGTTAGTCCGTGCGCTGTTCGCAAGTGTG  119<br /><font color="#ff0000"><b>Sbjct</b></font>  164  ..........<font color="#ff0000"><b>C</b></font>.................................................  105<br /></pre></li></ul><pre><br />Query  61   TGTCGTTTTTGGCGGTTTCGAGTTGAACTGCGTTAGTCCGTGCGCTGTTCGCAAGTGTGC  120<br /><font color="#ff0000"><b>Sbjct</b></font>  118  .....<font color="#ff0000"><b>C</b></font>......................................................  177<br />Query  61   TGTCG-TTTTTGGCGGTTTCGAGTTGAACTGCGTTAGTCCGTGCGCTGTTCGCAAGTGTG  119<br /><font color="#ff0000"><b>Sbjct</b></font>  160  .....<font color="#ff0000"><b>T</b></font>...............................<font color="#ff0000"><b>-</b></font>......................  102<br />Query  61   TTAATTTTAAGCCGTAATTGTCGTTTTTGGCGGTTTCGAGTTGAACTGCGTTAGTCCGTG  120<br /><font color="#ff0000"><b>Sbjct</b></font>  181  ...........................<font color="#ff0000"><b>C</b></font>................................  122<br /><br /><br /><br />


Labels: ,


Comments: Post a Comment

Subscribe to Post Comments [Atom]





<< Home

Archives

200409   200412   200501   200502   200503   200504   200505   200506   200507   200508   200509   200510   200511   200512   200601   200602   200603   200604   200605   200606   200607   200608   200609   200610   200611   200612   200701   200702   200703   200704   200705   200707   200708   200709   200710   200711   200712   200801   200802   200803   200804   200805   200806   200807   200808   200809   200810   200811   200812   200901   200902   200903   200904   200905   200906   200907   200908   200909   200912   201001   201002   201003   201004   201007   201009   201011   201102  

This page is powered by Blogger. Isn't yours?

Subscribe to Posts [Atom]