castanyes blaves

Random ramblings about some random stuff, and things; but more stuff than things -- all in a mesmerizing and kaleidoscopic soapbox-like flow of words.

5/21/2009

 

Gene by gene turns genome-by-genome

This is a good example of the kind of paper we are probably going to see more and more often in the future:

Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis
http://www.ncbi.nlm.nih.gov/pubmed/19454017

Get (a) a certain amount of RNAseq reads for your species "X", (b) build as many full-length cDNAs as possible from the fragments and (c) compare against close species in terms of:
  • New interesting cDNAs that don't have hits against existing public cDNAs -- What do they do?
  • Expression patterns -- Are these different to the patterns in other close species?
  • Protein coding evolution -- run pairwise dNdS against closest genome or tree-based dNdS against anĀ existing phylogeny [1,2] -- Does anything show up in a Gene Ontology enrichment analysis?
Then the data is published and stored in a publicly available database, and can be added to the pool to compare against for the next project. Iterate :-)

It used to be gene-by-gene sequencing and it's now transcriptome by transcriptome sequencing. There are still sequence error and sequence coverage issues: one of my first scientific mentors, Prof. Montserrat Aguade was one of the first to do gene sequencing on the Adh gene in Drosophila when doing her PhD in Harvard. People then extended Adh sequencing and analysis to other Drosophila species, then other clades, then other genes, then some gene families like odorant binding proteins for a bunch of Drosophila species or populations, or gene pathways like the insulin pathway, etc.

But now we have a much broader picture with a rather complete transcriptome. And most of the sequencing issues are going to be corrected across the phylogeny in pretty much the same way as allele imputation is filling the gaps at the population genomics level (e.g. 1000 Genomes Project).

I am very excited about all this!

Labels: , ,


Comments: Post a Comment

Subscribe to Post Comments [Atom]





<< Home

Archives

200409   200412   200501   200502   200503   200504   200505   200506   200507   200508   200509   200510   200511   200512   200601   200602   200603   200604   200605   200606   200607   200608   200609   200610   200611   200612   200701   200702   200703   200704   200705   200707   200708   200709   200710   200711   200712   200801   200802   200803   200804   200805   200806   200807   200808   200809   200810   200811   200812   200901   200902   200903   200904   200905   200906   200907   200908   200909   200912   201001   201002   201003   201004   201007   201009   201011   201102  

This page is powered by Blogger. Isn't yours?

Subscribe to Posts [Atom]