150 likes | 286 Vues
This article delves into the transition from tag profiling to mRNA-Seq as a method for gene expression studies, highlighting its significance in the context of New Zealand flora. It reviews challenges and advantages associated with both methodologies while showcasing specific case studies on species diversification and adaptation. The analysis emphasizes how mRNA-Seq can provide comprehensive coverage of the transcriptome, enabling more accurate assessments of gene expression and contributing to a deeper understanding of biological processes.
E N D
Tag profiling is dead... • ...long live mRNA-Seq! Claudia Voelckel Patrick Biggs October 2009
Expression Studies in the New Zealand Flora Hybridization & polyploidy Ourisia Ranunculus Hebe Species diversification & local adaptation Pachycladon Nothofagus Totara We are interested in: Biological processes that differ between species and populations Adaptive gene sets
Expression Studies the Familiar Way: Microarrays DNA chip Sample 1 Sample 2 mRNA mRNA AAAAAA3’ AAAAAA3’ with gene probes AAAAAA3’ AAAAAA3’ TTTTTT5’ TTTTTT5’ red-labeled cDNA TTTTTT5’ green-labeled cDNA TTTTTT5’ DATA ANALYSIS intensity 1 intensity 2 Expression ratio: log
Expression Studies Revolutionized: Tag Profiling AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ Sample 1 mRNA mRNA Sample 2 Solexa Genome Analyzer AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ 18 bp tag library 18 bp tag library TAG MAPPING Reference Sample 1 Sample 2 STATISTICAL ANALYSIS 1 2 2 1 1 1 count 1 count 2 log If needed – build reference transcriptome through RNA seq
Advantages & Challenges of Tag Profiling Advantages • open to any organism (with a reference transcriptome) • any expressed transcript detectable (1 copy/cell) • less RNA needed (tag profiling = 1µg, microarrays = 100 µg) • minor data normalization, cross-species comparisons easier Challenges • mapping 18 bp tags (sequence differences Pachycladon/Arabidopsis) • counting tags per gene (noise, location, abundance) • statistical analysis of differential expression (proportion data)
Tag Profiling Guinea Pig: Previous Microarray Study Pachycladonfastigiata vs. Pachycladonenysii Habitat Rosette Habitat Rosette Fruiting Fruiting Flowering Flowering Comparative gene expression study using Arabidopsis microarrays Voelckel et al. 2008, Molecular Ecology, 17: 4740–4753
Microarray Study Results • Arabidopsis microarray • (20,468 genes) • 310 genes (1.5%) up in P. fastigiata • 324 genes (1.6%) up in P. enysii • up-regulation of ESP and ESM1 • predict P. fastigiatato produce • isothiocyanates and P. enysiito • producenitriles • prediction confirmed by HPLC • role for herbivory in species • diversification? P. fastigiata P. enysii Probability of differential expression ( log odds ratio) ESM1 ESP Magnitude of differential expression (log fold change)
Tag Profiling Results • 8 data sets from different mapping strategies (ELAND, MySQL) • each analyzed with different normalization parameters (R, edgeR) • results vary! Example: P. fastigiata P. enysii • data set 2: • 17423 A. thaliana loci • noise filter 10 • count most abundant tag per gene • analyzed with tagwise normalization • -log2(1.5) < log fold ratio < log2 (1.5) • 2654 genes (15.2%) up in P. fastigiata • 1857 genes (10.7%) up in P. enysii
Microarrays (MA) vs. Tag Profiling (TP) PF 269 41 2613 MA TP MA: 20,468 genes TP: 17,423 genes PE 274 50 1807 MA TP 2654 up in PF 1857 up in PE 310 up in PF 324 up inPE • more differentially expressed genes in TP (10.7-15.2% ) than with MA (1.5-1.6% ) • 13.2% (PF) and 15.4 % (PE) of MA results confirmed by TP results • biological inferences from both studies identical
Tag Profiling is dead, long live mRNA-Seq! • • One year later: Tag profiling works for a non-model plant with a distant reference transcriptome! Let’s do more experiments! • 2 Oct 09: “Illumina is discontinuing the support of Tag Profiling and will no longer be manufacturing the reagent kits for this application.” “...not a popular product, too expensive, tricky chemistry.. instead use: mRNA-Seq!”
Expression Studies Revisited: mRNA-Seq AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ AAA3’ Sample 1 mRNA mRNA Sample 2 Solexa Genome Analyzer cDNA library cDNA library READ MAPPING Reference Sample 1 Sample 2 STATISTICAL ANALYSIS 1 2 2 1 1 1 count 1 count 2 gene length log If needed – build reference transcriptome through RNA seq
Advantages & Challenges of mRNA-Seq Advantages • whole transcriptome coverage • longer reads reduce mapping noise and unmapped reads • multiplex-compatible • adequate coverage (too high with tag profiling) • additional benefits: EST libraries, SNPs • disentangling expression of allopolyloid copies may be easier Challenges • new mapping strategies needed • different statistical treatment required • hardly any R packages available yet
Experiences with mRNA-Seq mRNA-Seq runs (75bp paired end) so far: Tuatara Ourisia Pachycladon Ranunculus await assembly and analysis EST data base built for P. fastigiata analysis in progress
Straight from the Pachycladon EST library: Evidence for allopolyploid copies (e.g. glucosinolate hydrolysis gene)
THANKS TO: Genome Service Patrick Biggs Lorraine Berry Lesley Collins, Maurice Collins, Pete Lockhart Helene Kretzmer Marsden Alexander von Humboldt Foundation