1 / 15

Design experiment

Design experiment. Prepare libraries. Conduct sequencing. Produce FASTQ files. Pre- process data. No reference available. Assembly of genome sequence. R eference available. Map reads to reference genome or transcriptome. Assembly of transcriptome sequence. RNA transcripts.

lyndon
Télécharger la présentation

Design experiment

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Design experiment Prepare libraries Conduct sequencing Produce FASTQ files Pre-process data No reference available Assembly of genome sequence Reference available Map reads toreference genomeor transcriptome Assembly of transcriptome sequence RNAtranscripts DNA modifications ProteinInteractions VariantDiscovery

  2. Produce FASTQ files Conduct sequencing Pre-process data Design experiment Prepare libraries Auer & Doerge, 2010 Genetics 185:405-16 DNA sequences A DNA sequences B Fu et al, 2014, PNAS 111:1891-6 Map reads toreference genomeor transcriptome

  3. Experimental design Genetics 185(2):405-16, 2010 Here seven treatments are applied to each of three biological replicates , and each sample is sequenced in a separate lane Are nuisance effects confounded with treatment effects?

  4. Experimental design Yes – all samples from each rep are on the same flowcell. Flowcelleffects are confounded with replicate effects, and lane effects are confounded with treatments, because T1 is always in lane 1, T2 in lane 2, and so forth. Genetics 185(2):405-16, 2010

  5. Experimental design Auer & Doerge, Genetics 185:405, 2010

  6. Experimental design Same experiment: seven treatments applied to each of three biological replicates , but samples are allocated differently Are nuisance effects confounded with treatment effects now? Auer & Doerge, Genetics 185:405, 2010

  7. Experimental design No – biological replicates are randomized across flowcells, and treatments are randomized across lanes. Biological sources of variation (reps) are orthogonal to technical sources (flowcells and lanes). Auer & Doerge, Genetics 185:405, 2010

  8. Produce FASTQ files Conduct sequencing Pre-process data Design experiment Prepare libraries Auer & Doerge, 2010 Genetics 185:405-16 DNA sequences A DNA sequences B Fu et al, 2014, PNAS 111:1891-6 Map reads toreference genomeor transcriptome

  9. Molecular indexing measures bias Abundant transcripts are detected more times than there are unique start/stop sites for sequence reads Fu et al, PNAS 111:1891–1896, 2014

  10. The more steps, the greater the sample loss… Even with estimated 70% yield at each step, the cumulative yield after 15 steps in RNA-seq library production is very low Fu et al, PNAS 111:1891–1896, 2014

  11. Produce FASTQ files Conduct sequencing Pre-process data Design experiment Prepare libraries Auer & Doerge, 2010 Genetics 185:405-16 DNA sequences A DNA sequences B Fu et al, 2014, PNAS 111:1891-6 Map reads toreference genomeor transcriptome

  12. Retrieve genome annotation from database DNA sequences A DNA sequences B Map reads toreference genomeor transcriptome Merge read alignment with genome annotation to produce tables with counts of aligned reads per genome feature BAM file A Count table A BAM file B Count table B

  13. Table of differentially-expressed genes with annotation Genome annotation from database Merge read alignment with genome annotation to produce tables with counts of aligned reads per genome feature Count table A Statistical comparison of counts, inference of differential expression Count table B

  14. Align reads to reference Assemble transcripts Merge across treatments Count reads per treatment for assembled transcripts, adjusting for bias detected in start- or end-points of RNA fragments to more accurately estimate transcript levels (Roberts et al., Genome Biol 12:R22, 2011) Count reads per treatment for assembled transcripts Visualize differential expression results From http://cufflinks.cbcb.umd.edu/

More Related