1 / 21

The Deciphering of Ctenophore Genomes:

The Deciphering of Ctenophore Genomes: How to make a fully automatic transcriptome annotation for electrophysiologists & field biologists?. David Orion Girardo Worcester Polytechnic Institute Bioinformatics and Computational Biology/ Mathematics (CS minor) Moroz Lab.

zalman
Télécharger la présentation

The Deciphering of Ctenophore Genomes:

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Deciphering of Ctenophore Genomes: How to make a fully automatic transcriptome annotation for electrophysiologists & field biologists? David Orion Girardo Worcester Polytechnic Institute Bioinformatics and Computational Biology/ Mathematics (CS minor) Moroz Lab Photo courtesy of Mat Citarella

  2. How did the Nervous System Evolve? • Still poorly understood • Applications for fundamental neuroscience & regeneration medicine

  3. Model organism: the Ctenophore Pleurobrachiabachei • The most basally branched animal lineage with ‘true’ neurons and muscles • No identified intercellular signal molecules

  4. Goals for a 1st year summer student in the Moroz lab: • One-click transcriptome analysis pipeline within one day • Develop secretory signalling peptide prediction system • Integratetranscriptome annotation & neuropeptide predictions in pipeline

  5. “Automatic Genome-wide annotation pipelines are hallmarks of large sequencing centers” • Manual analysis costs a lots of Time and Money • Few centers have fully automated analysis • UF does not have this pipeline • Still manual – Costs ~$3000/few months w/out visualization

  6. Supercomputer 1 Day Never been done

  7. Basic Definitions • Read is any segment of DNA from the sequencing. • Contig(from contiguous) is a set of overlapping DNA segments derived from a single genetic source.

  8. Assembly requires multiple steps Reads NewblerCap3 MIRA Contigs

  9. Annotation Pipeline Contigs mpiBlastx Pfam Blastx NR Blastx SP Annot8r GO KEGG Database

  10. What are Signaling Peptides? • Small secreted proteins • Effect nervous system in many different ways • Hypothesis: Older than Classical Neurotransmitters

  11. Precursor Neuropeptide Signal peptide Internal Repeats Basic Cleavage sites No Transmembrane Domain

  12. Implementation of the Pipeline Contigs SignalP TMHMM Need to computationally integrate all predictions Phobius TargetP Neuropred Database

  13. Predicted Secretory Products 38 Products (most stringent criteria) Secreted Cell Guidence Molecules Secreted Proteolytic Enzyme Toxins Neuropeptides Additional 453 predicted products under less stringent criteria out of 19573

  14. Predicted prohormones are differentially expressed % Expression Tentacles

  15. Predicted prohormones have few introns

  16. Homologs Across Phyla Cutoff at <10-4

  17. “2,000lines of HASKELL” Leads to…

  18. ‘0-Click’ transcriptome analysis pipeline • De Novo computational predictions of signaling secretory peptides yield good results • Neuropeptide annotation has been integrated inthe transcriptomeanalysis pipeline

  19. Acknowledgements • Dr Leonid Moroz • Mat Citarella • Dr Andrea Kohn • Yelena Bobkova • Jim Netherton • Josh Swore • NSF, NIH

More Related