1 / 25

Biological Sequence Analysis

Biological Sequence Analysis. 140.638.01. The materials used in this class are made possible by:. Zhiping Weng, http://zlab.bu.edu Wenyi Wang Zhijin Wu Garland publishing, Alberts’s the Cell And the wealth of internet resources. Who are we?. Sining Chen Carlo Colantuoni

gryta
Télécharger la présentation

Biological Sequence Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Biological Sequence Analysis 140.638.01

  2. The materials used in this class are made possible by: • Zhiping Weng, http://zlab.bu.edu • Wenyi Wang • Zhijin Wu • Garland publishing, Alberts’s the Cell • And the wealth of internet resources

  3. Who are we? • Sining Chen • Carlo Colantuoni • Giovanni Parmigiani

  4. Who are you? • Field of research • Stats & computing background • Register or audit • Why are you taking this course • Specific topics you are interested

  5. Administrative Details http://astor.som.jhmi.edu/~sining/BSA/syllabus.htm

  6. The MHS program in Bioinfo • Jointly offered by Dept. Biostatistics and Molecular Microbiology and Immunology • An intensive one-year program that emphasizes biology, statistical methods, and computing

  7. Goal of the class • Learn to look at biological sequences from a probabilistic point of view • Understand algorithms behind routine operations, e.g. BLAST. • Be able to build statistical model to solve problems involving sequences

  8. Biological Sequence Analysis: Basic Biological Concepts Carlo Colantuoni Clinical Brain Disorders Branch, NIMH, NIH Dept. Biostatistics, JHSPH ccolantu@jhsph.edu colantuc@intra.nimh.nih.gov

  9. Replication DNA Transcription Translation Molecular Cell Biology: Central Dogma RNA Protein Sequence analysis important at all 3 levels

  10. The Human Genome Genomic Content: 3.3 billion bases ~30K genes 23 chromosomes (22+X/Y) Millions of variants DAD MOM 2 copies in every cell (46 chr) One copy from each parent Each parent passes on a “mixed copy” YOU

  11. Nucleotides are the chemical building block of Nucleic Acids: DNA and RNA

  12. Nucleotides are the chemical building block of Nucleic Acids: DNA and RNA

  13. Protein-coding genes are not easy to find - gene density is low, and exons are interrupted by introns. From Genomic DNA to mRNA Transcripts EXONS INTRONS ~30K >30K Promoters Alternative splicing Poly-Adenylation

  14. Molecular Cell Biology: Components of the Central Dogma Protein Translation protein coding START STOP mRNA AAAAA 5’ UTR 3’ UTR Transcription Genomic DNA 3.3 Gb

  15. Translation - Protein Synthesis:Every 3 nucleotides (codon) are translated into one amino acid DNA: A T G C 1:1 RNA: A U G C 3:1 Protein: 20 amino acids Replication Transcription Translation

  16. Translation - Protein Synthesis RNA Protein 5’ -> 3’ : N-term -> C-term

  17. Nucleotide sequence determines the amino acid sequence

  18. The Human Genome Genomic Content: 3.3 billion bases ~30K genes 23 chromosomes (22+X/Y) 2 copies in every cell One copy from each parent Each parent passes on a “mixed copy” DAD MOM Deletions Insertions Mutations Evolutionary Scale YOU

  19. Biological Sequence Analysis: Primary Concepts Homologue Paralogue Ortholog Identity & Similarity

More Related