
Which ORF?

Presentation Transcript

  1. Which ORF? Jeltje – September 7 2005

  2. Where to start?
     gatgtcatgcgatgttattg
     M R C Y
     gatgtcatgcgatgttattg
     M S C D V I
     gatgtcatgcgatgttattg
     M L L
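The three translations on slide 2 correspond to the three ATG codons in the example fragment, each in a different reading frame. A minimal Python sketch, assuming the standard genetic code and reading from each ATG to the first stop codon or the end of the fragment; the helper names `atg_positions` and `translate_from` are illustrative and do not come from the presentation.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def atg_positions(dna):
    """Return the 0-based index of every ATG in the sequence."""
    dna = dna.upper()
    return [i for i in range(len(dna) - 2) if dna[i:i + 3] == "ATG"]


def translate_from(dna, start):
    """Translate codon by codon from `start` until a stop codon or the end."""
    dna = dna.upper()
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


seq = "gatgtcatgcgatgttattg"  # fragment from slide 2
for pos in atg_positions(seq):
    print(pos, translate_from(seq, pos))
```

Run on the slide's fragment, this lists one candidate start per frame, which is exactly the ambiguity the slide title asks about.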

  3. Eukaryotes …A/GNNAUGG…… Methylated cap small ribosomal subunit

  4. Eukaryotes …A/GNNAUGG…… Methylated cap

  5. Eukaryotes …A/GNNAUGG……

  6. Eukaryotes …A/GNNAUGG…… Large ribosomal subunit

  7. Eukaryotes …A/GNNAUGG…… M
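The consensus shown on slides 3 to 7, A/GNNAUGG, can be read as a purine three bases upstream of the start codon and a G immediately after it. A small sketch of testing candidate start codons against that pattern, assuming DNA or RNA input as a plain string; the function name `kozak_starts` is hypothetical.

```python
import re

# Consensus from the slides: A or G at position -3 and G at +4,
# counting the A of the start codon as +1. Written for DNA (T instead of U).
# The lookahead lets overlapping candidates be found.
KOZAK = re.compile(r"(?=([AG]..ATGG))")


def kozak_starts(seq):
    """Return 0-based positions of ATG codons in the A/GNNATGG context."""
    dna = seq.upper().replace("U", "T")
    return [m.start() + 3 for m in KOZAK.finditer(dna)]


print(kozak_starts("GCCACCATGGCG"))  # ATG with A at -3 and G at +4
print(kozak_starts("TTTCCCATGTAA"))  # ATG with C at -3 and T at +4
```

This only tests the two positions named in the consensus; it says nothing about how strongly a given context actually drives initiation.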

  8. Leaky scanning HIC …CNNAUGTGCGTTAUGG……
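In the sequence on slide 8, the first AUG has a C at -3 and a T at +4, while the second has a G at both positions. A deliberately simplified sketch of leaky scanning, assuming the ribosome skips every AUG that misses the consensus at either position and initiates at the first one that matches both; real initiation is probabilistic, and the names `context_matches` and `scan_for_start` are illustrative.

```python
def context_matches(dna, atg):
    """True if the ATG at index `atg` has a purine at -3 and a G at +4."""
    if atg < 3 or atg + 3 >= len(dna):
        return False  # not enough flanking sequence to judge
    return dna[atg - 3] in "AG" and dna[atg + 3] == "G"


def scan_for_start(mrna):
    """Walk 5' to 3' and return (chosen_start, skipped_starts)."""
    dna = mrna.upper().replace("U", "T")
    skipped = []
    pos = dna.find("ATG")
    while pos != -1:
        if context_matches(dna, pos):
            return pos, skipped
        skipped.append(pos)
        pos = dna.find("ATG", pos + 1)
    return None, skipped


# Sequence from slide 8 (N = any base).
print(scan_for_start("CNNAUGTGCGTTAUGG"))
```

On the slide's sequence the first AUG lands in the skipped list and the second is chosen, which is the behaviour the slide illustrates.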

  9. Skipping AUG: In some cases translation is initiated but terminates on encountering the second AUG. Internal Ribosome Entry Site (IRES): not sequence-specific; viral (only?)

  10. MGC genes: Tested 1000 MGC genes (skipped genes with the same ORF). Looked at the longest ORF, the first ORF, and the longest first ORF (picked the longest from the three frames). ORFs must be >5 aa. Compared to the ‘called’ ORF in GenBank.
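The procedure on slide 10 can be sketched roughly as follows. This is not the script used for the presentation: it assumes an ORF runs from an ATG to the next in-frame stop codon (or to the end of the sequence), searches only the three forward frames, and takes "first" to mean the smallest start coordinate. The names `find_orfs` and `first_and_longest` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
MIN_AA = 6  # the slide requires ORFs longer than 5 amino acids


def find_orfs(dna, min_aa=MIN_AA):
    """Return sorted (start, protein) pairs for ORFs in the three forward frames."""
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        i = frame
        while i + 3 <= len(dna):
            if dna[i:i + 3] == "ATG":
                protein = []
                j = i
                while j + 3 <= len(dna):
                    aa = CODON_TABLE.get(dna[j:j + 3], "X")  # X for ambiguous codons
                    if aa == "*":
                        break
                    protein.append(aa)
                    j += 3
                if len(protein) >= min_aa:
                    orfs.append((i, "".join(protein)))
                i = j  # resume after this ORF; nested in-frame ATGs are not reported
            i += 3
    return sorted(orfs)


def first_and_longest(dna):
    """Return the ORF with the smallest start and the ORF with the most residues."""
    orfs = find_orfs(dna)
    if not orfs:
        return None, None
    return orfs[0], max(orfs, key=lambda orf: len(orf[1]))


# Toy cDNA: a 6-aa ORF in frame 0 followed by a 9-aa ORF in frame 1.
toy = "ATG" + "GCT" * 5 + "TAA" + "C" + "ATG" + "AAA" * 8 + "TGA"
print(first_and_longest(toy))
```

In the toy sequence the first and longest ORFs differ, which is the situation slide 12 tallies for the 113 remaining genes.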

  11. MGC genes
      • Of 1000 genes
        • For 887, the first large ORF is the largest ORF
          • Of those, only 388 have the A/GNNATGG consensus
      • MGC ORFs:
        • 845 are the same as first/largest ORF
        • 35 are a subset of the first/largest (all skip first M)
        • 6 pick another ORF (1 not found)

  12. MGC genes
      • Of 1000 genes (the remaining 113)
        • In 102 cases, the annotated ORF is the longest, not the first
        • In 3 cases, the annotated ORF is a subset of the longest ORF
        • In 6 cases, the annotated ORF is the first, not the longest
        • 1 annotated ORF cannot be found
        • 1 annotated ORF is neither the first nor the longest
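The categories tallied on slides 11 and 12 amount to comparing the annotated protein with the first and the longest predicted ORF. A hedged sketch of such a comparison, assuming all three are given as amino-acid strings and that "subset" means the annotated protein is a C-terminal portion of the predicted one (it starts at a later M in the same frame); the function `classify` and its labels are illustrative, not the presentation's own code.

```python
from collections import Counter


def classify(annotated, first, longest):
    """Assign an annotated protein to one of the tally categories."""
    if annotated is None:
        return "not found"
    if annotated == first and annotated == longest:
        return "same as first/longest"
    if annotated == longest:
        return "longest, not first"
    if annotated == first:
        return "first, not longest"
    if longest.endswith(annotated):
        return "subset of longest"
    if first.endswith(annotated):
        return "subset of first"
    return "neither first nor longest"


# Toy records: (annotated, first, longest). Invented for illustration only.
records = [
    ("MKLV", "MKLV", "MKLV"),
    ("MAAAAAAA", "MKLV", "MAAAAAAA"),
    ("MKLV", "MKLV", "MAAAAAAA"),
    ("MQRS", "MKLV", "MTTTMQRS"),
    ("MWWW", "MKLV", "MAAAAAAA"),
    (None, "MKLV", "MAAAAAAA"),
]
tally = Counter(classify(*r) for r in records)
for category, count in tally.items():
    print(count, category)
```

The order of the checks matters: exact matches are tested before the suffix test, so a protein identical to the longest ORF is never reported as a subset of it.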

  13. Examples: GenBank ORF is first
      >longest
      MSLSLVFRAASYFKLVPFHSSSSNQFLQPPGWVVLTQTLVLLHFERFSYQNVPKSAQGKGNLQPETNIHLFHFLTFPKQISRNLFNSLLCLMCLTYF
      >first
      MTNVYSLDGILVFGLLFVCTCAYFKKVPRLKTWLLSEKKGVWGVFYKAAVIGTRLHAAVAIACVVMAFYVLFIK
      (Longest not found in mouse)

  14. GenBank neither first nor longest
      >longest
      MESDPRICTMGNQEWPGWVPPPGPASSPPNCPHPMDEAGGTFGAKPACLPAPCLTRASFQLALPPAGPWAWPGPTGGYGLGSPSPLRGWRATSLGCYNLTPDSIGPLPLPRAPRSAALRLNMSARPCQCCGTPVRASDCVCRRDAGTRGCVCMCVCVRAACPPVCMVCGLGPHPWPEHFILWGRGADLVGGAPL
      >first
      MGGGRAPPERLGGCR
      >GBprot
      MRCLSSKKAGSTSVVKYIKTWRPRYFLLKSDGSFIGYKERPEAPDQTLPPLNNFSVAECQLMKTERPRPNTFVIRCLQWTTVIERTFHVDSPDEREEWMRAIQMVANSLQPHLCAQTRIWKTPPPAQAWAVGRLEIQVLIHTSPSEG

  15. GenBank ORF is subset of longest
      >longest
      MSKRRMSVGQQTWALLCKNCLKKWRMKRQTLLEWLFSFLLVLFLYLFFSNLHQVHDTPQMSSMDLGRVDSFNDTNYVIAFAPESKTTQEIMNKVASAPFLKGRTIMGWPDEKSMDELDLNYSIDAVRVIFTDTFSYHLKFSWGHRIPMMKEHRDHSAHCQAVNEKMKCEGSEFWEKGFVAFQAAINAAIIEIATNHSVMEQLMSVTGVHMKILPFVAQGGVATDFFIFFCIISFSTFIYYVSVNVTQERQYITSLMTMMGLRESAFW
      >first
      MGSSLQELSQKMENEKTDLVGMALFISSGTVSVPIFLQFTSSS
      >GBprot
      MGWPDEKSMDELDLNYSIDAVRVIFTDTFSYHLKFSWGHRIPMMKEHRDHSAHCQAVNEKMKCEGSEFWEKGFVAFQAAINAAIIEIATNHSVMEQLMSVTGVHMKILPFVAQGGVATDFFIFFCIISFSTFIYYVSVNVTQERQYITSLMTMMGLRESAFW
      (Longest found in mouse)