230 likes | 353 Vues
This guide provides an overview of key bioinformatics resources crucial for biochemistry research. It highlights various literature sources, including NCBI's extensive book collection and PubMed's vast citation database. Important information on downloading articles from the web, proper article management practices, and literature search techniques is also covered. Additionally, it introduces users to essential databases for genomic and protein analysis, emphasizing the significance of using reliable tools and cross-validation. Enhance your bioinformatics analysis and literature search with this essential resource.
E N D
Bioinformatics I Stephen Tsui Biochemistry
Sources of Literature • Books in NCBI Currently 61 books on the shelf http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Books&itool=toolbar • Selected Biochemistry-related books Molecular Cell BiologyGenomes Molecular Biology of the Cell Biochemistry Modern Genetic Analysis Retroviruses Introduction to Genetic Analysis Immunobiology Genes and Diseases Neurosciences Human Molecular Genetics 2 Developmental Biology Sequence-Evolution-Function Basic Neurochemistry The Cell – A Molecular Approach Endocrinology
Where can I get an article? • PubMed in NCBI More than 16 million citations from MEDLINE. For PubMed Central articles or journal articles that have been subscribed by our University, the full-text PDF files could be downloaded from the Web. http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed • Electronic journals http://www.lib.cuhk.edu.hk/electronic/jol.htm • The corresponding author of the article • Friends in other institutions
Important Notes for Downloading Articles from the Web • Don’t post the PDF files on Web • Be careful when you distribute a downloaded article • Prevent systematic download of a whole issue or all issues of a journal • Print out articles only if you plan to read them • Sort your articles into different folders in your computer • Name your articles by their titles
How to Conduct a Comprehensive Literature Search? • Be familiarized with the PubMed by browsing the PubMed tutorial http://www.nlm.nih.gov/bsd/disted/pubmed.html • Type the keywords in the PubMed page http://www.ncbi.nlm.nih.gov/entrez/query.fcgi e.g. SARS AND spike AND receptor • Save the abstracts in a text format • Read the abstracts and remove unrelated records • Download the essential PDF files and read the most recent review to have an overview of the issue.
ENTREZ • Entrez integrates the scientific literature, DNA and protein sequence databases. http://www.ncbi.nlm.nih.gov/gquery/gquery.fcgi • Properly use “AND”, “OR”, “NOT” and “[ti]” strings
Online Mendelian Inheritance in Man http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM Online Mendelian Inheritance in Man (OMIM) is a catalog of human genes and genetic disorders. The team has done an excellent job in summarizing and updating information about genes and diseases. Example: Genetic disorders: Lung cancer, AIDS Genes: p53, FHL2
Taxonomy Browser http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=TAXONOMY The NCBI taxonomy database contains the names of all organisms that are represented in the genetic databases with at least one nucleotide or protein sequence. Examples: Mus musculus, Bos taurus, Mycoplasma genitalium
Protein-Protein Interaction Databases • Protein-protein interactions could be experimentally revealed or computer predicted. Both of them have important implications on the protein functions. For example: MIPS Protein-Protein Interaction Database http://mips.gsf.de/proj/ppi/
Genome Biology • Selection of organism specific genomic databases: Human -http://www.ncbi.nlm.nih.gov/genome/guide/human/ Mouse -http://www.informatics.jax.org/ Drosophila malanogaster -http://flybase.bio.indiana.edu/ Caenorhabditis elegans -http://www.wormbase.org/ Escherichia coli -http://www.ecocyc.org/ Saccharomyces cerevisiae -http://www.yeastgenome.org/
Four Golden Rules for Bioinformatics Analysis • Use published tools, which are more reliable. • Read the user manual to learn the proper use and limitations of the tools. • Look for the information that is helpful for the interpretation the results. • Use more than one tools with the same function to cross validate the results.
Four Major Sites as Good Starting Points • National Center for Biotechnology Information (NCBI) http://www.ncbi.nlm.nih.gov/ • European Bioinformatics Institute (EBI) http://www.ebi.ac.uk/Information/ • Biological Information Resource (BIR) http://courses.washington.edu/bioinfo/BIR/ • ExPASy Proteomics Server http://ca.expasy.org/
Biological Information Resources http://courses.washington.edu/bioinfo/BIR/ • Biological Information Resources (BIR) is an excellent portal for the access of commonly used bioinformatics tools.
DNA Analysis Tools • Codon usage http://bioinformatics.org/sms2/codon_usage.html Example: p53 mRNA
DNA Analysis Tools • Restriction site analysis For example: WebCutter http://www.firstmarket.com/cutter/cut2.html Example: p53 mRNA
DNA Analysis Tools • Primer analysis For example: Primer 3 http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi Example: p53 mRNA
Protein Analysis Tools • Statistical Analysis of Protein Sequences http://www.isrec.isb-sib.ch/software/SAPS_form.html Example: p53 protein, insulin receptor
Protein Analysis Tools • Hydrophobicity plot http://expasy.ch/cgi-bin/protscale.pl Example: p53 protein, insulin receptor
Protein Analysis Tools • Isoelectric point and molecular weight Example: p53 protein, insulin receptor
Protein Analysis Tools • Protein Subcellular Localization Prediction http://wolfpsort.seq.cbrc.jp/ Example: p53 protein, insulin receptor
Protein Analysis Tools • Transmembrane region and orientation DAS - http://www.sbc.su.se/~miklos/DAS/ Sosui - http://bp.nuap.nagoya-u.ac.jp/sosui/ Example: p53 protein, insulin receptor