Automated Annotation and Probe Update System for Cancer Center Microarrays
This system annotates microarray probes using databases like Entrez Gene and UniGene for cancer research. It automatically updates annotations, allows web-based and API queries, and aims for caGRID compliance. Utilizes caCORE SDK for code generation. Data sources include UniSTS, Gene Ontology, dbSNP, and more. Limitations include slow PubMed curation, non-optimized complex searches, and lack of meta-data storage in caArray.
Automated Annotation and Probe Update System for Cancer Center Microarrays
E N D
Presentation Transcript
caFE Server Siteman Cancer Center
Introduction • Annotate probes on microarrays using publicly available biomedical databases • Automatically update annotations on a regular basis • Web-based and API based query search • Uses caCORE SDK (version 1.0.3) generated code • Working towards caGRID compliance
Data Sources • Entrez Gene • UniGene • Gene Ontology • dbSNP • UniSTS • HomoloGene • PubMed
Annotation Interlinks Microarrays caArray Custom Entrez Gene UniGene GenBank Gene ID UniGene ClusterID Accession Number HomoloGene dbSNP PubMed Gene Ontology Orthologus genes RefSNP ID Pubmed relationship GO terms
Limitations • Initial run of PubMed data curation takes 2-3 months • APIs generated using caCORE SDK • Not optimized for complex searches • Performing searches that involves complex computations. • For example, searching for literature correlation with depth 2 or more • Meta-information of caArray is not stored