1 / 6

Knowledge Integration for Gene Target Selection

GeneRanker is a powerful tool designed to prioritize genes based on their associations with specific diseases and phenotypes by leveraging knowledge from biomedical literature, curated protein-protein interaction (PPI) databases, and protein network topology. Users can input a disease or biological process, and GeneRanker extracts associated genes from literature, analyzes their interactions, and ranks them based on co-occurrences and statistical significance. An online application, GeneRanker demonstrates significantly improved precision over traditional NLP methods and has shown promising results in empirical validation, particularly within brain tumor research.

tanek-vega
Télécharger la présentation

Knowledge Integration for Gene Target Selection

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact: graciela.gonzalez@asu.edu

  2. GeneRanker in a Nutshell • Integration of knowledge from • biomedical literature • curated PPI databases, and • protein network topology • Seeks to prioritize lists of genes on their association to specific diseases and phenotypes [1], • Such associations may or may not have been published (thus, not text mining) [1] Gonzalez G, Uribe JC, Tari L, Brophy C, Baral C. Mining Gene-Disease relationships from Biomedical Literature: Incorporating Interactions, Connectivity, Confidence, and Context Measures. Pacific Symposium in Biocomputing; 2007; Maui, Hawaii; 2007.

  3. GeneRanker Interface • The user types a disease or biological process to be searched. • Genes found to be in association to the disease are extracted from the literature. • Protein-protein interactions involving those genes are then pulled from the literature & curated sources • The protein network is built and each gene ranked

  4. GeneRanker Interface • Each gene is scored and can be annotated (count of co-occurrences and statistical representation) • Collaboration: Application of GeneRanker to a biological context, with Dr. Michael Berens, Director of the Brain Tumor Unit at the Translational Genomics Institute (TGen). • GeneRanker is available as an online application at http://www.generanker.org.

  5. Evaluation of GeneRanker • Contextual (PubMed search) based shows > 20% jump in precision over NLP based extraction. • Synthetic network results show AUC > 0.984 • Empirical validation against a glioma dataset shows consistent results (118 vs 22 differentially expressed probes from top vs bottom of list)

  6. Complementary Work • CBioC: www.cbioc.org shows PPIs, gene-disease, and gene-bioprocess associations extracted from abstracts • BANNER: sourceforge.banner.org (presenting a poster on this one). An open source entity recognizer available now. • Gene normalization: a similar open source system soon to be available.

More Related