320 likes | 454 Vues
This project focuses on developing a robust Knowledge Discovery System (KDS) that integrates bioinformatics data to elucidate the complex relationships between genes, proteins, miRNA, and their roles in diseases and therapeutics, particularly in Traditional Chinese Medicine (TCM). By utilizing analytical pipelines and databases such as KEGG and DrugBank, we aim to explore tissue-specific gene expressions and enhance drug discovery through data mining, integration, and translation, thereby fostering innovative approaches in diagnosing and treating diseases.
E N D
Marcel Proust The real act of discovery consists not in finding new landsbut in seeing with new eyes - Marcel Proust Eying a picture is better than seeing a thousand words
Uncover the hidden links Bioinformatics Data Knowledge Gene Protein miRNA SM Drug TCM Disease Symptom
HT Analytical pipeline, database, algorithm, prediction/analytical tools Drug screening, TCM, Target ID, Diagnosis, Prognosis Omics, Annotation, Literature, Structure, Medical records/images Mining Clinical & HT data, Sequence, Literature etc. Focus Data Mining Data Integration Platform Building Translational Research
Data Sources • KEGG • DrugBank • Locate • MGI • PubMed • iHOP • GO • OMIM Others Literature Structure HT data • NCBI - GEO • EBI – ArrayExpress • NextGen Sequencing • Connectivity Map • GWAS/SNP/aCGH • GenBank • Transfac • miRBase • InterPro • TarFisDock
A Platform for Tissue-Specific Genes and beyond
Gene X Gene Y Expression Profiling of TSG Tissue Specific Tissue Selective Tissue Types
TSG Mining ~130 Tissues ~4000 Samples GeneLogic + Novartis
liver COXPRESdb Novatis Human tissue compendium liver
Drug-TSG Disease-TSG Disease-Drug Making the connection
symptom Prescription Component TCM
Functional Modules Batch View Multiple View Tissue View Gene View Title in here
Discovery: From Diseases to Drug p < 1E-5 p < 1E-5 Enrichment (p < 1E-5): immune response inflammatory response Cytokine-cytokine interaction Toll-like receptor signaling
Discovery: From Diseases to Drug p < 0.05 TNF inhibitor: Etanercept Adalimumab Ortiz P, Bissada NFet al Periodontal therapy reduces the severity of active rheumatoid arthritis in patients treated with or without tumor necrosis factor inhibitors. J Periodontol 80: 535–540, 2009.
Discovery:From Drug to Diseases p < 1E-5 Simvastatin: hypercholesterolemia cardiovascular disease
Discovery:From Drug to Diseases p < 0.05 Bruner-Tran KL, Osteen KG, Duleba AJ. Simvastatin protects against the development of endometriosis in a nude mouse model. J Clin Endocrinol Metab 94: 2489–2494, 2009.
Literature mining Gene
Disease Gene Drug 4324 – drugbank 562 – C-Map 544 – Compound 1305+ - TCM 3960 – drugbank 17119 – non-TSG 2741 – gene set 611 – miRNA 15188 – MeSH+OMIM 86 – TCM symptom 8703 – mammalian phenotype Pathway KDS 880 – KEGG + Reactome 38611 – gene ~ pathway Still growing Localiza- tion 3687 – TSG-related 52532 – gene ~ Go CC
TFBS PFM
TFBS Prediction 研究策略
TFBS Prediction for TSG TFBS PFM CRM1 . . . . . . . . . CRMk Tissue types CRM: T1 T2 T3
Future Plan • More data collection & integration • Multiple verticals & prioritization • Mining capability & feature enrichment • Hypothesis generation & validation • Translational use • Collaboration
Thanks! Xiaoqin Yang Xia Chen Guiping Wang Yun Ye Xuezhong Zhou NCBI KEGG Reactome MGI Locate Drugbank GO …… NSFC GD EA SMU