1 / 13

GraphSig : Mining Significant Substructures in Compound Libraries

GraphSig : Mining Significant Substructures in Compound Libraries. GraphSig. Provides powerful insight into structure-activity relationship in the form of significant substructures . Input: Diverse background database Libraries of compounds with specific activity Output:

fynn
Télécharger la présentation

GraphSig : Mining Significant Substructures in Compound Libraries

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GraphSig: Mining Significant Substructures in Compound Libraries

  2. GraphSig Provides powerful insight into structure-activity relationship in the form of significant substructures • Input: • Diverse background database • Libraries of compounds with specific activity • Output: • Prioritized list of significant substructures

  3. Applications of GraphSig • What makes compounds active against a target? • Develop pharmacophore models based on results • What substructures impart specific biological activity (e.g., BBB permeability)? • Screen compounds for similar activity

  4. Key Benefits • Only automatic tool that identifies significant substructures • No other tools can mine structure-activity relationship based on topology • Unique use of a background statistical model derived from diverse compound libraries. • Scales to large databases • Two orders of magnitude faster than alternatives

  5. Validation Studies • Briem & Lessel dataset • Find substructures specific to activity classes, and their significance as well as support. • hERG dataset • Find substructures that are significant for toxicity. • Permeability Datasets (Oral bioavailability, BBB barrier) • Find substructures that are significant for permeability.

  6. 49 5HT3 Re-uptake Inhibitors 40 ACE Inhibitors 111 HMG-CoA Inhibitors 134 PAF Antagonists 49 TXA2 Antagonists 574 "Inactives" Briem and Lessel (BL) Dataset • 957 compound subset of MDL Drug Data Report (MDDR) classified by biological activity . These are summarized in the table below.

  7. Interesting Substructures from the BL Dataset Found in PAF Antagonists and ACE inhibitors only Found in PAF Antagonists and 5HT3 inhibitors only Found in all ACE Inhibitors Found in HMG-CoA inhibitors only Found in PAF and TXA Antagonists and 5HT3 inhibitors only Found in PAF Antagonists and TXA inhibitors only Found in PAF Antagonists only

  8. hERG Dataset • This dataset consists of compounds known to be active or inactive as hERG blockers. Here are some interesting substructures that we found on applying GraphSig to this dataset. Significant substructures contained in compounds that are not hERG blockers Significant substructures contained only in hERG blockers

  9. BBB Permeability • BBBDataSet – 1593 compounds labeled as BBB permeable or not • Significant substructures belong exclusively to either the permeable or non-permeable set, not both. Hence, substructures are representative of each set. • The frequency of occurrence of these substructures is very low (0.2-5%). Significant substructures would not be found on the basis of frequency. • Classification Accuracy of 82% with 5-fold cross validation

  10. Connections to Other Acelot Tools • Once a substructure is identified, use SimFinder for similarity searching. • Perform 3-d alignment of compounds that contain the specific structure and find well-aligned pharmacophoric features

  11. Technical Details: Mining Feature Vector Mining RWR on graphs Graph DB Significant feature vectors Feature vectors high frequency threshold Significant subgraphs Sets of subgraphs Frequent subgraph mining

  12. Technical Details: in silicoPrediction

  13. References • Huahai He; Ambuj K. Singh; GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space. Proceedings of the 6th IEEE International Conference on Data Mining (ICDM), December, 2006, doi:10.1109/ICDM.2006.79 • SayanRanu and Ambuj Singh, GraphSig: A Scalable Approach to Mining Significant Subgraphs in Large Graph Databases in 25th International Conference on Data Engineering (ICDE), 2009. • SayanRanu; Ambuj Singh; Mining Statistically Significant Molecular Substructures for Efficient Molecular Classification., J. Chem. Inf. Model., 2009, 49(11), pp 2537–2550 DOI : 10.1021/ci900035z • Hans Briem and Uta F Lessel, Perspect. Drug Discov. Dec. 2000 (20), 231-244.

More Related