1 / 6

Towards Seamless Integration and Querying of Biological Data

Explore the challenges of integrating and querying distributed biological data sources, aiming to improve information retrieval effectiveness. Discussing current limitations and proposing potential solutions for seamless data integration. Presenting modes of information integration and ongoing work on matching XML schema and Java objects.

xanthe
Télécharger la présentation

Towards Seamless Integration and Querying of Biological Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Towards Seamless Integration and Querying of Biological Data Estella T. Pham – Master’s Student in CS, UML Dr. Kajal Claypool – Professor, UML

  2. Topics of Discussion • The BIG problem • A quick background information • Our long-term goal • My current work

  3. The BIG problem • Distributed, heterogeneous data sources. • Database systems ( DBMSs, semantic heterogenity ) • Operating systems ( files ) • Hardware • How to obtain most of the relevant information on one particular subject effectively when the pieces of the information are in different databases ? For example, find protein A structure, its folding properties and propensities, amino acid sequence, DNA sequence, organization and expression? • Why are the current data integration tools inadequate?

  4. A Quick Background Information • 3 modes of information integration • Federated databases ( n databases ) • Warehousing ( n databases, a warehouse ) • mediation ( n databases, n wrappers, a mediator )

  5. Our Long-Term Goal:

  6. My Current Work • “U00096.gbk” and “ecoli.txt” ( GenBank and Swiss-Prot ) XML ‘Schema Java objects Schema matching

More Related