1 / 44

Introduction to Ontologies (for data integration and exchange) SEEK EOT Workshop

Introduction to Ontologies (for data integration and exchange) SEEK EOT Workshop. Shawn Bowers San Diego Supercomputer Center University of California, San Diego (bowers@sdsc.edu). Outline. Ontologies basics Ontologies and data management Benefits of ontologies Constructing ontologies

Télécharger la présentation

Introduction to Ontologies (for data integration and exchange) SEEK EOT Workshop

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Ontologies (for data integration and exchange)SEEK EOT Workshop Shawn Bowers San Diego Supercomputer Center University of California, San Diego (bowers@sdsc.edu)

  2. Outline • Ontologies basics • Ontologies and data management • Benefits of ontologies • Constructing ontologies • Breakout Exercises

  3. Outline • Ontologies basics • Ontologies and data management • Benefits of ontologies • Constructing ontologies • Breakout Exercises

  4. What are ontologies? It depends on who you ask We focus on the data-management view Generally speaking, an ontology specifies a theory (a model) by … defining and relating …generic concepts representing features of the real or abstract world (a domain of interest) [Bunge]

  5. Concepts, Symbols, and Things • Humans use symbols (e.g., words) to communicate • Words are mapped to things indirectly through concepts that denote (refer to) things Concept “Jaguar” Ogden, C. K. & Richards, I. A. 1923. "The Meaning of Meaning." 8th Ed. New York, Harcourt, Brace & World, Inc [Carole Goble, Nigel Shadbolt]

  6. Concepts, Symbols, and Things Symbols and concepts are not precise • The same symbol can stand for multiple things • The same thing can have multiple symbols • Concepts are usually not well-defined Concept “Jaguar” Ogden, C. K. & Richards, I. A. 1923. "The Meaning of Meaning." 8th Ed. New York, Harcourt, Brace & World, Inc [Carole Goble, Nigel Shadbolt]

  7. Concepts, Symbols, and Things An ontology attempts to define and relate specific concepts for certain sets of things via agreed upon symbols Concept “Jaguar” Ogden, C. K. & Richards, I. A. 1923. "The Meaning of Meaning." 8th Ed. New York, Harcourt, Brace & World, Inc

  8. What are ontologies? Ontologies are typically created to: Commit to a definition (a model) of a domain Explicitly state assumptions concerning the definition Have a wide scope (be general) Support exchange and integration of heterogeneous data sources and applications (more on this later…)

  9. What are ontologies? Ontologies may be expressed Informally using natural language (e.g., in philosophy and sometimes biology) Formally using a mathematical language, e.g., first-order logic We focus on formal ontologies To be precise about what the theory proposes

  10. What are ontologies? Formal ontologies can vary in detail Controlled Vocabulary (list of terms) Simple Thesaurus (synonyms) Thesaurus (broader/narrower terms) Classification (class, instance, is-a, maybe part-of) Classification (value, cardinality constraints) Classification (axioms such as disjoint, union, etc.) Classification (general logic constraints)

  11. What are ontologies? Formal ontologies can vary in detail Controlled Vocabulary (list of terms) Simple Thesaurus (synonyms) Thesaurus (broader/narrower terms) Expressiveness Classification (class, instance, is-a, maybe part-of) Classification (value, cardinality constraints) Classification (axioms such as disjoint, union, etc.) Classification (general logic constraints)

  12. What are ontologies? • A conceptualization proposes a theory of the domain of interest • An ontology is a (possibly incomplete) representation of the conceptualization set of all theories that can be expressed in the language ontology Conceptualization (of the theory) [Guarino]

  13. Class, Instance, and Is-a Animal “Every Jaguar is an Animal” x . Jaguar(x)  Animal(x) is-a Jaguar Set of things (instances) denoted by the class Animal Set of things (instances) denoted by the class Jaguar

  14. Properties and Cardinality Constraints Animal is-a eats Carnivore A cardinality constraint might state that carnivores must eat at least one Animal Question: Must Jaguars eat at least one Animal? is-a Jaguar

  15. Value Restrictions Animal is-a eats Carnivore A value restriction for Jaguar might restrict the eats propertyto the specific animals eatenby Jaguars is-a Jaguar

  16. Value Restrictions Jaguars restrict the eats relationship to Marsh Deer, … Animal eats Carnivore Herbivore eats Marsh Deer Jaguar

  17. Value Restrictions Does anyone see a problem with this choice of representation? Animal eats Carnivore Herbivore eats Marsh Deer Jaguar

  18. Value Restrictions These different representations propose the same basic underlying theory Animal eats Herbivore Carnivore JaguarFood Marsh Deer Peccary Jaguar eats

  19. What are ontologies? An (informal) ontology of wine: Wines are potable liquids made by wineries within regions and with specific vintages Wines are characterized by the type of grape they are made with, their color (white, rose, red), their sugar (dry, offdry, or sweet), their body (light, medium, full), and their flavor (delicate, moderate, strong) Sauvignon Blanc, Merlot, Pinot Noir, and Riesling are types of wines [OWL Guide]

  20. Exercise With a partner, take 5 minutes and try to define a “formal” ontology for the wine example • Select two or three classes • Identify some relationships between them • List any constraints (cardinality or value restrictions) that exist between them

  21. What are ontologies? (Philosophy) An ontological theory can answer “ontological” questions • Is Merlot a potable liquid? • Are there wines made of things other than grapes? • How are Pinot Gris and Pinot Noir related? • Are there white wines that are dry, full, and strong made in Napa Valley? We will look at other uses later [Bunge]

  22. Outline • Ontologies basics • Ontologies and data management • Benefits of using ontologies • Constructing ontologies • Breakout Exercises

  23. Ontologies and Data Management Where do ontologies fit within data management architectures? There is no specific answer to this question… However, an ontology is similar to a schema or conceptual model if one exists, but is • Developed independently of a particular application • Probably given in a different language • Inherently more general • Usually not a very good schema (data rep.)

  24. Ontologies and Data Management Ontology use concepts from (explicitly or implicitly) Design Artifact Conceptual Model Conceptual Model Schema Schema Schema Schema  Metadata Data

  25. Outline • Ontologies basics • Ontologies and data management • Benefits of ontologies • Constructing ontologies • Breakout Exercises

  26. Benefits of ontologies Ontologies are often developed within a community and are interdisciplinary Explicitly capture “knowledge” about a domain • Standard terms (symbols) for metadata values and schema design • Enables advanced searching techniques (via reasoning) • Enables exchange and integration

  27. Benefits of ontologies Ontologies for metadata keywords {sonoma county, wine} {cabernet sauvignon, sonoma county, …} {medium, red, dry, …}

  28. Benefits of ontologies Ontologies for metadata keywords Find information about dry californiared wines {sonoma region, wine} {cabernet sauvignon, sonoma region, …} {medium, red, dry, …} We use the ontology to “expand” the query, e.g., that cabernet sauvignon is red and dry; sonoma valley is in california

  29. Benefits of ontologies Dataset (wines by regions) What regional characteristics produce the best-selling wines? Dataset (wine sales) Integrate Analysis Dataset (region characteristics) Integration can be extremely complex due to structural (schema and values) and semantic (ontological) differences Ontologies can help!

  30. Benefits of ontologies Dataset (wines by regions) What regional characteristics produce the best-selling wines? Dataset (wine sales) Integrate Analysis Dataset (region characteristics) Registering datasets with ontologies Map structure (schema) to concepts Map data to classes/instances (various ways to do this…) Provides a uniform view of disparate sources

  31. Outline • Ontologies basics • Ontologies and data management • Benefits of ontologies • Constructing ontologies • Breakout Exercises

  32. Constructing ontologies Various Web-based standards are emerging for defining ontologies (each exploit XML) XML Schema • Mainly for defining “vocabularies” and less-formal ontologies (term-based is-a, some constraints) • Mainly a structural/schema representation • Topic Maps • For advanced thesauri, subject indexes • RDF/RDFS/OWL • Formal ontologies based on description logics and semantic networks

  33. Resource Description Framework (RDF) Simple data model that consists of • Resources (uniquely identified via URIs) • Properties • Values (resources or character strings) Data organized into triples (subject, property, value) locatedIn CaliforniaRegion SonomaRegion Property (Resource) Subject (Resource) Value (Resource) locatedIn(SonomaRegion, California)

  34. RDF Schema Adds a set of pre-defined properties to define classes and properties Allows instances to be connected to classes Sub-class and sub-property (is-a) relationships Region is a class locatedIn is a property locatedIn connects Regions locatedIn Region rdf:type rdf:type locatedIn CaliforniaRegion SonomaRegion

  35. OWL Adds additional pre-defined properties to further constrain an ontology (See http://www.w3.org/TR/owl-guide/) Note, RDF(S) and OWL use XML Some graphic tools exist (e.g., Protégé) <owl:Classrdf:ID="Vintage"> <rdfs:subClassOf> <owl:Restriction> <owl:onPropertyrdf:resource="#hasVintageYear"/> <owl:cardinality>1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class>

  36. OWL Adds additional pre-defined properties to further constrain an ontology (See http://www.w3.org/TR/owl-guide/) Note, RDF(S) and OWL use XML Some graphic tools exist (e.g., Protégé) A Vintage is a class that is a subclass of an unnamed class whose instances always have one hasVintageYearproperty. <owl:Classrdf:ID="Vintage"> <rdfs:subClassOf> <owl:Restriction> <owl:onPropertyrdf:resource="#hasVintageYear"/> <owl:cardinality>1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class>

  37. Protégé

  38. Description Logic A language and syntax for describing “concept” logics • Concept names C (denote sets of instances) • Class definitions D (denote sets of instances) • Subclass definition C ⊑ D • Equivalence definition C  D • Definition constructors • intersection D ⊓ D • union D ⊔ D • Property existence hasProp.D • Property restriction hasProp.D • Cardinality =1 hasProp.D, >1 hasProp.D, <2 hasProp.D

  39. Description Logic Wine ⊑ PotableLiquid ⊔ hasColor.{Red, Rose, White) The class Wine is a sub-class of PotableLiquids that have at least one (exists one) hasColor property whose values are either Red, Rose, or White WhiteWine  Wine ⊓ hasColor.{White) WhiteWines are exactly Wines whose color is White WhiteBurgandy ⊑ WhiteWine ⊓ Burgandy The set of WhiteBurgandy wines is a subset of the set of WhiteWines intersected with Burgandy wines SauvignonBlanc ⊑ WhiteWine ⊓ =1 madeFromGrape.SauvignonBlancGrape

  40. Constructing Ontologies In general, creating an ontology is hard • Requires general agreement and understanding of a domain • Requires a clear, concise, and unambiguous definition • May invoke controversy • Is a hard data-modeling problem (complex constraints, broad domain)

  41. Outline • Ontologies basics • Ontologies and data management • Benefits of ontologies • Constructing ontologies • Breakout Exercises

  42. Breakout Exercises Divide into the same groups as yesterday Develop an ontology for the domain you worked on: • Define relevant concepts • Define relationships among concepts • If you have time, work on simple constraints (cardinality, value restrictions) Capture (on paper, or in PPT if you feel ambitious) your ontology in whatever way makes sense to you (e.g., as circle-line drawings or as list of terms and properties). What assumptions did you make in creating your ontology? If you have time, develop a scenario for your ontology in terms of your workflow. For example, to show how your ontology could help integration or query.

  43. Some References Mario Bunge. Treatise on Basic Philosophy, Vol. 3, Ontology I: The Furniture of the World. D. Reidel Publishing Company, 1977. Nicola Guarino. Formal ontology and information systems. In Proc. of Formal Ontology in Information Systems, IOS Press, pp. 3-15, 1998. Thomas R. Gruber. Toward principles for the design of ontologies used for knowledge sharing. In Formal Ontology in Conceptual Analysis and Knowledge Representation, Kluwer Academic Publishers, 1993. Jeffrey Parsons and Yair Wand. Emancipating instances from the tyranny of classes in information modeling. In ACM Transactions on Database Systems, 25(2):228-268, 2000.

  44. Some References Michael Smith, Chris Welty, and Deborah McGuinness. OWL Web Ontology Language Guide. W3C Proposed Recommendation. (http://www.w3.org/TR/owl-guide/). Includes Wine Ontology. Protégé. Stanford Medical Informatics. http://protege.stanford.edu/index.html. Freely available. Lots of plug-ins.

More Related