Discovering Query Context using Concept Hierarchy

Mukesh Mohania, IBM Research – India

Discovering Query Context using Concept Hierarchy for Information Integration

Motivation
• We keep refining the search query keywords until we get the desired results.
• That is, the onus of specifying an appropriate set of keywords (called the query context) remains with the user.
• This is a limitation, since the user might not be aware of the overall context at the point of submitting the query.
• This is inevitable when the search query is submitted from a mobile device, particularly when the data source is not available for searching all the time or the bandwidth is limited.
• The problem is how to derive the full context of a query (as a set of keywords) without sending the search query to the back-end data source.
• We propose to store concept hierarchies on the mobile device and use them to discover the query context.
• The advantage of discovering the query context is that it can then be used to retrieve all documents, from enterprise content repositories and/or the external web, that are highly relevant to the query results.

CRUX Main Idea
• Specify the information need in terms of SQL over the structured database.
• Specify additional information needs using “directives” (optional).
• Automatically synthesize the “context” of the SQL query from its result as well as from related information present elsewhere in the database (found using the known semantic dependencies in the structured data).
• Considers the foreign-key dependencies in either direction (PK → FK, FK → PK).
• Automatically determines the most promising subset of the related information to explore.
• Use this context and the directives to retrieve the relevant unstructured data.
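
The context-synthesis step can be illustrated with a minimal Python sketch. It is not the CRUX implementation: the toy tables, the column names, the employee and group names, and the `max_hops` bound are all hypothetical. In particular, a fixed hop limit stands in here for whatever method CRUX uses to pick "the most promising subset" of related information, which the slides do not describe.

```python
from collections import deque

# Hypothetical toy database: table name -> list of rows (dicts).
TABLES = {
    "PROJECTS": [{"proj_id": 1, "name": "CORTEX",
                  "description": "query context discovery"}],
    "PROJEMPS": [{"proj_id": 1, "emp_id": 10}, {"proj_id": 1, "emp_id": 11}],
    "EMPLOYEES": [{"emp_id": 10, "name": "Asha", "grp_id": 100},
                  {"emp_id": 11, "name": "Ravi", "grp_id": 101},
                  {"emp_id": 12, "name": "Lee", "grp_id": 101}],
    "GROUPS": [{"grp_id": 100, "name": "Databases"},
               {"grp_id": 101, "name": "Search"}],
}

# Foreign keys as (child_table, child_column, parent_table, parent_column).
FOREIGN_KEYS = [
    ("PROJEMPS", "proj_id", "PROJECTS", "proj_id"),
    ("PROJEMPS", "emp_id", "EMPLOYEES", "emp_id"),
    ("EMPLOYEES", "grp_id", "GROUPS", "grp_id"),
]


def neighbours(table, row):
    """Yield (table, row) pairs one foreign-key edge away, in either direction."""
    for child, ccol, parent, pcol in FOREIGN_KEYS:
        if table == child:  # follow FK -> PK
            for r in TABLES[parent]:
                if r[pcol] == row[ccol]:
                    yield parent, r
        if table == parent:  # follow PK -> FK
            for r in TABLES[child]:
                if r[ccol] == row[pcol]:
                    yield child, r


def synthesize_context(start_table, start_rows, max_hops=3):
    """Collect string values from the seed rows and from rows within max_hops edges."""
    seen = set()
    keywords = set()
    queue = deque((start_table, row, 0) for row in start_rows)
    while queue:
        table, row, depth = queue.popleft()
        key = (table, tuple(sorted(row.items())))
        if key in seen:
            continue
        seen.add(key)
        keywords.update(v for v in row.values() if isinstance(v, str))
        if depth < max_hops:  # assumption: a hop bound limits the exploration
            for next_table, next_row in neighbours(table, row):
                queue.append((next_table, next_row, depth + 1))
    return keywords


seed = [r for r in TABLES["PROJECTS"] if r["name"] == "CORTEX"]
context = synthesize_context("PROJECTS", seed, max_hops=3)
```

With a bound of three hops the sketch reaches the project's employees and their groups but not the other members of those groups; raising the bound widens the context at the cost of more loosely related keywords.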

FILLED IN BY THE USER

CORE QUERY
    Select GROUPS.Name, PROJECTS.Description
    From PROJECTS, PROJEMPS, EMPLOYEES, GROUPS
    Where PROJECTS.Proj# = PROJEMPS.Proj#
      And PROJEMPS.Emp# = EMPLOYEES.Emp#
      And EMPLOYEES.Grp# = GROUPS.Grp#
      And PROJECTS.Name = "CORTEX"

SEARCH DIRECTIVES
    Select ARTICLES.id
    From ARTICLES
    Where ARTICLES.contains("Report AND SET-OR($GROUPS.Name)")
      And ARTICLES.year > 2001
    ORDER BY RELEVANCE USING THRESHOLD = 0.8

Core SQL Query and Search Directives

Core SQL Query
    Select GROUPS.Name, PROJECTS.Description
    From PROJECTS, PROJEMPS, EMPLOYEES, GROUPS
    Where PROJECTS.Proj# = PROJEMPS.Proj#
      And PROJEMPS.Emp# = EMPLOYEES.Emp#
      And EMPLOYEES.Grp# = GROUPS.Grp#
      And PROJECTS.Name = "CORTEX"

Search Directives (optional)
    Select ARTICLES.id
    From ARTICLES
    Where ARTICLES.contains("Report AND SET-OR($GROUPS.Name)")
      And ARTICLES.year > 2001
    ORDER BY RELEVANCE USING THRESHOLD = 0.8

• ARTICLES.contains(...): restrict to documents containing “Report” and at least one group name.
• ARTICLES.year > 2001: restrict to documents published after 2001.
• ORDER BY RELEVANCE USING THRESHOLD = 0.8: order by relevance to the structured query, with a cutoff threshold of 0.8.
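
One way to read the search directives is as a template that is filled in from the core query's result and then applied as filters on the document collection. The Python sketch below illustrates that reading only. The function names, the sample rows and articles, and the precomputed `relevance` scores are hypothetical, and treating the threshold as inclusive (`>=`) is an assumption, since the slides do not say how relevance is computed or compared.

```python
import re


def expand_set_or(pattern, result_rows):
    """Replace SET-OR($TABLE.Column) with an OR-group of that column's distinct values."""
    def replace(match):
        column = match.group(1)
        values = sorted({row[column] for row in result_rows})
        return "(" + " OR ".join(f'"{v}"' for v in values) + ")"

    return re.sub(r"SET-OR\(\$([A-Za-z_]+\.[A-Za-z_#]+)\)", replace, pattern)


def apply_directives(articles, after_year=2001, threshold=0.8):
    """Keep articles newer than after_year with relevance >= threshold, best first."""
    kept = [a for a in articles
            if a["year"] > after_year and a["relevance"] >= threshold]
    return sorted(kept, key=lambda a: a["relevance"], reverse=True)


# Hypothetical result of the core SQL query.
core_result = [
    {"GROUPS.Name": "Search", "PROJECTS.Description": "query context discovery"},
    {"GROUPS.Name": "Databases", "PROJECTS.Description": "query context discovery"},
]
keyword_query = expand_set_or("Report AND SET-OR($GROUPS.Name)", core_result)

# Hypothetical articles with relevance scores assumed to come from a search engine.
articles = [
    {"id": "a1", "year": 2003, "relevance": 0.85},
    {"id": "a2", "year": 2000, "relevance": 0.95},  # dropped: not after 2001
    {"id": "a3", "year": 2005, "relevance": 0.60},  # dropped: below threshold
    {"id": "a4", "year": 2004, "relevance": 0.92},
]
ranked_ids = [a["id"] for a in apply_directives(articles)]
```

Here `keyword_query` becomes `Report AND ("Databases" OR "Search")`, and only articles `a4` and `a1` survive the year and threshold filters, in that order.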

Concept Hierarchies (CH)
• CHs are defined over relational tables using FK-PK relationships (not all attributes are considered; which ones depends on the application or scenario).
• A CH is a set of predefined hierarchical relationships that map lower-level concepts to their higher-level counterparts.
• For example, {tennis, rugby, hockey, football} can be generalized to the higher-level concept “sports”.
• CHs can be constructed in many ways.
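
A concept hierarchy of this kind can be held on the device as a simple child-to-parent map and used to widen a query's keywords without contacting the back end. The Python sketch below is one possible representation, not the authors' data structure. The sports entries come from the example above; the extra “recreation” level, the function names, and the `levels` parameter are hypothetical.

```python
# Child -> parent map. The sports entries follow the example above;
# "recreation" is a hypothetical extra level added to show multi-level lookup.
PARENT = {
    "tennis": "sports",
    "rugby": "sports",
    "hockey": "sports",
    "football": "sports",
    "sports": "recreation",
}


def ancestors(concept, parent=PARENT):
    """Return the chain of higher-level concepts above `concept`, nearest first.

    Assumes the hierarchy is acyclic.
    """
    chain = []
    while concept in parent:
        concept = parent[concept]
        chain.append(concept)
    return chain


def query_context(keywords, parent=PARENT, levels=1):
    """Widen keywords with up to `levels` higher-level concepts each."""
    context = set()
    for keyword in keywords:
        term = keyword.lower()
        context.add(term)
        context.update(ancestors(term, parent)[:levels])
    return context


one_level = query_context(["Tennis", "visa"])
two_levels = query_context(["Tennis", "visa"], levels=2)
```

Keywords that are not in the hierarchy, such as "visa" here, pass through unchanged, so the derived context is never smaller than the original query.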

Scenarios
• Health: Patient-specific report and medical articles
• Manufacturing: Defect statistics and engineering specifications
• Marketing: Customer transaction history and marketing documents
• Travel: Traveler itinerary and promotional flyers, travel advisories
• Management: Employee records and status reports

Research Issues
• Limitation: The context is a set of keywords.
  – Could the context be semantics-based?
• Limitation: The context of a search query is determined by the concept hierarchy.
  – Other avenues include the previously retrieved results, the query workload, and the user profile, if given.
• Limitation: The context of a search query is mapped to one or more categories.
  – More precise context-based document retrieval may be helpful.
• Limitation: The documents and search query results are returned as unordered sets.
  – For usability reasons, efficient ranking algorithms would be needed to order the returned results with respect to the input query context.
• Limitation: The entire query result is returned in addition to the documents in response to a query.
  – (HCI issue) The query results need to be presented in a browsable manner, or may even be presented as smart tags dynamically attached to the documents.
