Personalized Web Search by Mapping User Queries to Categories

Personalized Web Search by Mapping User Queries to Categories Fang Liu Presented by Jing Zhang CS491CXZ February 26, 2004

Background • Different users use same key words • such as “apple” can be fruit or computer • Category hierarchy can’t fit in one screen • Users are impatient to identify hierarchy before submit query

Central problem • How to personalize web search by mapping user queries to categories?

Key ideas of this paper • Build profile (both user and general profile) on search history • Deduce appropriate categories based on user’s profile • Associate query key words with category • Return top 3 categories to user each time

Methods to map key words to category • Use both user profile and general profile • Use user profile only • Use general profile only

Build user profiles (1) • Tree representation of search record

Build user profiles (2) Predefined Input Output

Build general profile • First two level of ODP category hierarchy (619 categories) Row1 Row2

Algorithms to learn profiles • Linear Least Squares Fit (LLST) • Rocchio-based Algorithm • K-Nearest Neighbor (kNN) • Adaptive Learning

LLSF Singular Value Decomposition

Pseudo-LLSF (pLLSF)

Ricchio-based Algorithm (bRocchio) • where m is the number of documents in DT , Niis the number of documents that are related to the i-th category, and M(i,j) is the average weight of the j-th term in all documents that are related to the i-th category.

kNN • where q is the query; cjis the j-th category; diis a document among the k nearest neighbors of q and the i-th row vector in DT , Cos(q, di) is the cosine similarity between q and di , and DC(i,j) denotes whether diis related to the j-th category.

Adaptive Learning (aRocchio)

Data sets for the experiment

Performance Evaluation • where n is the number of related categories to the query, scoreci is the score of a related category ci that is ranked among the top 3, rankciis the rank of ci and ideal_rankci is the highest possible rank for ci

Experiment Results (1) • Batch Learning Method

Experiment Results (2) • Comparison of Mapping methods

Experiment Results (3) • Adaptive Learning (aRocchio)

Discussions • Why user 1 have lowest accuracy and user 3 have highest accuracy for batch learning method?

Personalized Web Search by Mapping User Queries to Categories

Personalized Web Search by Mapping User Queries to Categories

Presentation Transcript

Personalized Ontologies for Web Search and Caching

Intermodeling , Queries and Kleisli categories

Personalized Ranking Model Adaptation for Web Search

Personalized Image Search

Reducing Latency of Web Search Queries

Personalized Search

Personalized Search Based on User Search Histories

Personalized Web Search using Clickthrough History

User Experience Issues in Web Search

Clustering User Queries of a Search Engine

Personalized Search

Clustering Personalized Web Search Results

Personalized Search

Improving Web Search Ranking by Incorporating User Behavior Information

Learning User Clicks in Web Search

Scaling Personalized Web Search

Implicit User Modeling for Personalized Search

“Personalized Search”

Personalized Web Search using Clickthrough History

Personalized Web Search Uncommon Responses to Common Queries

Personalized Search