İ rem Ar ıkan , Srikanta Bedathur, Klaus Berberich

Time Will Tell: Leveraging Temporal Expressions in IR İrem Arıkan, Srikanta Bedathur, Klaus Berberich

Motivation • Documents contain temporal information in the form of temporal expressions

Motivation • Users have temporal information needs • Query: Prime Minister United Kingdom2000

Motivation • Users have temporal information needs • Query: Prime Minister United Kingdom2000 PROBLEM Traditional information retrieval systems do not exploit the temporal content in documents Temporal expressions are more than common terms

Motivation • Users have temporal information needs • Query: Prime Minister United Kingdom2000 PROBLEM Traditional information retrieval systems do not exploit the temporal content in documents Temporal expressions are more than common terms OUR APPROACH Integratestemporal dimensioninto a language model basedretrievalframework

Outline Motivation Model Our Approach Experimental Evaluation

Document Model • Documentd = { dtext,dtemp} • dtext: a bag of textual terms • dtemp: a bag of temporal expressions

Document Model • Documentd = { dtext,dtemp} • dtext: a bag of textual terms • dtemp: a bag of temporal expressions • a temporal expression is considered as a time interval T = [begin,end ] T [ ] 0 begin end

Query Model • Query q= { qtext,qtemp} • qtext: set of textual terms • qtemp: set of temporal expressions • Prime Minister United Kingdom 2000 qtext qtemp

Outline • Motivation • Model • Our Approach • Filtering Approach • Weighted Approach • Experimental Evaluation

Our Baseline: Ponte and Croft‘s Model (LM) • Each document has a language model associated • Query is a random process • Documents are ranked according to the likelihood that the query would be generated by the language model estimated for each document

Filtering Approach (LMF) • Idea: Discard all documents that do not contain any temporal expression relevant to the user‘s query t

Filtering Approach • Idea: Discard all documents that do not contain any temporal expression relevant to the user‘s query • our definition of temporal relevance • only relevant, if it overlaps with a temporal expression from the query 2 May 1997 – 27 June 2007 28 Nov 1990 - 2 May 1997 2000 query t begin end

Filtering Approach • Idea: Discard all documents that do not contain any relevant temporal expressions to user‘s query • our definition of temporal relevance • only relevant, if it overlaps with a temporal expression from the query • Relevant 2 May 1997 – 27 June 2007 X Irrelevant 28 Nov 1990 - 2 May 1997 2000 query t begin end

Filtering Approach • Problem:has a black-and-white view of the world • Does not take into account • how many relevant temporal expressions a document contains • how closely they match the temporal expressions specified in the user‘s query

Filtering Approach • Problem:has a black-and-white view of the world • Does not take into account • how many relevant temporal expressions a document contains • how closely they match the temporal expressions specified in the user‘s query • query: 1980 – 1990 1980 – 1989 is more relevant than 23 March 1984

Weighted Approach (LMW) • Idea: Assign higher relevance to a document, if it contains more temporal expressions that match more closely to the temporal expressions from the user‘s query

Weighted Approach • Idea: Assign higher relevance to a document, if it contains more temporal expressions that match more closely to the temporal expressions from the user‘s query • We assume that qtext and qtemp are produced independently

Weighted Approach • Idea: Assign higher relevance to a document, if it contains more temporal expressions that match more closely to the temporal expressions from the user‘s query • We assume that qtext and qtemp are produced independently • Temporal expressions occur independently

Weighted Approach • Each temporal expression T in d is a sample from a different generative model

Weighted Approach • Each temporal expression T in d is a sample from a different generative model • Generating a temporal expression Q = [qBegin, qEnd] given dtemp • draw a single temporal expression T=[dBegin, dEnd] at uniform from d • generate Q by the generative model that is associated with T

Weighted Approach • Each temporal expression T in d is a sample from a different generative model • Generating a temporal expression Q = [qBegin, qEnd] given dtemp • draw a single temporal expression T=[dBegin, dEnd] at uniform from d • generate Q by the generative model that is associated with T • The likelihood of generating Q by the set of generative models that produced dtemp

Weighted Approach • Generate Q=[qBegin, qEnd]from the query by the generative model that is associated with T = [dBegin, dEnd] from a document P(qBegin) P(qEnd|qBegin) dBegin-α(dEnd-dBegin) qBegin dBegin dEnd qBegin qEnd dEnd dEnd+α(dEnd-dbegin)

Weighted Approach • Generate Q=[qBegin, qEnd]from the query by the generative model that is associated with T = [dBegin, dEnd] from a document P(qBegin) P(qEnd|qBegin) dBegin -α(dEnd-dBegin) qBegin dBegin dEnd qBegin qEnd dEnd dEnd +α(dEnd-dbegin) produces only relevant temporal expressions of T P(Q|T) gets smaller as the length of their overlap decreases

Outline Motivation Model Our Approach Experimental Evaluation

Experimental Evaluation Dataset HTML snapshot of English Wikipedia from May 2007 containing ~ 2M documents Implementation • Terrier Information Retrieval Platform: • provides an implementation of Ponte & Croft's approach • LMF, LMW • Java + MySQL • A set of regular expressions for extracting temporal information

Experimental Evaluation Spanish painter 18th century Anectodal query results - 1

Experimental Evaluation Sea Battle 1650 - 1670 Anectodal query results - 2

Experimental Evaluation User Study • 20 queries • Pooling top-10 results returned by the three methods • Relevance assessment by 15 users • highly relevant: 2 • marginally relevant: 1 • irrelevant: 0 • NDCG as a measure of effectiveness

Experimental Evaluation

Thank you! Questions?

Conclusion • Documents are rich of temporal expressions, but existing retrieval models are ignorant of their inherent semantics • Our work proposes two methods addressing this problem • Initial experimental evidence shows that our methods improve retrieval effectiveness for temporal information needs

Experimental Evaluation

b’ e e+α(e-b) Weighted Approach • generative model associated with T =[b,e] P(b’) P(e’) b e b-α(e-b) only generates overlapping intervals of T P(b’,e’) ~ |overlap|

Our Baseline: Ponte and Croft‘s Model (LM) • Query likelihood: the likelihood that a query q and a document d is generated by the same language model • depends on the term frequency of query words in the document and their collection frequency

İ rem Ar ıkan , Srikanta Bedathur, Klaus Berberich