Computing Text Semantic Relatedness using Hypertext Encyclopedia

Presenter: Bo-Sheng Wang • Authors: Majid Yazdani, Andrei Popescu-Belis • AI, 2013 • Computing text semantic relatedness using the contents and links of a hypertext encyclopedia

Outline • Motivation • Objectives • Methodology • Empirical analyses • Experiments • Conclusions • Comments

Motivation • Existing measures of semantic relatedness based on lexical overlap, though widely used, are of little help when text similarity is not based on identical words.

Objectives • The authors therefore aim to compute text semantic relatedness based on concepts and their relations, which have linguistic as well as extra-linguistic dimensions. This remains a challenge, especially in the general domain and/or over noisy text.

Methodology-build concept network • Concepts • They removed all Wikipedia articles belonging to the following namespaces: Talk, File, Image, Template, Category, Portal, and List. • Disambiguation pages were removed. • They set a cut-off limit of 100 non-stop words, below which articles were discarded. • For each hyperlink, they extracted the corresponding anchor text and considered it as another possible secondary title for the linked article.
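
The filtering steps on this slide can be sketched roughly as follows. This is only an illustration, not the authors' code: the `Article` record, the `is_concept` helper, the toy stop-word list and the naive whitespace tokenisation are all assumptions made for the example, and treating "List" as a "List of ..." title prefix is a guess about how that category is detected.

```python
from dataclasses import dataclass

# Namespaces named on the slide; pages in them are not treated as concepts.
EXCLUDED_NAMESPACES = {"Talk", "File", "Image", "Template", "Category", "Portal", "List"}

# Toy stop-word list (assumption); a real run would use a complete list.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is"}

# Cut-off from the slide; keeping articles with exactly 100 words is an assumption.
MIN_NON_STOP_WORDS = 100


@dataclass
class Article:  # hypothetical record type for this sketch
    title: str
    text: str
    is_disambiguation: bool = False


def namespace_of(title: str) -> str:
    """Return the prefix of 'Prefix:Name', or '' when the title has no colon."""
    prefix, sep, _ = title.partition(":")
    return prefix if sep else ""


def count_non_stop_words(text: str) -> int:
    """Count whitespace-separated tokens that are not stop words."""
    return sum(1 for word in text.lower().split() if word not in STOP_WORDS)


def is_concept(article: Article) -> bool:
    """Apply the slide's three filters: namespace, disambiguation, length."""
    if namespace_of(article.title) in EXCLUDED_NAMESPACES:
        return False
    if article.title.startswith("List of "):  # assumed form of list pages
        return False
    if article.is_disambiguation:
        return False
    return count_non_stop_words(article.text) >= MIN_NON_STOP_WORDS


if __name__ == "__main__":
    pages = [
        Article("Talk:Semantics", "word " * 200),
        Article("Semantics", "word " * 200),
        Article("Stub", "word " * 10),
    ]
    print([p.title for p in pages if is_concept(p)])
```

Anchor-text extraction is left out here because the slide does not say how links are parsed.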

Methodology

Methodology-build concept network • Relations • They focus in the present study on hyperlinks and links computed from similarity of content, of category. • They computed the lexical similarity between articles as the cosine similarity between the vectors derived from the articles' texts, after stopword removal and stemming using Snowball.
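
The lexical similarity described on this slide can be illustrated with a small sketch. Several things here are assumptions rather than details from the slides: raw term counts are used as vector weights (the slide does not state the weighting), the stop-word list is a toy one, and `toy_stem` is a crude suffix stripper standing in for the Snowball stemmer so that the example needs only the standard library.

```python
import math
import re
from collections import Counter

# Toy stop-word list (assumption); a real run would use a complete list.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is"}


def toy_stem(word: str) -> str:
    """Crude suffix stripping; a stand-in for Snowball, not equivalent to it."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def term_vector(text: str) -> Counter:
    """Lowercase, tokenise, drop stop words, stem, and count terms."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(toy_stem(t) for t in tokens if t not in STOP_WORDS)


def cosine_similarity(u: Counter, v: Counter) -> float:
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty vector is treated as unrelated to everything
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    a = term_vector("Cats are chasing the mice in the garden")
    b = term_vector("A cat chased a mouse")
    print(round(cosine_similarity(a, b), 3))
```

Two articles would then be linked by a content-similarity relation when this score is high enough; how the links are selected from the scores is not stated on the slide.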

Methodology

Methodology-VP

Methodology-VP to weighted sets of concepts and to texts

Methodology-Approximation

Methodology-Approximation • T-truncated • ε-truncated

Methodology-Learning embedding

Empirical analyses Convergence of the T-truncated

Empirical analyses Convergence of ε-truncated

Empirical analyses

Experiments Average training error

Experiments Word Similarity

Experiments

Experiments Document similarity

Experiments Document clustering

Experiments Comparison of VP and cosine similarity

Experiments Text classification

Experiments

Conclusions

Comments • Advantages • Disadvantage • Applications • Text categorization
