100 likes | 223 Vues
The PERTAINS project, stemming from previous work on EnTag, aims to improve the quality of user-generated tag metadata in online services. By leveraging existing document metadata, classification terms, and user tags—alongside systematic suggestions from controlled vocabularies—the service provides accurate tag recommendations. Utilizing algorithms for matching, filtering, and ranking, PERTAINS features a web service that is fast, scalable, and platform-neutral. It enables disambiguation of suggestions and can restrict results to specific subject areas based on the Dewey Decimal Classification (DDC).
E N D
Caroline Williams, Executive Director of Intutecaroline.williams@manchester.ac.ukAndy Priest, Intute Technical Co-ordinatorandrew.priest@manchester.ac.uk PERsonlisation TAgging interface INformation in Services (PERTAINS)
Background • University of Glamorgan and Mimas • Intute and Copac • Follow on from EnTag
PERTAINS – Tag Suggestion Services • Rationale • Draws on previous work (EnTag project) • Improving quality of user tag metadata • Basis for suggestions • Using existing document metadata • Titles and abstracts • Classification terms and existing keywords (user tags) • Suggestions from controlled vocabulary • Matching against DDC captions and relative index terms
PERTAINS – Tag Suggestion Services • Document Metadata • Title • Classification • Uncontrolled keywords • Controlled keywords • Description Suggestion Services DDC (SKOS) • Aim – improve quality of tag metadata • Basis – existing document metadata / DDC (SKOS) • Algorithm – matching, weighting, filtering, ranking • Implementation – URL based web service interface
PERTAINS – Tag Suggestion Services • Algorithm • Disambiguation of suggestions • Area of Interest (AOI) filter – restrict suggestions to specific subject areas (DDC summaries) • Weighting of suggestions - ranking by relevance • Implementation • Boolean full text matching (MySQL) • URL based web service call interface • Fast, scalable, platform neutral • JSON data structures returned