1 / 5

Enhancing Invenio with External Ranking Tools: Current Status and Future Directions

This document outlines the current status and future plans for incorporating external ranking methods into Invenio. It discusses available ranking methods such as word similarity and citation count, and the current implementation of word similarity ranking. Written in Python, this approach aims to bridge Invenio with external information retrieval systems while ensuring scalability for approximately 2 million records. The document highlights ongoing integration efforts, expected completion by the end of June, and future priorities driven by community feedback, including potential features like a Xapian adapter and further ranking methods.

Télécharger la présentation

Enhancing Invenio with External Ranking Tools: Current Status and Future Directions

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Using external ranking tools May 8, 2012 Patrick Glauner patrick.oliver.glauner@cern.ch

  2. Current Status • Available ranking methods: • Word similarity • Citation count • Current word similarity ranking implementation: • Meta-data only • Written in Python • Maintenance and furtherdevelopmentconsume development resources that could be spent on digital library features • Scalability, designed for ~2 million records

  3. Goal • A generic bridge between Invenio and external information retrieval systems • Invenio shall dispatch word similarity ranking tasks to the external system and retrieve its results Resultset Bridge … Rankedresultset

  4. Evaluation and • Written in Java and Lucene-based • HTTP access • Written in C++ • Direct access to local database in program code

  5. Progress andOutreach • integration is currently being implemented • Will be ready end of June including scalability measurements • Future priorities will depend on thecommunityfeedback • Possiblefuturefeatures: • Xapian adapter • Furtherrankingmethods • Externalsearch • …

More Related