1 / 12

Faceted browsing for ACL Anthology

Faceted browsing for ACL Anthology. Praveen Bysani. ACL Anthology. a digital archive of research papers in CL and NLP contains over 20,100 papers free of cost a rchive for sister conferences and journals. Current browser. d irect and navigational search hard to navigate

galvin
Télécharger la présentation

Faceted browsing for ACL Anthology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Faceted browsing for ACL Anthology Praveen Bysani

  2. ACL Anthology • a digital archive of research papers in CL and NLP • contains over 20,100 papers • free of cost • archive for sister conferences and journals

  3. Current browser • direct and navigational search • hard to navigate • non-customized search • non-sortable results

  4. Faceted browsing • Combination of navigational and direct search paradigms • Facets are properties of information elements • Access to organized information • Ability to explore the collection in multiple dimensions through filters

  5. Faceted Browsing • RoR + Blacklight plugin • Apache Solr • Metadata from XML • Blacklight customization for XML

  6. Show view

  7. Index View

  8. More cookies.. • User Feedback • Comment/ Share / Like • Suggestions for correcting the meta data • Ability to export bib in six formats • Author pages • List of publications • Co-authors

  9. Third-party annotations • Automatically annotate articles with new metadata • Anthology as a corpus • API to make anthology an object of study • OAI compatible • allows metadata harvesting • @ http://aclanthology.heroku.com/

  10. Challenges • Normalizing the quality of anthology meta data information • SIG Information • yaml files • no identifiers provided • DOI • from acm • changes in names of papers, authors

  11. Similar works ACL Author Network • bibliometrics ACL Search Bench • Semantic search

  12. Plans for the future • A common data schema to integrate all • Indexing the whole text data • Range queries for year facet • Exporting total volume bibliography • Enriching author pages

More Related