1 / 46

XTF 2.1

XTF 2.1. Powerful Search and Display without the Headaches. Martin Haye California Digital Library. Overview. What is XTF? Community-inspired development New features in 2.1 Planned improvements. XTF in 2 minutes. e X tensible T ext F ramework Search and display technology from CDL

alsatia
Télécharger la présentation

XTF 2.1

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. XTF 2.1 Powerful Search and Display without the Headaches Martin HayeCalifornia Digital Library

  2. Overview • What is XTF? • Community-inspired development • New features in 2.1 • Planned improvements

  3. XTF in 2 minutes • eXtensible Text Framework • Search and display technology from CDL • Open-source Java framework • Powerful and highly configurable • XML + Full text search • Also indexes PDF, HTML, Word

  4. XTF in 2 minutes • Search: Query power/speed of Lucene, plus: • keyword search, facets, spelling, lots more • View: Processing power of Saxon, plus: • large file optimizations, hit markup • Configure and customize exclusively in XSLT • Mature, tightly integrated, well documented • In use at CDL and many other places

  5. Online Archive of California

  6. eScholarship Editions

  7. calisphere

  8. Mark Twain Project Online

  9. UC Berkeley

  10. Encyclopedia of Chicago

  11. Indiana University: Newton

  12. Indiana University: Swinburne

  13. Sweden

  14. Italy

  15. How does XTF compare? Green- stone * * Solr Turn-key / easy---------------> XTF 2.1 XTF 2.0 Customizable / Powerful ----------------------------------------> * disclaimer: based on my limited experience with Greenstone and Solr

  16. Community-inspired Development • First, we asked the XTF community for features they wanted • Then everybody voted • People wanted many features they saw in XTF projects at CDL

  17. Aligning Our Process • Our group was starting a new CDL project • We aligned our development • Result: Everybody benefits

  18. New and improved features • Faceted browse • Search flexibility • Bookbag • Spelling correction • Similar items • Other stuff

  19. Faceted browse • Previously implementing faceted browse required lots of XSLT programming. • Hierarchical facets: even harder • Required us to deeply refactor the stylesheets, but now it’s simple to add new facets.

  20. Faceted browse

  21. Faceted browse

  22. Hierarchical facets

  23. Hierarchical facets

  24. Search flexibility • Keyword search: single box (now default). Internally, searches multiple fields. • Advanced search: explicitly fill in constraints for various fields • Freeform search (new): text-based field specifiers, AND, OR, parentheses, etc.

  25. Keyword search

  26. Advanced search

  27. Freeform search

  28. This fit nicely into XTF’s architecture Simple but conforming implementation OAI-PMH

  29. OAI-PMH

  30. Bookbag • Refactored the AJAX to use YUI (Yahoo User Interface widgets) • Still session based • Now supports emailing the bookbag

  31. Bookbag

  32. Bookbag

  33. Bookbag

  34. Bookbag

  35. Spelling correction • Unicode bug fixes • On by default and fully integrated

  36. Spelling correction

  37. Spelling correction

  38. Similar items • Allows user to see “more like this” • Improved AJAX integration • On by default - no configuration needed

  39. Similar items

  40. Similar items

  41. Other changes in XTF 2.1 • Built-in NLM “Blue”, TEI P5, MS Word support (still support TEI P4, EAD, PDF, HTML, text) • Valid XHTML output • RawQuery servlet to provide a query back-end to a (e.g. Ruby) front-end or mash-up. • Bug fixes and minor changes (many reported/requested by users)

  42. Wiki documentation

  43. Wiki documentation

  44. On the horizon • A page-turner for scanned texts and converted PDFs • Pop-up image/PDF page snippets • Background auto-warming, to speed response after incremental indexing • And of course, features suggested as users upgrade to or adopt XTF 2.1

  45. Philosophy • Adaptation through programming • XTF is still about building what you want using a set of powerful tools But now: • Stylesheets are more modular • Build interfaces faster using honed widgets • Prettier UI to start with

  46. Fin • Download: xtf.sourceforge.net • Documentation: xtf.wiki.sourceforge.net • Discussion: groups.google.com/group/xtf-user • Me: martin.haye@ucop.edu

More Related