1 / 42

Pattern Recognition in Action for Cataloging and Metadata

Pattern Recognition in Action for Cataloging and Metadata. 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006. Agenda. OCLC Cataloging/Metadata strategic directions Architecture to support strategy Examples of projects . Cataloging Environment.

Jeffrey
Télécharger la présentation

Pattern Recognition in Action for Cataloging and Metadata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Pattern Recognition in Action for Cataloging and Metadata 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006

  2. Agenda • OCLC Cataloging/Metadata strategic directions • Architecture to support strategy • Examples of projects

  3. Cataloging Environment • Fewer catalogers, reduced budgets • Little growth in print materials acquisitions • E-resources increasing – cataloged?

  4. Deliver more automatically • Build on PromptCat, Cataloging Partners program success • Partner with major materials providers • Cataloging tied to selection – possible new service

  5. More Scripts/Language Support • Growing WorldCat • Supporting libraries’ diverse collections • Easier to get materials cataloged • Growing membership • One stop shopping • Both US and global libraries

  6. Metadata support for e-content • Support automated metadata generation for e-resources • Facilitate storage and discovery of digital content • Support new metadata schemes - crosswalks • Enrich WorldCat with e-serials records and holdings

  7. Continue to deliver value • Ongoing Connexion maintenance • Standards • Simplify pricing

  8. “The Internet is entering its Lego era. Indeed, blocks of interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.” --John Markoff, The New York Times, April 5, 2006 Lego Era

  9. Library 2.0 “Library 2.0 is about small pieces of software loosely joined, … requires business models where multiple vendors bring value to consumers together … to reduce duplication of effort and reduce barriers to innovation…” --Paul Miller, “Library 2.0: the challenge of disruptive innovation.” http://www.talis.com/resources/documents/447_Library_2_prf1.pdf

  10. OCLC Metadata Management Service Connexion Digital Archive Content Coop ILS PICA NetLibrary Material Vendors Publishers OAI Repositories Local DB’s Web Services/Portal/API Layer Local Holdings (MFHD) OAI Harvest Validate DA Ingest DA Extract DA Access Metadata Creation Z39.50 (authorities Non-roman Format Crosswalks Acquisitions/ Selection Terminologies Pan/Zoom Language Service SRW/Zing Update Shelf Ready Reports & Stats Profiling Metadata Capture Usage Stats Profiling Data Digital Archive

  11. Projects Metadata support for e-content • Extraction/Creation Web Service • Crosswalk Web service • OCLC Terminologies Service • Content Cooperative Pilot • OCLC eSerials Holdings Service

  12. Extraction/Creation Web Service Extract metadata from Web sites, PDF files, and Word files • Re-implementing and enhancing functionality currently available in Connexion browser • Connexion browser – May 2006 • Connexion client 1.60 – June 2006

  13. Connexion extract metadata • Enter URL or path to extract metadata • Supported file types .htm, .doc, .pdf • Create multiple records from Web sites linked to the parent URL • Specify to display or save created workforms, apply default constant data, and define My Status value • Future – add tools to “create” metadata

  14. Crosswalk Web Service • Batchload/PromptCat • ONIX to MARC • OAI harvesting – Dublin Core to MARC • Future • Import and export Dublin Core data from Connexion client • Support for other metadata schemes both browser and client interfaces

  15. OCLC Terminologies service • Introduction, June 2006 • Add more access points using other controlled vocabularies, e.g., MeSH, GSAFD • Available to all OCLC Cataloging subscribers • Subscriptions available for non-Cataloging users • Use with a variety of metadata editors, e.g., Connexion browser and client

  16. List of Terminologies in initial release • aat - Art & Architecture Thesaurus (J. Paul Getty Trust) • dct - Dublin Core Metadata Initiative Type Vocabulary (Dublin Core Metadata Initiative) • gmgpc - Thesaurus of Graphic Materials, TGM I (Library of Congress) • gsafd - Guidelines On Subject Access To Individual Works Of Fiction, Drama, Etc. (American Library Association) • lctgm - Thesaurus of Graphic Materials, TGM II (Library of Congress) • mesh - Medical Subject Headings (MeSH®) (National Library of Medicine) • ngl - Newspaper Genre List (University of Washington) • tgn - Thesaurus of Geographic Names (J. Paul Getty Trust) • ulan - Union List of Artists' Names (J. Paul Getty Trust

  17. Terminology Pane A separate application 1 2

  18. Content Cooperative Pilot • Upload content objects to the OCLC Digital Archive from Connexion browser and client interfaces • Digital image, thesis & dissertation, oral history, e-book, video, etc. • Replace WorldCat records to automatically add a URL pointing to the content object • Access digital content from FirstSearch, Group Catalogs, and OpenWorldCat

  19. Planned Open WorldCat Page for Digital Image

  20. Planned Zoom & Pan

  21. OCLC eSerials Holdings service • Automatically updates eSerials holdings in WorldCat • Access to eSerials via WorldCat Resource Sharing • Access to eSerials through OCLC discovery platform • Compare electronic and print serials collections No additional work for the library!

  22. P P P E E P E P P E E P E E E Digital Collections D P E P E P E Print Collections P P P E P E E Links to online Full Text Resource Sharing Services Links to OPACs P P E P E E OCLC Libraries Vendors OCLC OCLC FirstSearch WorldCat Resource Sharing WorldCat Collection Analysis Open WorldCat Resolver / A-Z serials list Resolver / A-Z serials list P E P P

  23. Pilot Partners • 35 Pilot libraries • EBSCO • Ex Libris • Serials Solutions • TDNet • More to come

  24. Benefits to the library • Increased operational efficiencies in ILL • Filling where possible • A revenue opportunity for some • You control requests via automatic deflection • Increased visibility at the point of need • Leverages investment in services

  25. Progress • Initial production system available late June 2006 • Web-based registration form • No charge to participate in the eSerials holdings service • Future enhancements projected to include options for local holdings data, MARC record update service, and additional deflection choices

  26. ProjectsDeliver more automatically • Improve shelf-ready cataloging • PromptCat/Cataloging Partners – 100% goal • Partner with major vendors • Selection • Possible future service • OCLC partnering with materials vendors to help with notification slip selection process • Cataloging a by product of selection • Watch for more information in the future!

  27. Projects More Scripts/Language support • New scripts support • Cyrillic, Greek and Hebrew – July 2005 • Thai and Tamil scripts for use with Connexion client 1.50 (investigating Devanagari, Sinhala, and Bengali next) • Connexion interface translations • Client • Chinese (Traditional and Simplified) and Japanese – July 2005 • German and Korean – Nov. 2005 • CatExpress – French (Nov. 2006) • Unicode export – Nov. 2005 • Automatic transliteration Web service – June 2006

  28. Questions, Answers & Sharing ? ? ?

  29. Cataloging future directionsContact us • Eric Childress: eric_childress@oclc.org • Chris Grabenstatter: c_grabenstatter@oclc.org

More Related