Primo and Omeka: turning local databases into harvestable repositories. Alexander J. Jerabek, Librarian, Information Technology, Service des bibliothèques, jerabek.alexander_j@uqam.ca, 2014-05-01.
Goal • Make special peripheral collections more accessible and more visible by integrating them into Primo
The Pouchet collection • Donation of 36,000 print documents and 20,050 vinyl records to the Music library • Primarily pedagogical or popular documents • Catalogued apart from the main catalogue, searchable in a local database (Access, .asp) • Ongoing work to catalogue all items
Problem • How to get existing records into Primo? • How to get new or modified records into Primo?
Omeka • “Omeka is a free, flexible, and open source web-publishing platform for the display of library, museum, archives, and scholarly collections and exhibitions. Its “five-minute setup” makes launching an online exhibition as easy as launching a blog.” • http://omeka.org/about • Omeka is a project of the Roy Rosenzweig Center for History and New Media, George Mason University.
Advantages of Omeka • Easy setup and maintenance • French interface • Does exactly what we need: create and update records and allow harvesting by Primo • Useful plugins • Create multiple users • Long-range plans for possible digitization
Disadvantages of Omeka • Not possible to make global changes to records • Dublin Core not always the best fit for the data • Not always easy to define default values • Not possible to export data
Omeka plugins • CSV Import • OAI-PMH Repository • Simple Vocab • Dublin Core Extended • Hide Elements • Collection Tree
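The OAI-PMH Repository plugin is what makes the Omeka records harvestable. As a rough illustration of what a harvester does with it, the sketch below builds a standard `ListRecords` request for the `oai_dc` format and reads titles and creators out of a response. The base URL, the sample response, and the function names are assumptions made for this example; the deck does not show the actual endpoint or any harvested record.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Standard OAI-PMH and Dublin Core namespaces.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, from_date=None):
    """Build a ListRecords request; from_date (YYYY-MM-DD) limits it to newer records."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    if from_date:
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Extract identifier, titles and creators from a ListRecords response."""
    root = ET.fromstring(xml_text.strip())
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", default="", namespaces=NS),
            "titles": [e.text for e in rec.iterfind(".//dc:title", NS)],
            "creators": [e.text for e in rec.iterfind(".//dc:creator", NS)],
        })
    return records


# Hypothetical endpoint and response, for illustration only.
BASE_URL = "https://example.org/oai-pmh-repository/request"
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:1</identifier>
        <datestamp>2014-05-01</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Titre exemple</dc:title>
          <dc:creator>Dupont, Jean (comp.)</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""

url = list_records_url(BASE_URL, from_date="2014-05-01")
records = parse_records(SAMPLE_RESPONSE)
```

Passing a `from` date is one way a scheduled harvest could pick up only new or modified records rather than the whole set.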
Prepare the staff • Create users • Write up procedures for creating records • Iterative process • Test runs in staging to find snags
A few bugs • Dropping initial diacritic • Cannot search on three-letter words
Import data into Excel • Tidy data as much as possible • Filters in Excel • Search and replace in TextPad • Corrections using OpenRefine (http://openrefine.org/) • Add columns, constants (e.g. Format) • Crosswalk column headers to DC elements • Save as CSV UTF-8
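The crosswalk and constant-column steps above were done by hand in Excel, TextPad and OpenRefine. For comparison, a minimal scripted sketch of the same idea might look like the following. The local column names (`Titre`, `Auteur`, `Editeur`), the sample row, the `Format` value and the `;` delimiter are all assumptions chosen for illustration, not the actual Pouchet data or settings.

```python
import csv
import io

# Hypothetical crosswalk: local column header -> Dublin Core element name.
CROSSWALK = {
    "Titre": "Dublin Core:Title",
    "Auteur": "Dublin Core:Creator",
    "Editeur": "Dublin Core:Publisher",
}

# Constant columns added to every row (Format is the example given on the slide;
# the value used here is an assumption).
CONSTANTS = {"Dublin Core:Format": "Musique en feuille"}


def crosswalk_rows(rows):
    """Rename known columns to Dublin Core element names and add the constants."""
    out = []
    for row in rows:
        new_row = {CROSSWALK[k]: v.strip() for k, v in row.items() if k in CROSSWALK}
        new_row.update(CONSTANTS)
        out.append(new_row)
    return out


def to_csv_text(rows, delimiter=";"):
    """Serialise rows as CSV text; encode as UTF-8 when writing the file to disk."""
    fieldnames = list(CROSSWALK.values()) + list(CONSTANTS)
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames,
                            delimiter=delimiter, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Made-up sample row, not a real record from the collection.
sample = [{"Titre": " Titre exemple ", "Auteur": "Dupont, Jean (comp.)", "Editeur": "Editeur exemple"}]
csv_text = to_csv_text(crosswalk_rows(sample))
```

To produce the file itself, the text could be written with `open(path, "w", encoding="utf-8", newline="")`.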
Dataset import into Omeka • Omeka CSV Import defaults • Choose column delimiter: ; • Choose tag delimiter: | • Choose file delimiter: , • Choose element delimiter: /
Setting up Primo • Set up a data source • Set up a scope • Set up a pipe • Create new local fields • Create a new set of normalization rules • Tweak Primo interface
4. Create new local fields • lds08 : Parolier (lyricist) • lds09 : Compositeur (composer) • lds10 : Interprète (performer) • (see notes below for steps)
4. Rules for new local fields • Ex. new field for lyricist based on ‘(par.)’
5. Create new normalization rules • Strip out parenthetical notes for display
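In Primo these rules are configured in the normalization rule editor, not written as code. The logic they express can still be sketched in a few lines: look for a role marker in the creator string and route the name to the matching local field. The mapping below uses the three fields from the previous slide; the function name and the sample names are assumptions for illustration.

```python
import re

# Role markers found in creator strings, mapped to the local fields from the deck.
ROLE_TO_FIELD = {
    "par.": "lds08",     # Parolier (lyricist)
    "comp.": "lds09",    # Compositeur (composer)
    "interp.": "lds10",  # Interprète (performer)
}

# Matches a parenthetical role marker such as "(par.)" or "(comp.)".
ROLE_PATTERN = re.compile(r"\(\s*(par\.|comp\.|interp\.)\s*\)")


def local_fields(creators):
    """Route each creator name to the local field(s) matching its role marker(s)."""
    fields = {field: [] for field in ROLE_TO_FIELD.values()}
    for creator in creators:
        # The bare name is the creator string without its role markers.
        name = ROLE_PATTERN.sub("", creator).strip()
        for role in ROLE_PATTERN.findall(creator):
            fields[ROLE_TO_FIELD[role]].append(name)
    return fields


# Illustrative input; the "Surname, Forename (role)" form is an assumption.
result = local_fields(["Aznavour, Charles (comp.)", "Coulonges, Georges (par.)"])
```

A name carrying several markers would simply be added to several fields.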
5. Create new normalization rules • Add complementary information • Not: • Dublin Core:Publisher = Bibliothèque de Musique • Dublin Core:Description = Disponible au comptoir de prêt • Instead added: • <display> • <ispartof>Musique en feuille no.10599, voir au comptoir de prêt de la Bibliothèque de Musique</ispartof>
5. Create new normalization rules • Added or modified a few elements to conform with our Aleph records • <display/type> = score • <search/general> = Musique en feuille • <search/searchscope> = ubibmusique • <facets/toplevel> = uqam_inst • <facets/library> = M
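Taken together, the two normalization slides describe a small set of fixed values plus one computed note. The sketch below models the resulting record sections as a plain dictionary so the target shape is easy to see. It is not Primo's rule syntax, and the function name and dictionary layout are assumptions; only the values come from the slides.

```python
def build_pnx_sections(sheet_number):
    """Return the display, search and facet values described on the slides."""
    return {
        "display": {
            "type": "score",
            # Location note built from the sheet music number.
            "ispartof": (
                f"Musique en feuille no.{sheet_number}, "
                "voir au comptoir de prêt de la Bibliothèque de Musique"
            ),
        },
        "search": {
            "general": "Musique en feuille",
            "searchscope": "ubibmusique",
        },
        "facets": {
            "toplevel": "uqam_inst",
            "library": "M",
        },
    }


sections = build_pnx_sections(10599)
```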
6. Tweak Primo interface • No use for the location/request tab or for the more (sfx) tab. Hide them with CSS using the data source prefix: • ul.EXLResultTabs li.EXLRequestTab a[href*="BIBMUSIQUE"], • ul.EXLResultTabs li.EXLMoreTab a[href*="BIBMUSIQUE"] {display:none;} • Html: • <ul class="EXLResultTabs…"> • <li class="EXLRequestTab…"> • <a href="display.do?tabs=requestTab….doc=BIBMUSIQUE10478..."> • <a href="display.do?tabs=moreTab...&doc=BIBMUSIQUE10478...">
A few problems, questions remain • Aznavour and Coulonges • The problem of Aznavour as (comp.), (interp.), (par.): leave in parenthetical elements or remove them • Vs
Aznavour and Coulonges • Aznavour • (include all facets)
Aznavour and Coulonges • The example of Georges Coulonges as (comp.), (par.): leave in parenthetical elements or remove them • Vs
A few problems, questions remain • Strip out parenthetical notes for facets and suggested new searches • In addition to ‘(par.)’ etc. we also have ‘(par. Fr.)’ and others. To get them all we used:
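The expression used on the original slide is not reproduced in this transcript. Purely as an illustration of the problem, and not as the rule actually used, a pattern that tolerates extra text after the role abbreviation would catch both ‘(par.)’ and ‘(par. Fr.)’. The pattern, the function name and the sample strings below are assumptions.

```python
import re

# Matches a parenthetical note that starts with a known role abbreviation and
# may carry extra text before the closing parenthesis, e.g. "(par. Fr.)".
ROLE_NOTE = re.compile(r"\s*\((?:par|comp|interp)\.[^)]*\)")


def strip_role_notes(value):
    """Remove role notes so facets and suggested searches use the bare name."""
    return ROLE_NOTE.sub("", value).strip()


# Illustrative strings only.
plain = strip_role_notes("Coulonges, Georges (par.)")
variant = strip_role_notes("Coulonges, Georges (par. Fr.)")
untouched = strip_role_notes("Titre exemple (1950)")
```

Anchoring on the known abbreviations keeps other parentheticals, such as a year, intact, at the cost of missing role markers that are not in the list.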
A few problems, questions remain • Currently no way to limit or prefilter to ‘Musique en feuille’; searchable elements are incompatible with visible elements • Resource type vs Format • Library vs Collection • Not a visible searchable scope option
Outcomes • Collection is available via Primo • Records are modified, added, harvested nightly into Primo • Circulation stats increase dramatically
Future plans • Phase 2 of Pouchet collection, ~10k vinyl recordings • Horus: Law library annual reports database, 1500 records • Gestio: Management documentation centre collection of grey literature, technical papers, etc., 6000 records • Possibility of adding digital objects if sheet music is scanned, documents are digitized