1 / 11

A Practice and Value Proposal for Doctoral Dissertation Data Curation

A Practice and Value Proposal for Doctoral Dissertation Data Curation. Aaron Collie Digital Curation Librarian Michigan State University. Michael Witt Assistant Professor of Library Science Purdue University. We are going to answer:. Why expose data? Why dissertation data?

elainet
Télécharger la présentation

A Practice and Value Proposal for Doctoral Dissertation Data Curation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Practice and Value Proposal for Doctoral Dissertation Data Curation Aaron Collie Digital Curation Librarian Michigan State University Michael Witt Assistant Professor of Library Science Purdue University

  2. We are going to answer: • Why expose data? • Why dissertation data? • What’s in it for us? • Why can’t we do it now? • How could we do it? • Why are you telling us this?

  3. Proposition Summary • By reengineering ETD workflows to include research data, we can add value to both the process and the products of research • The Electronic Theses & Dissertation (ETD) lifecycle is a scale model of the scholarly communication lifecycle • By enabling curation at scale, we can grapple with the digital curation lifecycle model from within a subsystem of scholarly communication

  4. Why expose data?

  5. Why dissertation data? “The role of the library in data-intensive research is important and a strategic repositioning of the library with respect to research support is now appropriate.” Alma Swan & Sheridan Brown. Skills, Role & Career Structure of Data Scientists & Curators: Assessment of Current Practice & Future Needs. JISC, July 2008. "The term ‘digital curation’ is increasingly being used for the actions needed to maintain and utilise digital data and research results over their entire life-cycle for current and future generations of users.” Joint Information Systems Committee (June 2003). JISC Circular 6/03 (Revised). http://www.dcc.ac.uk/docs/6-03Circular.pdf

  6. Value Proposal: What’s in it for us? • Ensures research integrity • Collection of Digital Assets • Option to self-archive data as Open Access • Availability of Data Citations • Preparation for grant writing • Improves research impact

  7. Limiting Context: Why can’t we do it now? Current ETD workflows allow for the students to affix data as a supplementary and disjointed attachment to the theses or dissertation as a final step in the deposition process. Format and file-size restrictions limit the type and amount of data that ETD submission systems will accept. Because data is not hosted until after the final draft has been deposited, persistent unique identifiers are not able to co-link document and data. Because data is treated as supplementary to the ETD, data is only acquired if provided by the author. Examination committee’s are limited by not having data readily (automatically) available during the review process. Data currently inherit the restrictions placed upon the ETD package

  8. ETD Management Systems • ETD Management Systems are content management systems with a specialized suite of services to support ETDs • ETD Management Systems include submission packages • Examples: openETD, vireo, VALET, HYDRA, FEZ

  9. Deposition vs. Curation: How could we do it? Current Proposed

  10. ETDs: A scale-model digital curation environment Similar Route Different Scale Flickr user: contemplative imaging Flickr user: loadstone

  11. Conclusions • ETDs: A scale-model digital curation environment • That is a subsystem of Scholarly Communication • That is a tractable fragment of the research lifecycle • That contains a captive audience • Dissertation data: A low hanging fruit for building and curating data collections • That is aligned with the mission of the university and library • That will help populate fledgling data collections • That has a significant value proposition

More Related