LDS3

LDS3: Applying Preservation Principals to Linked Data Systems. David Tarrant (@davetaz, davetaz@ecs.soton.ac.uk), Open Planets Foundation / University of Southampton. iPres2012, Toronto, October 2012.

Presentation Transcript


  1. LDS3 David Tarrant @davetaz davetaz@ecs.soton.ac.uk Open Planets Foundation / University of Southampton Applying Preservation Principals to Linked Data Systems iPres2012 Toronto, October 2012 This work was partially supported by the SCAPE Project. The SCAPE project is co-funded by the European Union under FP7 ICT-2009.4.1 (Grant Agreement number 270137).

  2. Present Day

  3. Presenting the REF: The Results Evaluation Framework • 5 Tools (Droid, Fits, file, fido, Tika) • 65 Versions (from 2008 to now) • 1 Govdocs Corpus • 1 Question…

  4. How accurate are file format identification tools historically?

  5. PDF 1.4

  6. DOCX

  7. 9 Months Ago

  8. Why is Data Important? • Data and Metadata are knowledge. • Knowledge is power. • Knowledge enables decision. • Knowledge enables process. • Knowledge empowers action. • Knowledge enables us to say because…

  9. A Classic Flow Chart: Data is key to making decisions. [Flow chart labels: Processes, DATA, Process, Decision]

  10. A Preservation Flow Chart: Data is key to informing policy. [Flow chart labels: Policy, DATA, Process, Policy]

  11. Policy Data - Generated • When? • Who? • What it affects? • What action is taken? • Why? Policy

  12. Why? • Because something said so? DATA • When? • Who? • What it affects? • What action is taken? • Why? DATA DATA

  13. Case Study Example (Opinion) • Due to format obsolescence, all Flash video files are to be migrated to H264/AAC. • Input data: Study on the proliferation of Flash and evidence of lacking support from the rights holder, Adobe. • File B was created from File A a year ago, as File A was identified as being a Flash video file. • Today, File A is identified as being an Ogg video file. • What has changed? Why? Does it affect me? Who generated the wrong information? Did they generate any other wrong information?

  14. I Don’t Know!

  15. 6 Months Ago

  16. A Fact? File#1 hasIdentification application/zip

  17. Provenance • Tarrant, David and Carr, Leslie (2012) LDS3: Applying Digital Preservation Principals to Linked Data Systems. In: Ninth International Conference on Digital Preservation (iPres2012), Toronto, Canada. • Tim Berners-Lee Provides 5-Star Linked Data Guide

  18. Data!!! • One fact. • One document the fact comes from. • One citation about the document's place of publication. • Who, What, When and Where. • Who they worked for and with.

  19. In Linked Data a document is called a named-graph. • But these also get used for two purposes! [Diagram: a Named-Graph containing File#1 hasIdentification application/zip]

  20. The two uses of the named-graph: No. 1 – Data Publication. [Diagram: a Named-Graph publishing DATA, containing File#1 hasIdentification application/zip]

  21. The two uses of the named-graph: No. 2 – Data Discovery/Query. [Diagram: one Named-Graph gathering DATA from several sources, containing File#1 hasIdentification application/zip and File#1 hasIdentification application/msword]

  22. The two uses of the named-graph: No. 2 – Data Discovery/Query. [Diagram: two Named-Graphs, each holding File#1 hasIdentification statements (application/zip, application/msword) together with Works For links]

  23. Quads. [Diagram: a Query Graph drawing on Source Graph 1 (File#1 hasIdentification application/zip) and Source Graph 2 (File#1 hasIdentification application/msword)] After all, RDF is a graph model (RDF the spec, not the RDF/XML serialization).
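The quad idea on this slide can be sketched in a few lines of plain Python. This is only an illustration and not the LDS3 implementation: the graph names `source-graph-1` and `source-graph-2` and the `query` helper are assumptions made for the example, while the two statements are the ones shown on the slide.

```python
from collections import namedtuple

# A quad is an RDF triple plus the name of the graph it was published in.
Quad = namedtuple("Quad", ["graph", "subject", "predicate", "obj"])

# Hypothetical graph names; the two statements mirror the slide.
QUADS = [
    Quad("source-graph-1", "File#1", "hasIdentification", "application/zip"),
    Quad("source-graph-2", "File#1", "hasIdentification", "application/msword"),
]


def query(quads, subject, predicate):
    """Return (object, graph) pairs so every answer keeps its source graph."""
    return [
        (q.obj, q.graph)
        for q in quads
        if q.subject == subject and q.predicate == predicate
    ]


results = query(QUADS, "File#1", "hasIdentification")
for obj, graph in results:
    print(f"{obj} (from {graph})")
```

Because the fourth element survives the query, two conflicting identifications for the same file can sit side by side, each traceable to the graph that asserted it.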

  24. Quads. [Diagram: a Query Graph drawing on Source Graph 1 (usesTool File 5.04; File#1 hasIdentification application/zip) and Source Graph 2 (usesTool File 5.07; File#1 hasIdentification application/msword)]

  25. Still with me… • OK, so what about versioning? [Diagram: named graph File1/Identification/tool/file/version/5.03 containing File#1 hasIdentification University of Southampton; named graph File1/Identification/tool/file/version/5.07 containing File#1 hasIdentification application/msword]

  26. Latest. [Diagram: as on the previous slide, with /File1/Identification/tool/file/ marked as Latest and a "previous version" link between File1/Identification/tool/file/version/5.07 and File1/Identification/tool/file/version/5.03]
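One way to picture the "latest" and "previous version" links is as a linked chain of versioned graph URIs. The sketch below is an assumption-laden illustration and not how LDS3 stores versions: the `GraphVersion` class, the `publish` and `history` helpers, and the `application/zip` value given to version 5.03 are all hypothetical; only the URI pattern and the 5.07 result come from the slides.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

BASE = "/File1/Identification/tool/file"


@dataclass
class GraphVersion:
    uri: str
    identification: str
    previous: Optional[str] = None  # URI of the version this one supersedes


versions: Dict[str, GraphVersion] = {}


def publish(version: str, identification: str, latest: Optional[str]) -> str:
    """Store a new versioned graph that points back at the current latest."""
    uri = f"{BASE}/version/{version}"
    versions[uri] = GraphVersion(uri, identification, previous=latest)
    return uri


def history(uri: Optional[str]) -> List[str]:
    """Walk the previous-version links from the given graph to the oldest."""
    chain = []
    while uri is not None:
        chain.append(uri)
        uri = versions[uri].previous
    return chain


latest = None
latest = publish("5.03", "application/zip", latest)  # assumed value
latest = publish("5.07", "application/msword", latest)

print(history(latest))
```

Here `latest` plays the role of the unversioned `/File1/Identification/tool/file/` address: it always resolves to the newest graph, and older results stay reachable by following the chain.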

  27. 3 Months Ago

  28. www.LDS3.org • A technical solution to all the complexity, automatic: • Versioning • Linking • Annotation • Named-Graph Management • Query Management

  29. Demo

  30. www.LDS3.org • CRUD • SWORDv2 (Based Upon) • OAuth Authentication

  31. In the paper • Links between P2-Registry, Pronom and LDS3 • Description of the LDS3 specification • Overview of software in the LDS3 stack (hardly any of it is new) • How LDS3 relates to Amazon S3 • More on named-graph versioning • More on information and non-information resources

  32. 2 Months Ago

  33. DEMO • http://dev.lds3.org/admin/timemachine.php?uri=http://dev.lds3.org/doc/B1/E3/7F01/8ACE-43BA-9AA9-B708B7A20263

  34. Present Day

  35. Presenting the REF: The Results Evaluation Framework • 5 Tools (Droid, Fits, file, fido, Tika) • 65 Versions (from 2008 to now) • 1 Govdocs Corpus • 1 Question…

  36. How accurate are file format identification tools historically?

  37. PDF 1.4 http://data.openplanetsfoundation.org/ref/pdf/pdf_1.4/

  38. DOCX http://data.openplanetsfoundation.org/ref/docx/

  39. Back To The Future

  40. The Future • Get me the identification for a file as it would have been on 3rd October 2010. Request: GET /ref/?query="SELECT ?identification where file = X" HTTP/1.1 Accept-Datetime: Sun, 3 Oct 2010 12:00:00 GMT Accept: text/plain Response: application/zip
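The request on this slide relies on the `Accept-Datetime` header, which is the header the Memento protocol uses for datetime negotiation. The sketch below shows, under stated assumptions, how a client might format that header and how a server might pick the answer that was current at the requested moment. The `HISTORY` entries, their dates, and both helper functions are hypothetical; nothing here is taken from the LDS3 code base.

```python
from bisect import bisect_right
from datetime import datetime, timezone
from email.utils import format_datetime


def accept_datetime(moment):
    """Format an aware datetime as an HTTP date suitable for Accept-Datetime."""
    return format_datetime(moment.astimezone(timezone.utc), usegmt=True)


# Hypothetical record of when each identification result was published,
# sorted by time. The dates are made up for the example.
HISTORY = [
    (datetime(2010, 1, 1, tzinfo=timezone.utc), "application/zip"),
    (datetime(2011, 6, 1, tzinfo=timezone.utc), "application/msword"),
]


def identification_as_of(history, moment):
    """Return the newest result published at or before the given moment."""
    times = [published for published, _ in history]
    index = bisect_right(times, moment)
    if index == 0:
        return None  # nothing had been published yet
    return history[index - 1][1]


moment = datetime(2010, 10, 3, 12, 0, 0, tzinfo=timezone.utc)
headers = {"Accept-Datetime": accept_datetime(moment), "Accept": "text/plain"}

print(headers["Accept-Datetime"])
print(identification_as_of(HISTORY, moment))
```

With the made-up history above, a request dated 3 October 2010 falls between the two entries, so the older `application/zip` result is the one returned, matching the response shown on the slide.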

  41. LDS3 David Tarrant @davetaz davetaz@ecs.soton.ac.uk Open Planets Foundation / University of Southampton Applying Preservation Principals to Linked Data Systems iPres2012 Toronto, October 2012 This work was partially supported by the SCAPE Project. The SCAPE project is co-funded by the European Union under FP7 ICT-2009.4.1 (Grant Agreement number 270137).
