
Replicating Linguistic Resources

Replicating Linguistic Resources. B2SAFE: MPI-TLA CLARIN Center. Willem Elbers (MPI-TLA), 2nd EUDAT Conference. Date: 29 October 2013. The Language Archive. Data on languages: about 60 terabytes of well-described resources; about 20,000 hours of digitized audio/video recordings.



Presentation Transcript


  1. Replicating Linguistic Resources. B2SAFE: MPI-TLA CLARIN Center. Willem Elbers (MPI-TLA). 2nd EUDAT Conference. Date: 29 October 2013.

  2. The Language Archive
     • Data on languages:
       • about 60 terabytes of well-described resources
       • about 20,000 hours of digitized audio/video recordings
       • about 73,000 metadata-described sessions
       • about 4.5 million annotated segments
       • data on more than 200 languages
       • among these, data from about 60 DOBES teams
       • acquisition, speech, multimodal, multilingual, language and cognition, brain imaging, ethnological and other data
     • Mission:
       • Maintain access to all stored resources for the current generation of researchers, language communities and the interested public.
       • Preserve the valuable cultural heritage for current and future generations.

  3. B2SAFE
     • Goals
       • Replication of data
         • B2SAFE!
       • Replication of services
         • RZG providing Language Archive Technology services at replica side
     • B2SAFE Community extensions:
       • Replication based on logical structure defined in the IMDI/CMDI metadata
       • Integrated with underlying SAM-FS

  4. [Diagram] Approx. 80 TB; EUDAT; > 3 TB; > 3 TB

  5. Summary: “Cultural Heritage Data replicated for the future”
     • Data replication running in production
     • LAT Software stack running @ RZG (beta)
     • Replication of authorization records running (beta)

  6. Questions. Thank you for your attention.
