1 / 30

National Research Data Archive MIDAS: development decisions and usage peculiarities

National Research Data Archive MIDAS: development decisions and usage peculiarities. Saulius Maskeliūnas Vilnius University Institute of Mathematics and Informatics Akademijos str. 4, Vilnius LT-08663, Lithuania. Content.

Télécharger la présentation

National Research Data Archive MIDAS: development decisions and usage peculiarities

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. National Research Data Archive MIDAS: development decisions andusage peculiarities Saulius Maskeliūnas Vilnius University Institute of Mathematics and Informatics Akademijos str. 4, Vilnius LT-08663, Lithuania .

  2. Content • Introductory facts about National Research Data Archive (MIDAS) project • Implementation aims and principles of MIDAS • Planned MIDAS outcomes and peculiarities • MIDAS data mining tool (DAMIS) • Conclusions • Demonstration of MIDAS • Demonstration of DAMIS

  3. 1. Introductory facts about MIDAS project(1) • Project Title: National Open Access Research Data Archive (LT: Nacionalinis atviros prieigos Mokslo Informacijos Duomenų Archyvas, MIDAS) • Lead institution:Vilnius Universitywww.vu.lt • Project partner:Vilnius University Hospital Santariškių Klinikos (Santariškės Clinics)santa.lt • Project participants:13 institutions of science and studies, and medical institutions

  4. 1. Introductory facts about MIDAS project(2) • Funded by: EU Structural Funds and national budget • Project budget: ~ € 4.34M(i.e., almost 15M LTL) • Duration: 40 months (start date: January 1, 2012 , end date: June 30, 2014  April 30, 2015) • Current status:– technical infrastructure: not installed yet; – software development: beginning of 2nd iteration.

  5. 2. Implementation aims and principles of MIDAS • to establish the infrastructure that enables collection, organizing and storage of empirical and research data (with corresponding metadata), ensuring free, convenient, interactive search, access and analysis of data. MIDAS implementation purpose

  6. Prospective MIDAS users • Researchers, lecturers, professors, students; • Science and studies institutions [and/or their representatives]; • Institutions which present research data (e.g., hospitals), • Research and development (R&D) enterprises; • Public administration institutions which use R&D statistical data; • other interested physical and judicial persons.

  7. Development principles • privacy and security (i.e., information confidentiality, integrity and non-repudiation) • usability • accessibility (functioning 24 hours per day, 7 days per week) • extensibility (i.e., software architecture scaling in cases of incorporation of additional hardware)

  8. MIDAS compatibility • MIDAS archive will be based on usage of open code software, XML format and other open metadata, bibliographic, information retrieval standards (CERIF, CERIF for Datasets, CIF, DICOM, Dublin Core, MARC21, ISO/IEC 11179-1:2004, OAI-PMH, etc.). • That will ensure compatibility with other information systems, data archives and registries in Lithuania and internationally (e.g., Data Citation Index of Thomson Reuters http://thomsonreuters.com/data-citation-index/ ).

  9. Integration with other data archives and registers • Lithuanian Academic E-Library eLABawww.elaba.lt • Lithuanian Data Archive for Social Sciences and Humanities LiDAwww.lidata.eu/en • Lithuanian Networked Digital Library of Theses and Dissertations Lit-ETDetd.elaba.lt • National Medical Picture Archiving and Information Exchange System MedVAIShttp://www.epractice.eu/en/news/5364871 • etc.

  10. 3. Planned MIDAS outcomes and peculiarities MIDAS outcomes(1) • The infrastructure that enables collection, organizing and storage of empirical and research data (with corresponding metadata), ensuring free, convenient, interactive search, access and analysis of data;

  11. MIDAS outcomes(2) • National united research data archive with analytical software tools; • Infrastructure for collection and transferring of biomedical research data,consisting of DICOM (for collecting data from medical equipment), ECG (for collecting electrical cardiogram data from medical devices), content management, data depersonalisation, and data archiving modules; • Public interactive e-service “Search, Delivery and Analysis of Research Data”.

  12. MIDAS implementationadvantages • Guaranteed safety and effective sharing of research data • Increased quality of research outputs • Preventing duplication of effort in research data collection • Increased variety of research outputs

  13. 4. Data mining tool DAMIS(slides by Olga Kurasova <......................................> ) Graphical user interface (GUI) web services Data mining algorithm

  14. DAMIS is a tool for analysis of the MIDAS data; The following data mining methods are implemented: • preprocessing (cleaning, filtering, splitting, transposing, norming, feature selecting); • statistical primitives (min, max, mean, standard deviation, median); • dimensionality reduction (multidimensional data visualization); • classification and clustering. FunctionalitiesofDAMIS

  15. DAMIS is a web-based systemhttp://dev.damis.lt (user name/password: demo/demo , 1234/1234 ); The web interface does not require any software installation; a web browser is enough for its usage; There is a possibility to choose high performance computing resources (VU MII cluster – VU MIF supercomputer); The usage is based on creation of scientific workflows; The results obtained can be saved inMIDAS and in a user computer. FunctionalitiesofDAMIS

  16. A sample of multidimensional data(breast cancer data)

  17. DAMIS GUI

  18. Data upload

  19. Data preprocessing

  20. Experiments

  21. Statistical primitives

  22. Dimensionality reduction

  23. Data classification and clustering

  24. Matrix view of Iris after dimensionality reduction by PCA

  25. Iris graphical representation

  26. 5. Conclusions (1) • MIDAS will provide virtual services for researchers and other participants in research and education that can lead to more efficient, effective and higher quality research; • Users will have the possibilities to:– register, find and cite research data, – search for and use other infrastructures andtools (whichprovide data archiving services), – share or integrate data and tools to other science and studies infrastructures;

  27. 5. Conclusions (2) • National Research Data Archive MIDASwill increase research cooperation possibilities, because of simpler, more convenient, unified, advanced possibilities of research data collection, analysis, application and sharing.

  28. 6. Demonstration of MIDAS http://midas.insoft.lt:8888/web/User name / password: 123/123 456/456 789/789 101/101

  29. 7. Demonstration of DAMIS http://dev.damis.ltUser name / password: demo/demo 1234/1234

  30. Thanks for Your Attention !Questions ?...

More Related