1 / 66

3. September Duisburg, Germany International Interdisciplinary Open Archives

Subject-specific international services in Physics Eberhard R. Hilf, H. Stamerjohanns, and Thomas Severiens Institute for Science Networking physnet.uni-oldenburg.de/~hilf. 3. September Duisburg, Germany International Interdisciplinary Open Archives

libitha
Télécharger la présentation

3. September Duisburg, Germany International Interdisciplinary Open Archives

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Subject-specific international services in Physics Eberhard R. Hilf, H. Stamerjohanns, and Thomas SeveriensInstitute for Science Networkingphysnet.uni-oldenburg.de/~hilf 3. September Duisburg, Germany International Interdisciplinary Open Archives and Subject specific services in Mathematics and Physics.

  2. Content of talk: • I: Why subject-specific services? • II: Open Archives Distributed in Physics • III: International embedding and organization E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  3. Part I: Why subject-specific services? Knowledge repository requirements • Restricted • Complete • Professional • Research-driven • Additional subject-specific services E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  4. 1. Why restrict the knowledge basis? • Higher ratio of relevant information retrieved • Less ‚missunderstanding‘ [different meanings and content for same word in different fields] Search for Ideal Altavista: no relevant in first twenty Google: no relevant in first twenty >Science>Math: one in five PhysDoc (in title): third title relevant ; with metadata: all relev. Mpress (in title): only relevant documents E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  5. Use machine-readable metadata Tool for authors in MathNet and Phys-Net Webform for adding metadata MyMetaMaker E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  6. Subject-specific Additional Information • Examination regulations • Teaching plans • Technical specifications for experiments E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  7. Problem of Interdisciplinarity • Upgrade services in both fields • Additional functionality into used-to services • Use knowledge repository of both fields • Intellectual Mapping of keywords failed [few usable docs, level mismatch] • Automated Mapping: 17.000 INSPEC with PACS AND MSC. Statistical analysis, ranking, grammar truncation. • Workpackage 9 of CARMEN (BMBF) • J. Pluemer et al. (Osnabrueck), Th. Severiens (ISN). • Interest of documents only in border areas • Border areas are often most active scientifically E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  8. Keywords (Physics) ><PACS >>keywords’=joint repository=><MSC ><keywords (Math) • Physicists use keywords, not PACS • Mathematicians use MSC E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  9. 2. Why complete repository? Prime research needs • instant (Web, no delay) information of all relevant new results • complete information fom anywhere in the world • One stop service despite a multitude of distributed heterogenous repositories. Consequences for financing concepts E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  10. 3. Why professional content repository • Researchers need mostly information from their professional colleagues. • Researchers can act only in their own subject-field as referees, quality filters for the wider public, comment and select. • The Web allows for a multilevel professional quality management for all heterogenous purposes E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  11. 4. Why research driven repository? • Authors have the highest motivation to be read, to get their documents distributed and archived. • Author communication communities are subject-specific. • Scientists understand only their subject-colleagues • Research is organized most often in subject-specific topical institutes E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  12. Part II: Realization bits • Quality filter schemes • PhysNet of EPS • Open Archive Distributed OAD E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  13. A field specific professional service has to meet the expectation of a quality service:The service should not contain everything but only material certified by physicists to be relevant and good physics. Thus we need certification levels. PhysNet has but just one: what is on Physics Department‘s webservers E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  14. What refereeing do we need ? • Instant publishing before refereeing • Time stamp for prime research before refereeing • Archiving of relevant information • Competitive parallel) refereeing • Multilevel refereeing • Full information published to be fair to referees • Open refereeing [signed Annotation instead of advice] • Voluntary refereeing to be a pleasure for referees E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  15. Author Library PhysicsDep. Group heads Referees ofother Univ. Referees of Learned Societies NationalLibrary Archiving Service X arXiv PhysDoc Scenario for Tomorrow: OAi Data and Service Providers including Vetting to Peer Reviewing DocumentMetadata Documents Reviews Metadata Multi-level Peer Rev. Data Provider Service Provider E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  16. The role of University Libraries • Be Oai-Database Provider of complete local Information • Assure free full text access of all research material • Assure correct metadata usage (by training or adding) • Do handshake with National Archives • Be Oai-Service Provider of specific fields at your university • Vetting system with the local department scientists • Train users to pick from the multitude of competing Oai-service Providers E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  17. Vetting at German Universities • University Publishing Network (Project) • Local vetting with department scientists and library • Peer reviewing between different universities • Shared functions (work flow system, marketing ...) • Separate functions (business model, financing ... ) E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  18. PhysNet, an international subject- specific service www.eps.org/PhysNet European Physical Society EPS controlled by its Action Commmittee on Publication and Scientific Communication E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  19. The Concept of PhysNet • Crawl across all worldwide distributed Physics Departments • Same Metadata as Math-Net [IMU, EPS] • Distributed Gatherers [locally allow/deny !!] • Distributed Brokers [no nation to dominate] • Agreements for an unbiased distributed system [Charter] • Distributed manpower [at present: 1 Mill. $/a] • Aims at all types of information E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  20. Status by 2001 • About 40 local, regional, national gatherers • Brokers at US, DE, Russia, Hungary, France, UK, DK, India, Japan, Australia, .., EPS [DFN-Project] • 40.000 documents and document lists • MyMetaMaker author tool to add DC:metadata [with Mathematics (IMU) and Physics (EPS).] • 20.000 page impressions per month . E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  21. Distributed Open Archive for Physics OAD Vision of the ultimate subject-specific Open Archive • All departments/Universities worldwide as prime, complete, open free repositories • Secondary virtual add-on services use these: • Quality filters • Collections • topical archives E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  22. Present incomplete realization • All worldwide departments • Few cooperate by local quality filters yet • Few comply with metadata (1000 of 40000 documents) PhysNet E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  23. Towards completeness of heterogenous document-bases • Free locally posted documents: PhysDoc • Free archived theses [Depts, Univs., DDB,..] • Free preprint repositories: ArXiv • Free fulltext journals • Free research lab docs: CERN, ANL, .. • University Publishers • Journals of Natl. Societies: APS, IoPP • Commercial journals E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  24. OAD Physics Project 2001 Oai compliant service provider for • PhysDoc [1.000 out of 40.000] • ArXiv • IoPP • [APS] • PhysDiss [European] • NDLDT [2001] • Cornell, CERN, MIT [Oai-compliant Document providers] (by 2001 show E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  25. Joint project VT-ISN • Funded under a new scheme jointly by NSF and DFG (German Science Foundation) • One application, one refereeing body, one funding scheme • Thus one team, one final intelligent Online service suited to be adapted to any language and any field. • Started: 1.March 2001 E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  26. Part III: Organizing international distributed repositories E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  27. The concept of the Open Archive Initiative OAi 1. Discussion (workshops, meetings, ..)2. Concept (free access, a multitude of data providers and service providers but one internationally to be accepted standard)3. Software and workforce sharing. • Three layers: E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  28. DataProvider Implementation 8. March 2001 Skim through Service Provider Implementation 13. April 2001, 11.30 am VT-time E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  29. Publisher Servers PhysDoc arXiv UniversityLibraries Servers(Google,.. Scenario of Tomorrow: Types of Searching – Retrieving offers • Competition by • quality of add-ons • level of refereeing • quality of contents • specialization • depth of search • size • comfort of retrieval • level of integration • local focus • ... E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  30. National activities to support the OAi • DINI German Initiative for Networked Information • similar to CNI • Cooperation between • Research Libraries (DBV), • Computer Centres (ZKI), • Media Centres (AMH), • Initiative of Learned Societies IuK • DINI´s Appeal to join the OAi (2000) • Training camps for German Oai-Implementation (2001) E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  31. Oai: Cooperation of repositories Oai lists • Data providers comply with Oai • Yes, if they are not service providers [Departments] • Yes, if they are free access providers [ArXiv] • Subtle, if national society publishers [APS, IoPP] • No, if commercial publishers [Elsevier,..] scirus Cut throat competition of service providers with best service for same documents Commercial publ. collect free access documents E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  32. Political and Funding Policy Effective services for research • Money to libraries per No of accessible documents • Multiple access ways [TibOrder vs others] • Regulations for hiring scientists to Universities • Funding selforganization of research communities • University publishers as regular prime research outlet • Fund IuK research to professionalize content search E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  33. Subject-specific National Port of Entry German Physical Society DPG plan • Cooperative project of partners [FIZ, TIB, ISN; KFP] Rescue boat syndrom? E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  34. International Networking • No bias policy: no single society allowed to dominate • Funding policy: each society finds ist own funds • Broker policy: democratic‘ network of brokers [DFN-Project] • Department cooperation: • Operator • Quality filters [select what to enter PhysNet] • Metadata for documents • Home page for document lists • University publishers (vetting and archiving) • National entry points for Oai. PhysNet Charter E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  35. For the Acceptance of a services : • Bottom up: just do it and spread the rumour • Top down : Charter of IMU, EPS • Engagement by registration: Institutions, Departments, Graduate Schools, Universities • 4. Joint international standards and cooperation • 5. Distributed work sharing (‚infinite workforce‘) • 6. Professionalism: • Scientists provide content and quality filtering • Departments with University Library distributes and retrieves • Universities set up research-oriented suitable infrastructure • Funding and politics to enable competition and effectiveness. E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  36. <<End of talk Duisburg>> E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  37. Beginn Steinbruch

  38. Scientist Did not know what services we were deprived of Librarian Assumed to know what services are good for science A dicussion in the train The young Elsevier...... did ask the scientists: „What new services are needed?“ E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  39. Libraries with different schemes (OPAC, PICA..) Multiple Publishers with a monopole on content and different schemes PhysDoc arXiv Searching - Retrieving in the past age Multiple costs for Providers SFXorMetaSearch CrossRef.Links Inconvenience for the user! E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  40. OAI _Identify E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  41. Any Oai Data Provider Harvest gatherer DC-converter SQL DB: MySQL Any Oai Service Provider OAI-Data Provider OAi Broker OAI-Harvester E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  42. Collections to be Represented in Oai-PhysDoc • PhysDoc: • Distributed document Database for Physics worldwide • using HARVEST as Retrieval mechanism • University document servers • North German Univ. superstructure • DissOnline.org Physics part • Physics part of NDLTD • Arxiv, MIT, .. Physics part E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  43. OAI Implementation • modified HARVEST holds SOIF and DC metadata in local text files • storage size no problem • decision to convert data offline and store structured data in SQL database (mysql) • use DC when possible, otherwise map SOIF to DC E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  44. OAI Implementation documents documents documents HARVEST SQL DB normalize metadata OAI Server E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  45. OAIImplementation • software written in PHP • protocol • easy because it uses modified implementation of HU Berlin • metadata converter • maps SOIF to DC • converts different DC representations to one common one E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  46. DC-Standards and Sets of OAi German National Library DC7 TheO-Duisburg OCLC-NDLDT Math-Net Worldwide: Int.Math.Union Phys-Net Worldwide: EPS Differences: Html 2 , 4 XML, rdf

  47. Advantages of the new Scenario Less work for the author! Immediate publication! Most e-prints free for the community! Lower costs for the library! Open multi-level peer reviewing! Easy integration of metadata into existing services! Less printed journals but more accessible e-publications! Value-added servicesby different providers! Non-exclusive rights for the publisher! OAi = „Napster for the Sciences“Richard Sietmann in: c´t 6/2001, S. 78 E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  48. Departments worldwide Physicists Harvest gatherer DC-converter SQL DB: MySQL SOIF DC Harvest Broker Marian Bypass OAI-Data Provider Marian for ranking OAI-Harvester OAi Broker E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  49. Future work • improve metadata converter • improve summarizers • closer look at different DC representations • tell people to use metadata • OAI workshops • ease production of metadata E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg

  50. PhysDep Linklist + Seachengine approved by National Societies businessmodel • administrational inform. • distributed gatherers: 26 • search depth: 2-full • acceptance 500/day 400/day PhysDoc publications distributed gatherers: 3 search depth: special E.R.Hilf, Institute for Science Networking, Germany: 3.9.2001 Duisburg 50

More Related