1 / 19

Data services and computing

Data services and computing. Computing reality. We tend to be dealt the computing environment in which we must operate. Few of us have enough influence to steer the direction of central computing on our campus. Thus, we try to match our computing needs with the resources provided locally.

Télécharger la présentation

Data services and computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data services and computing

  2. Computing reality • We tend to be dealt the computing environment in which we must operate. • Few of us have enough influence to steer the direction of central computing on our campus. • Thus, we try to match our computing needs with the resources provided locally.

  3. Match to level of service Develop a computing strategy that identifies • the hardware, • the software, and • the network connectivity needed to support the level of data service you are providing now and will be in the near future.

  4. Strategic factors • Desktop computing power • hardware • fastest affordable processor • largest affordable hard drive • largest affordable monitor • removable media drives support

  5. Strategic factors • Desktop computing power • the software it should support • metadata tools • statistical software • network tools • compression utilities

  6. Strategic factors • Large quantities of disk space on a fast system • uncompressing files • unpacking files • package and compress files

  7. Strategic factors • Access to at least one fast processing machine with statistical software • powerful Unix workstation • handle larger-scale problems

  8. Strategic factors • Mass storage that supports multiple-user access to files and preferably multiple-system access • distributed file system • institutional repository

  9. Strategic factors • Support software for data services • statistical packages • metadata support • communication tools • web tools • blogs/wikis • training tools

  10. Strategic factors • Network connectivity permitting high-speed file transfers • off and on campus transfer • may require using services elsewhere on campus

  11. Implementation strategy • System administration takes a lot of time! • think twice about becoming your own system administrator

  12. Implementation strategy • Purchase compatible computing equipment • to receive maintenance support • simplifies the sharing of peripheral devices

  13. Implementation strategy • Investigate local computing support • possibly a centralized high performance computing service or compute grid • site licenses for software

  14. Implementation strategy • Align with local institutional repository services and digital preservation initiatives • introduce data to the planning of your institutional repository

  15. Data infrastructure models • Data centres • The data centre is part of the instrumentation infrastructure. • e.g., the Large Hadron Collider • Data repositories • The repository is part of a specific institution’s larger stewardship mandate for digital resources. • e.g., Odesi in Scholar’s Portal • Domain archives • Domain archives are institutions established explicitly to preserve and provide access to a domain’s data. • e.g., the ICPSR and the UKDA

  16. Emerging data infrastructure

  17. Examples

  18. health physics soc sci biology astro clinical data LHC data biology data astronomy data scientific data infrastructure computing/data grid infrastructure GÉANT network infrastructure Data as an infrastructure ICPSR data Source: Mário Campolargo Open Grid Forum Barcelona, 3 June 2008 source: eSciDR study (adapted)

  19. Information Authenticity Quality Longevity Collections: data, work-flows, publications, learning materials, etc. Repositories services Ease of use Availability Reliability Deposit, annotation, delivery, visualisation, search, help, etc Trusted Open Well managed Repositories Repository management, curation, physical security, etc Standardised Stable Flexible Access Authentication, authorisation, logical security, federation, portals, etc Transparent Responsive Informed Management Grids, Virtual Organisations, etc Physical infrastructure Available Scaleable Reliable Networks, computing, HPC, physical storage, etc e-Infrastructure for repositories e-Infrastructure of repositories e-Infrastructure for repositories Source: Mário Campolargo Open Grid Forum Barcelona, 3 June 2008 source: eSciDR study (adapted)

More Related