March 2015 – Today

Presentation Transcript


  1. March 2015 – Today
     • Updated hardware architecture
     • Cassandra
     • Hierarchical approach
     • Data lifecycle
     • CI deployment update
     • Team update

  2. Overall CI Architecture
     • Flexible network architecture
       • Decentralized network architecture with 4 P2P-connected sites (OMC-WHOI, OMC-OSU, OMC-SIO, CI-RU)
       • Independent administrative domains (e.g., IP spaces)
       • Supports both VPN and non-VPN configurations
     • Security
       • Short-term: traditional firewall system (iptables)
       • Long-term: latest-generation firewall appliances to consolidate security layers
     • External connections
       • Leverages institutional VPN infrastructure

  3. CI Architecture – Functional View

  4. CI Implementation: ILM infrastructure (diagram labels)
     • West Coast CI (currently at Pittock)
     • East Coast CI (RU)
     • Latest-generation network switch, firewall appliances, load balancer
     • Latest-generation network switch, firewall appliances
     • Mirroring: 10G over Internet 2
     • 2 PB SAN (shown twice)
     • Archive mirroring: external national archive (e.g., NODC)
     • Up to 10 PB tape library: last-tier/backup
     • Storage solution (distributed, multi-tiered, resilient)
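As a rough sanity check on the mirroring labels above, the time to move a full 2 PB SAN over a single 10G link can be estimated as follows. This is only a back-of-the-envelope sketch: it assumes decimal units (1 PB = 10^15 bytes), a single link, and a sustained-efficiency factor that is hypothetical rather than taken from the slides.

```python
def transfer_days(size_bytes: float, link_bits_per_s: float, efficiency: float = 1.0) -> float:
    """Days needed to move size_bytes over a link at a given sustained efficiency."""
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    seconds = size_bytes * 8 / (link_bits_per_s * efficiency)
    return seconds / 86_400  # seconds per day


PB = 10**15              # decimal petabyte (assumption)
TEN_GBPS = 10 * 10**9    # 10G link, in bits per second

full_rate = transfer_days(2 * PB, TEN_GBPS)                   # ideal line rate
half_rate = transfer_days(2 * PB, TEN_GBPS, efficiency=0.5)   # hypothetical 50% sustained

print(f"{full_rate:.1f} days at line rate, {half_rate:.1f} days at 50%")
# prints "18.5 days at line rate, 37.0 days at 50%"
```

Under these assumptions an initial full mirror takes weeks, not hours, which is consistent with the slides labelling the Internet 2 link "10G (multiple)".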

  5. UFrame-based OOINet Ecosystem – Architecture (currently) (diagram labels)
     • Network backbone (10G, upgradable to 100G)
     • Cassandra cluster, ~50 TB (estimated)
     • UFrame servers (i.e., Dell R810)
     • "Thin" nodes: 16 Xeon cores, 32 GB DRAM, 2x4 TB local disk (+ SSD?), 4x1G interconnect
     • "Fat" nodes: 16 Xeon cores, 128 GB DRAM, 2T RAID (disk), 1/10G interconnect
     • SAN (e.g., Compellent), up to 2 PB
     • Internet 2 link to mirror in West Coast CI (Pittock)
     • Internet link to OMCs
     • Tape library, up to 10 PB
     • Larger Information Lifecycle Management system (with West Coast)
     • Unattached labels: link speeds 1G, 10G, 10G (multiple); "On 1G network"; "faster"

  6. CI Hardware and Network Deployment Plan (as of last review)
     [√ marks appear on the original slide; their positions are not recoverable from the transcript]
     • RFP in process
       • 3 components: storage (ILM), compute, networking
       • Expected timeline: bidding process by ~Jul 31; award by ~Aug 15; deployment by ~Sept 15
     • Phase 1 – Minimal CI hardware
       • Essential compute, storage, networking, software to support construction, testing, commissioning
       • Leveraging existing capabilities
       • End-to-end high-bandwidth network connection operational
       • Ramping up personnel: 5 positions at interview stage
     • Phase 2 – Operational Intermediate O&M Hardware and Network (Target*: 03/31)
       • Functional compute, software; intermediate storage solution
       • Expect to hire 3-4 systems/software personnel
       • Data flow and copy from RSN to Rutgers + Antelope ORB at Rutgers
     • Phase 3 – Commissioning O&M CI (Target*: 09/15)
       • Full compute, software; redundant/reliable storage with backup solution
       • Full datacenter capability; West Coast re-deployed and operational
       • OOINet in production
       • Security solution deployed
     * Contingent on funding

  7. UFrame-based OOINet Ecosystem – Production (intermediate) architecture (diagram labels)
     • Network backbone (10G, upgradable to 100G)
     • Cassandra cluster, ~50 TB (estimated)
     • Cassandra: 6 nodes, up to 12; ~8-16 TB usable
     • UFrame servers (i.e., Dell R810)
     • Servers and network under support
     • "Thin" nodes: 16 Xeon cores, 32 GB DRAM, 2x4 TB local disk (+ SSD?), 4x1G interconnect
     • "Fat" nodes: 16 Xeon cores, 128 GB DRAM, 2T RAID (disk), 1/10G interconnect
     • SAN (e.g., Compellent), up to 2 PB
     • SAN: Compellent under support (~40 TB usable) + Synology
     • Internet 2 link to mirror in West Coast CI (Pittock)
     • Tunnel to West Coast established with existing Palo Alto
     • Internet link to OMCs
     • Tape library, up to 10 PB
     • Larger Information Lifecycle Management system (with West Coast)
     • Unattached labels: link speeds 1G, 10G, 10G (multiple); "On 1G network"; "faster"

  8. UFrame-based OOINet Ecosystem – Data Flow (1) (diagram labels)
     • Step (1): data acquisition, i.e., rsync from OMCs, video, etc.
     • UFrame servers
     • UI Server
     • PostgreSQL
     • EDEX
     • Ingestion Engines (e.g., 32)
     • Network backbone (10G, upgradable to 100G)
     • Cassandra cluster, ~50 TB (estimated)
     • SAN (e.g., Compellent), up to 2 PB
     • Internet 2 link to mirror in West Coast CI (Pittock)
     • Internet link to OMCs
     • Tape library, up to 10 PB
     • Larger Information Lifecycle Management system (with West Coast)
     • Unattached labels: link speeds 1G, 10G, 10G (multiple); "On 1G network"; "faster"

  9. UFrame-based OOINet Ecosystem – Ingestion Data Flow (1) (diagram labels)
     • Step (2): data ingestion
       • 2.1. EDEX calls IE
       • 2.2. Ingestion Engine returns JSON
       • 2.3. EDEX inserts into Cassandra
     • Step (3): data insertion (n threads)
     • UFrame servers
     • UI Server
     • PostgreSQL
     • EDEX
     • Ingestion Engines (e.g., 32)
     • Network backbone (10G, upgradable to 100G)
     • Cassandra cluster, ~50 TB (estimated)
     • SAN (e.g., Compellent), up to 2 PB
     • Internet 2 link to mirror in West Coast CI (Pittock)
     • Internet link to OMCs
     • Tape library, up to 10 PB
     • Larger Information Lifecycle Management system (with West Coast)
     • Unattached labels: link speeds 1G, 10G, 10G (multiple); "On 1G network"; "faster"
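The three ingestion steps on this slide (EDEX calls the Ingestion Engine, the engine returns JSON, EDEX inserts into Cassandra) can be sketched roughly as below. This is an illustration only, not the UFrame or EDEX code: the record format, the sensor names, and the functions `ingestion_engine`, `edex_ingest`, and `FakeStore` are all hypothetical, and an in-memory dictionary stands in for the Cassandra cluster.

```python
import json
from collections import defaultdict


def ingestion_engine(raw_record: str) -> str:
    """Hypothetical Ingestion Engine: parse 'sensor,time,value' and return JSON."""
    sensor, timestamp, value = raw_record.split(",")
    return json.dumps({"sensor": sensor, "time": float(timestamp), "value": float(value)})


class FakeStore:
    """In-memory stand-in for the Cassandra cluster, keyed by sensor name."""

    def __init__(self) -> None:
        self.rows = defaultdict(list)

    def insert(self, particle: dict) -> None:
        self.rows[particle["sensor"]].append((particle["time"], particle["value"]))


def edex_ingest(raw_records, engine, store) -> int:
    """Hypothetical EDEX-side loop covering steps 2.1 to 2.3; returns rows inserted."""
    inserted = 0
    for raw in raw_records:
        payload = engine(raw)               # 2.1 call the engine; 2.2 it returns JSON
        store.insert(json.loads(payload))   # 2.3 insert the parsed record into the store
        inserted += 1
    return inserted


store = FakeStore()
n = edex_ingest(
    ["ctd-01,0.0,12.5", "ctd-01,1.0,12.7", "adcp-02,0.0,0.31"],  # made-up records
    ingestion_engine,
    store,
)
```

In the slide's diagram the engines run as many instances (e.g., 32) and insertion uses n threads; the sketch keeps a single sequential loop so the call, return, and insert order stays visible.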

  10. UFrame-based OOINet Ecosystem – Data Query Data Flow (Use Case 1 – all in Cassandra) (diagram labels)
     • Step (1): user query (portal)
     • Step (2): SOA query
     • Step (3): Cassandra queries (n threads)
     • Step (4): response
     • User
     • UFrame servers
     • UI Server
     • PostgreSQL
     • EDEX
     • Stream Engines (e.g., 32); one labelled "Iterative Stream Engine"
     • Network backbone (10G, upgradable to 100G)
     • Cassandra cluster, ~50 TB (estimated)
     • SAN (e.g., Compellent), up to 2 PB
     • Internet 2 link to mirror in West Coast CI (Pittock)
     • Internet link to OMCs
     • Tape library, up to 10 PB
     • Larger Information Lifecycle Management system (with West Coast)
     • Unattached labels: link speeds 1G, 10G, 10G (multiple); "On 1G network"; "faster"
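The query path on this slide fans one SOA query out into several Cassandra queries on n threads and then returns a single response. A minimal sketch of that fan-out and merge pattern follows, using only the Python standard library. The partition names, the data, and the functions `stream_engine_query` and `soa_query` are hypothetical stand-ins; nothing here reflects the actual Stream Engine interface.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory partitions standing in for data held in Cassandra.
PARTITIONS = {
    "p0": [(0.0, 12.5), (1.0, 12.7)],
    "p1": [(2.0, 12.9), (3.0, 13.1)],
    "p2": [(4.0, 13.0)],
}


def stream_engine_query(partition: str, t_start: float, t_end: float):
    """Stand-in for one Stream Engine issuing one Cassandra query."""
    return [(t, v) for t, v in PARTITIONS[partition] if t_start <= t <= t_end]


def soa_query(t_start: float, t_end: float, n_threads: int = 4):
    """Fan the query out over all partitions in parallel, then merge by time."""
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        chunks = list(
            pool.map(lambda p: stream_engine_query(p, t_start, t_end), PARTITIONS)
        )
    # Merge the per-partition results into one time-ordered response.
    return sorted(row for chunk in chunks for row in chunk)


result = soa_query(1.0, 3.0)
print(result)  # prints [(1.0, 12.7), (2.0, 12.9), (3.0, 13.1)]
```

Threads suit this pattern because each sub-query mostly waits on I/O; the merge step is where the single response in step (4) is assembled.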

  11. Cassandra-based Architecture Issues
     • UFrame performance issues with Cassandra
     • Data lifecycle management
     • Hierarchical approach
     • Mechanisms
     • Policies
     • Video and hydrophone data
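The slide separates mechanisms from policies for hierarchical data lifecycle management without giving details. One way to picture that split is a small placement policy over the three storage tiers named on earlier slides (Cassandra, SAN, tape). Everything specific below is an assumption made for illustration: the age-based criterion, the 90-day and 730-day cut-offs, and the names `TierRule` and `choose_tier` do not come from the presentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TierRule:
    name: str
    max_age_days: float  # data no older than this belongs in this tier


# Hypothetical policy: the cut-offs are illustrative, not from the slides.
POLICY = [
    TierRule("cassandra", 90),
    TierRule("san", 730),
    TierRule("tape", float("inf")),  # catch-all last tier
]


def choose_tier(age_days: float, policy=POLICY) -> str:
    """Mechanism: return the first tier whose age limit covers the data."""
    if age_days < 0:
        raise ValueError("age_days must be non-negative")
    for rule in policy:
        if age_days <= rule.max_age_days:
            return rule.name
    raise ValueError("policy has no catch-all tier")


print(choose_tier(10), choose_tier(365), choose_tier(5000))
# prints "cassandra san tape"
```

Keeping the policy as data and the mechanism as a function means the cut-offs can change without touching the placement code, which is the point of treating the two separately.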

  12. Other CI Issues
     • Long-term data archival
       • Working with NODC
     • Federated CI identity management
       • Leverage existing XSEDE/NCSA infrastructure
     • Security audits
       • Working with Center for Trustworthy Scientific Cyberinfrastructure (CTSC) – http://trustedci.org
     • Monitoring capabilities
       • Performance, availability and communications pathways
     • CI evaluation
       • Quality of service (exclusive of scheduled downtime)
       • Network bandwidth utilization (e.g., prioritize non-user facing functionality when bandwidth utilization reaches >85%)
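The bandwidth example on this slide (change priorities once utilization passes 85%) amounts to a threshold rule. The sketch below follows the slide's wording literally, so non-user-facing traffic is served first above the threshold. The ordering below the threshold, the traffic-class names, and the function names are assumptions added for illustration.

```python
THRESHOLD = 0.85  # from the slide: act when utilization exceeds 85%


def utilization(used_bps: float, capacity_bps: float) -> float:
    """Fraction of link capacity currently in use."""
    if capacity_bps <= 0:
        raise ValueError("capacity_bps must be positive")
    return used_bps / capacity_bps


def priority_order(used_bps: float, capacity_bps: float, threshold: float = THRESHOLD):
    """Return traffic classes in service order for the current load."""
    if utilization(used_bps, capacity_bps) > threshold:
        # Above the threshold, follow the slide: non-user-facing work first.
        return ["non_user_facing", "user_facing"]
    # Below the threshold the slide gives no rule; this ordering is an assumption.
    return ["user_facing", "non_user_facing"]


busy = priority_order(9e9, 10e9)    # 90% of a 10G link
quiet = priority_order(4e9, 10e9)   # 40% of a 10G link
```

A production rule would likely smooth utilization over a time window before switching, to avoid flapping near the threshold; that refinement is left out here.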

  13. Risks
     • Timelines
     • Production readiness
     • Beyond functionality
     • Community acceptance

  14. Operation – Systems Team