270 likes | 292 Vues
Distributed Data Management. Graeme Kerr Oracle in R&D Programme. Distributed Data Management. Provides access to data when and where you need it Optimizes information access regardless of physical location of data. Enables virtualisation of data in Grids. Types of Data:
E N D
Distributed Data Management Graeme Kerr Oracle in R&D Programme
Distributed Data Management • Provides access to data when and where you need it • Optimizes information access regardless of physical location of data. • Enables virtualisation of data in Grids. • Types of Data: • Flat Files, Relational. • Structured & unstructured. • Text, Binary, proprietary. • Scalar, user defined typed, XML, RDF • Images, DICOM, LOB, Spatial. • High Volumes.
Methods of Data Distribution • Distribute • Federate • Consolidate
Distribute Information • Share information between users, applications, databases, organisations and partners. • Move or copy / replicate information as needed • Traditionally implemented as replication or message queuing • Wide range of technology solutions to achieving distribution. • Continuous feed and periodic refresh capabilities are key requirements. • Common within Scientific Grid Approach • Works well when replicas are read-only. • Considerable issues when replicas are updateable.
Data Distribution - Example Continuous Feeds SOURCE SITE REPLICA SITES
Data Distribution - Example Periodic Refresh Continuous Feeds SOURCE SITE REPLICA SITES
Data Distribution - Example Transportable Tablespaces, RMAN, Data Pump Oracle STREAMS SOURCE DATABASE REPLICA DATABASES
TRIUMPF Gridka ASGC SARA IN2P3 CNAF RAL BNL PIC Nordu CERN LCG Example Offline DB @ CERN IT Grid Online DB @ Detector
Methods of Data Distribution • Distribute • Federate • Consolidate
Federate Information • Federate data in multiple data stores into a single virtual database • Access both Database and non-Database data • Access both structured and unstructured data • Hide physical location of data from applications • Vendor and open-source solutions. • Used mostly as point solution within Commercial World rather than as a key design point.
Data Federation - Example Excel DB2 Web Services
Data Federation - Example Excel DB2 Flat Files XML Text Web Services Flat Files Sybase
Data Federation - Example Excel DB2 Flat Files XML Text Web Services Flat Files Sybase
Data Federation - Example Excel DB2 Flat Files XML Text Web Services Flat Files Sybase
Methods of Data Distribution • Distribute • Federate • Consolidate
Consolidate Information • Consolidate heterogeneous data into a central pool. • Single database or pool of databases • Any data (audio, video, XML, email, messages, etc…) • Any platform (Linux, Windows, Solaris, HPUX, AIX, Tru64, OpenVMS, OS/390) • Proven scalability • Hundreds of Very Large Database customers • Industry leading benchmarks • Very common in commercial world.
Typical Data Centre View DB Users DB Clusters Disk
Consolidated Clusters with RAC & ASM DB Users DB Cluster Pool Disk Pool
DB Server & Disk Virtualisation DB Users DB Cluster Pool Disk Pool
DB Server & Disk Virtualisation DB Users DB Cluster Pool Disk Pool
DB Server & Disk Virtualisation DB Users DB Cluster Pool Disk Pool
Distributed Data Management : Summary • Oracle supports efficient utilization of all resources: • Consolidate Information—Scalable access to large amounts of information • Federate Information—Data can be located where needed but accessed as single database • Share Information—Multiple users and applications can share data/information to operate more effectively • Information Integration & fast connectivity critical to Grid Computing
Oracle Database Versions. • Database 10G Express Edition • Entry-Level, small footprint, free to develop, deploy & distribute. • 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva • Database 10G Enterprise Edition • Flagship, Full-featured version. • Database 10G Standard Edition • Limited featured version, RAC, ASM, Streams Apply supported • Database 10G Standard Edition 1 • SE on limited hardware. Streams Apply supported. • Data Grid / Distributed Caching • TimesTen • TangoSol Coherence
Oracle Database Versions. • Database 10G Express Edition • Entry-Level, small footprint, free to develop, deploy & distribute. • 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva • Database 10G Enterprise Edition • Flagship, Full-featured version. • Database 10G Standard Edition • Limited featured version, RAC, ASM, Streams Apply supported • Database 10G Standard Edition 1 • SE on limited hardware. Streams Apply supported. • Data Grid / Distributed Caching • TimesTen • TangoSol Coherence
For More Information Database: www.oracle.com/database/index.html Express Edition: www.oracle.com/technology/products/database/xe/index.html Tangosol: www.oracle.com/tangosol/index.html Times 10: www.oracle.com/database/timesten.html or http://search.oracle.com Database | Tangosol | etc
EMEA Oracle in R&D Programme Graeme.Kerr@oracle.com Monica.Marinucci@oracle.com