1 / 31

Talend

Talend. first provider of open source data integration software. Contents. Products Talend Open Studio Talend Integration Suite Talend On Demand Solutions Operational data integration Data migration Data synchronization ETL for Business Intelligence and Data Warehousing.

gvang
Télécharger la présentation

Talend

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Talend first provider of open source data integration software

  2. Contents • Products • Talend Open Studio • Talend Integration Suite • Talend On Demand • Solutions • Operational data integration • Data migration • Data synchronization • ETL for Business Intelligence and Data Warehousing

  3. Products • Talend Open Studio The most open, innovative and powerful data integration solution • Talend Integration Suite The first Open Source enterprise data integration solution;supports the tough requirements of enterprise development, and scales to the highest levels of data volumes and process complexity • Talend On Demand The industry's first data integration Software as a Service (SaaS), Talend On Demand consolidates TOS metadata and project information in an online, shared repository hosted by Talend.

  4. TOS Functions • ETL (Extract, Transform, Load) for business intelligence and data warehousing; • real-time or near-real-time synchronization of applications; • migration of data; • loading of data; • many other tasks related to data movement and transformation.

  5. Prerequisites GDI JVM 1.5+ Perl database client software for bulk component Download and install Register, login and first project Installation of TOS

  6. GUI Talend Open Studio window is composed of the following elements: • Tools bar and menus • Repository • Graphical workspace -- flowcharting editor • Properties -- Various configuration views in a tab system • Outline view and Code Viewer Repository

  7. Feature (1) Business modeling • providing an easy-to-understand, non-technical view of a business workflow • Systems, connections, steps and requirements are all designed using standardized workflow notation through an intuitive graphical toolbox

  8. Designing a Business Model ---- Objectives • draw your business needs • create and assign numerous repository items to your model objects • define appearance properties of your model objects.

  9. Feature (2) Graphical development • providing both a graphical and a functional view of integration processes • Component Library— a graphical palette of components and connectors includes over 200 components and connectors, providing basic functions such as mappings, transformations, and lookups;specialized functions such as data filtering, data multiplexing, or ELT,etc.

  10. Designing a Job ---- Objectives • put in place actions in your job design using a library of technical components. • change the default setting of components or create new components to match your needs. • set connections and relationships between components in order to define the sequence and the nature of actions • access code at any time to edit in Perl or document your job components. • create and add items to the Repository for reuse and sharing purposes .

  11. Feature (3)Metadata-driven design and execution • a metadata-driven solution— all metadata is stored and managed in a centralized Metadata Repository, shared by all the modules • Metadata Repository stores all project information : business models; integration jobs; results of their execution

  12. Feature (4)Real-time debugging • powerful debugging and tuning features that allow the real-time tracking of data flowing • activate a trace mode — display row-by-row behavior and show the result of the transformations

  13. Feature (5)Robust execution • TOS dynamically distributes the processing across a grid of systems – based on their available capacity and leverages available resources, regardless of their nature. • TOS is the only data integration solution that leverages both the traditional ETL (Extract-Transform-Load) approach as well as the ELT (Extract-Load-Transform) approach.

  14. Key benefits of TOS(1) • A business view of the integration processes. • Maximize productivity and ease of use through an advanced drag-and-drop interface. • High performance and robustness with industry standard languages generation, grid deployment and support of both ETL and ELT architectures.

  15. Key benefits of TOS(2) • Increased versatility through connectivity to all business applications, databases, files, Web Services, etc. • Consistency and reusability of developments, as well as maintenance facilitated by the project repository. • Ability to analyze errors and trace bottlenecks in the integration processes.

  16. Talend Integration Suite • Teamwork and consolidation of developments • Management of complex deployments • Execution monitoring • Technical support

  17. Talend On Demand

  18. SaaS benefits • the same level of flexibility to both local and distributed project teams • enforce strict security for the project metadata • does not require any configuration or administration

  19. What’s stored in the repository • Connection information and data structures for source and target systems • TOS Business Models • Integration Jobs • Administrative information

  20. 3 steps

  21. Data Integration Solutions • Operational data integration Data migration/loading and data synchronization/replication are the most common applications of operational data integration • ETL for Business Intelligence and Data Warehousing

  22. Data migration • Purpose: transfer existing data to a format suitable for the new system • Challenges: high volumes of data heterogeneous environments consistency

  23. Solutions for Data migration • Business-oriented process modeling that involves business stakeholders and ensures proper coordination during the migration of business data and processes. • Fully graphical development environment that improves productivity, makes it easy to perform dry-runs, and allows reusability of data mappings and transformations for synchronization processes . • Highly scalable and fast execution platform with a grid approach that enables the processing of data close to its source and target, using both the ETL and ELT approaches for shorter downtimes. • Broadest connectivity to support all source and target systems

  24. Data Synchronization • Purpose: maintain the consistency of data contained in several applications, databases or systems. • Challenges : • real time • heterogeneous • conflicts of data

  25. Solutions for Data Synchronization • Business-oriented process modeling that involves business stakeholders and ensures proper mapping of data integration processes to business processes. • Fully graphical development environment that greatly improves productivity, facilitates maintenance and ensure reusability of data mappings and transformations. • Highly scalable and fast execution platform with a grid approach that enables the processing of data close to its source and target. • Broadest connectivity to support all source and target systems

  26. ETL for Analytics • ETL (Extraction, Transformation and Loading) processes are the most critical – and value added – components of a Business Intelligence infrastructure • Challenges : • Data volumes are growing exponentially • disparity of sources • BI structures and applications have different data transformation requirements • Transformations are highly complex • real-timeliness

  27. Solutions for ETL • Business-oriented process modeling that involves business stakeholders and ensures proper communication between IT and lines of business • Fully graphical development environment that greatly improves productivity and facilitates maintenance • Highly scalable and fast execution platform that leverages a grid of commodity hardware, and the only solution to support the dual ETL + ELT architecture. • Broadest connectivity to support all systems and get access to all the production data and easily add new source systems • Built-in advanced components for ETL, including string manipulations, Slowly Changing Dimensions, automatic lookup handling, bulk loads support, etc.

  28. ETL工具评价标准 • 对平台的支持 • 对数据源的支持 • 数据转换功能 • 管理和调度功能 • 集成和开放性 • 对元数据管理

  29. ETL工具 • 专业ETL厂商和产品 Ascential: DataStageXE Sagent: DataFlow Informatica: PowerCenter • 整体方案提供商和产品 Oracle: Warehouse Builder IBM: Warehouse Manager Microsoft: Data Transformation Services (DTS) • 开源产品 kettle -- Pentaho Data Integration Talend Open Studio

  30. Correlative Links • download user documentation: http://www.talend.com/resources/documentation.php • community tools on Talend's Forge such as Forum, Bugtracker, Wiki,Ecosystem (to share components and connectors), etc. www.talendforge.org • Talend On Demand for sharing your project repository in a SaaS mode http://www.talend.com/talend-on-demand/talend-on-demand.php • Talend Integration Suite for enterprise scale development,deployment and monitoring http://www.talend.com/products-data-integration/talend-integration-suite.php • Technical Support on our products http://www.talend.com/professional-support/support.php

More Related