Data Integration Checklist



Similar documents
ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Integrating data in the Information System An Open Source approach

Integrating Ingres in the Information System: An Open Source Approach

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

Ganzheitliches Datenmanagement

BUSINESSOBJECTS DATA INTEGRATOR

Roadmap Talend : découvrez les futures fonctionnalités de Talend

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Why All Enterprise Data Integration Products Are Not Equal

Data processing goes big

Service Oriented Data Management

The Future of Data Management

Luncheon Webinar Series May 13, 2013

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Data Virtualization and ETL. Denodo Technologies Architecture Brief

Integrating Salesforce Using Talend Integration Cloud

What does SAS Data Management do? Why is SAS Data Management important? For whom is SAS Data Management designed? Key Benefits

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

SOA REFERENCE ARCHITECTURE: SERVICE TIER

Oracle Data Integration: CON7926 Oracle Data Integration: A Crucial Ingredient for Cloud Integration

WHITE PAPER. Data Migration and Access in a Cloud Computing Environment INTELLIGENT BUSINESS STRATEGIES

IBM WebSphere Cast Iron Cloud Integration for JD Edwards EnterpriseOne

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Informatica PowerCenter Data Virtualization Edition

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

IBM WebSphere Cast Iron Cloud integration

Databricks. A Primer

WebSphere Cast Iron Cloud integration

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

AtScale Intelligence Platform

IBM WebSphere Cast Iron Cloud integration

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

BUSINESSOBJECTS DATA INTEGRATOR

JOURNAL OF OBJECT TECHNOLOGY

Oracle Warehouse Builder 10g

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Informatica and the Vibe Virtual Data Machine

Information Architecture

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Databricks. A Primer

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Analance Data Integration Technical Whitepaper

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Data Virtualization A Potential Antidote for Big Data Growing Pains

Big Data & Analytics Reference Architecture

IBM WebSphere Cast Iron Cloud integration

SAP Data Services 4.X. An Enterprise Information management Solution

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Accelerate Data Loading for Big Data Analytics Attunity Click-2-Load for HP Vertica

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Analance Data Integration Technical Whitepaper

IBM WebSphere Cast Iron Cloud integration

OWB Users, Enter The New ODI World

Market Overview: Big Data Integration

IBM Cognos 8 Business Intelligence Reporting Meet all your reporting requirements

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

ORACLE DATA INTEGRATOR ENTEPRISE EDITION FOR BUSINESS INTELLIGENCE

Microsoft Big Data. Solution Brief

Jitterbit Technical Overview : Microsoft Dynamics CRM

Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look

Oracle Data Integrator Technical Overview. An Oracle White Paper Updated December 2006

Ultimus Adaptive BPM Suite V8

Lofan Abrams Data Services for Big Data Session # 2987

TimeScapeTM EDM + The foundation for your decisions. Risk Management. Competitive Pressures. Regulatory Compliance. Cost Control

Combining new technologies: SAP Cloud for Sales and HANA Cloud Integration at Cavalier

Big Data Architectures. Tom Cahill, Vice President Worldwide Channels, Jaspersoft

What's New in SAS Data Management

DATA REPLICATION FOR REAL-TIME DATA WAREHOUSING AND ANALYTICS

Jitterbit Technical Overview : Salesforce

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

Data Integrity and Integration: How it can compliment your WebFOCUS project. Vincent Deeney Solutions Architect

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

Building a Data Warehouse

Decision Ready Data: Power Your Analytics with Great Data. Murthy Mathiprakasam

Ad Hoc Analysis of Big Data Visualization

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

SAS Business Intelligence Online Training

Managing Data in Motion

A roadmap to enterprise data integration.

By Makesh Kannaiyan 8/27/2011 1

White Paper. Unified Data Integration Across Big Data Platforms

Unified Data Integration Across Big Data Platforms

Integrating Netezza into your existing IT landscape

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Applying OpenText Integration Center to Information Integration for the ECM Suite. Enterprise Data Archiving

Centerprise Data Integrator

An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture

BIG DATA TRENDS AND TECHNOLOGIES

Reporting component for templates, reports and documents. Formerly XML Publisher.

Enterprise Data Integration

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain

Independent process platform

Transcription:

The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media feeds, there is a significant benefit to share and reuse information instead of having duplicate processes and silos of information. It is also important to select a solution that can address all your data integration needs, whether it be data integration, data migration, big data integration, data warehouse integration, or integration with business intelligence systems. The following checklist provides key functional requirements for implementing and deploying data integration in an enterprise environment. Use the list to validate and prioritize your needs. Included Connect and Deliver Connect to Traditional Data Sources Description Connect to data stored in relational databases, OLAP applications, non-relational structures like flat files, XML, common packaged applications like SAP, cloud-based applications such as salesforce.com, semi-structured (e.g Excel) data, unstructured (e.g. audio, video) data, and messaging systems. Support for industry standards like EDI. Connect to Big Data and NoSQL Integration with big data technologies (e.g. Hadoop, Hbase, Hive), big data platforms (e.g. Cloudera, Hortonworks, MapR) and NoSQL databases (e.g. MongoDB, Cassandra) Data Movement The ability for data consumers to receive data in many ways. Support bulk data movement, data services, data federation, change data capture (CDC) and direct data replication between data sources. Data Synchronization Support Extract Transform and Load (ETL) and Extract Load and Transform (ELT), real-time delivery, and event-driven delivery (trigger or changed data). 2

Transformation Simple Transformations Such as calculations, data type conversions, string manipulations, aggregations, automatic lookup and replace operations. Advanced Transformations Such as slowly changing dimensions, normalization of data, advanced parsing capabilities and transformation to complex standards (EDIFACT, HL7, and others) Custom Transformations Ability to create new custom transformations, as well as extend existing transformations. Enrichment Solution should have the capability to use enrichment data from a wide variety of sources. Enrichment data might come in various file formats and schemas both internal and external. It may come from online sources through service APIs, commercial partners or data providers. Development and Data Modeling Single Product Support for all data delivery and integration operations, from connect, transform and load, via a single product Graphical Tooling Easy-to-use, graphical, drag-and-drop tools to build processes and transformations, and design data models, metadata and data flows. Graphical representation of objects and connectors. Wizards to automate common tasks. Business Model Tooling A non-technical tool that enables collaboration between technical and business users to structure all relevant documentation and technical elements supporting the data integration process. Data Model Creation Ability to create and maintain data models. Use graphical 3

and Management tools to define relationships. Metadata Management Provides automated discovery of metadata. Ability to search metadata across multiple sources and show its lineage. Use a single repository of metadata across all product features, with the ability to seamlessly share and synchronize metadata between data integration tools and other tools (e.g. data quality, data profiling and master data management). Business Rules and Workflow The ability to define and manage business rules and execution flows. Process execution can be scheduled immediately, at a set time, or based on an event. Versioning Developers can easily version metadata, routines, processes, transformations or any other object used in the integration process. Then have the ability to see changes and roll-back to a prior version if necessary. Collaboration A set of tools for each user, i.e. business users, developers, and IT operations staff; and a shared repository consolidating all project information and enterprise metadata shared by all stakeholders. Testing, Debugging and Tuning Tools to test processes with data in the graphical tool, then interactively debug and tune for optimum performance. Impact Analysis Use graphical tools to compare processes, assess the impact of change and view data lineage to see where changes occurred. Standards Support To facilitate ramp-up time and leverage existing resources, products should embrace standards such as Eclipse, Java, JDBC, ODBC, and Web services. Reusability Should be able to reuse projects, metadata processes, cleansing, validation, enrichment and other highly used 4

routines in a fast and easy manner. Customizable Generated artifacts can be customized for maximum flexibility. Ability to create your own custom components. Easy to customize and extend transformations. Data Governance Integration with data quality tools Integrated functionality with tools that profile and cleanse data, parse and standardize data, and match, merge and identify duplicate records to then be rationalized based on your requirements. The ability to define business rules to be applied to data. Integration with data profiling tools Integrated functionality with tools that do column-based analyses, dependency analyses, trend analyses and custom analyses. Integration with MDM tools Integrated functionality or out-of-the-box integration with tools to create a unified view of information and manage that master view over time. Reports and Dashboards Deploy Multi-Platform Runtime Support Pre-built and customizable reports that show key data quality metrics over time. Provide the ability to export results in a variety of formats including XML, PDF, HTML, etc. Provide a dashboard (web-based) reporting system of data quality metrics and provide metadata to business intelligence (BI) systems. Ability to seamlessly deploy to Unix-based, Linux-based and Windows systems. Ability to run on-premises and in the Cloud and virtualization environments. Ability to run in big data (MapReduce) distributed processing environments. Ideally generates code for portability and 5

performance. Load Balancing and Scalability Clustering capabilities to spread server load over several machines. The ability to handle very large data volumes, working with big data and multi-terabyte data warehouses. Failover Ability to rollback a transaction and continue processing if there is a server failure without losing data. Remote Execution Ability to run processes remotely on various operating systems using the same configuration Data Integration Services Ability to deploy all aspects of run-time functionality as services within a service-oriented architecture. Middleware Compatibility Integrated functionality with MOM and ESB systems Hadoop Support Deploy native MapReduce jobs directly to a cluster with no needed appliances or additional software installed on the cluster. Have the ability to scale MapReduce processes with the cluster without code changes. Monitor and Manage Centralized Administration Ability to monitor and manage all resources and deployments from one location. Web-based Monitoring Ability to monitor resources and deployments from any browser. Reports and Dashboards Pre-built and customizable reports that show key data integration metrics. The dashboard shows information and statistics over time, e.g. performance, load volumes, subtask individual metrics such as database read and rights or enrichment service response times. 6

Exception Reporting and Management Ability to define, report and handle exceptions when they occur. Capability to invoke special processes when violations to data integration rules. Examples include an e-mail alert, text message, or halting a process. Security Controls A mechanism to secure in-flight messages between applications as well as user/role-based security in the tool itself, LDAP. Business User Interaction Solution should provide an easy-to-use environment for business users to follow the key performance indicators for data integration, e.g. PDF reports and web-based portals. Cloud Support Ability to setup, deploy, and shutdown a cloud instance, e.g. Amazon EC2. Enables you to expand your computing capacity for your integration processes. Support Comprehensive Support Provides the support you need when you need it, e.g. community forums, web knowledge-base, email support, and phone support. 24x7 mission critical support. SLAs for response time, bug fixes and maintenance updates. Training Classroom, Online, and On-demand training for newbies, advanced developers and administrators. Professional Services Vendor offers a complete spectrum of consultative services: assessment, strategy and architecture, quickstart, design and development, tuning, technical audit, and custom offerings. 7

Talend Data Integration Talend Data Integration provides an extensible, highly-scalable platform to access, transform and integrate data from any business system in real time or batch to meet both operational and analytical data integration needs. With over 800 connectors, Talend connects natively to databases, packaged applications (ERP, CRM, etc.), SaaS and Cloud applications, mainframes, files, Web services, data warehouses, data marts, and OLAP applications. Talend 2014 8