#TalendSandbox for Big Data

Similar documents
Talend Big Data. Delivering instant value from all your data. Talend

Roadmap Talend : découvrez les futures fonctionnalités de Talend

HDP Hadoop From concept to deployment.

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

HDP Enabling the Modern Data Architecture

A Modern Data Architecture with Apache Hadoop

The Future of Data Management with Hadoop and the Enterprise Data Hub

Upcoming Announcements

Modern Data Architecture for Predictive Analytics

Big Data: Making Sense of it all!

Big Data Realities Hadoop in the Enterprise Architecture

Comprehensive Analytics on the Hortonworks Data Platform

Bringing Big Data to People

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Dominik Wagenknecht Accenture

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Information Builders Mission & Value Proposition

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

Workshop on Hadoop with Big Data

Deploying Hadoop with Manager

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

The Future of Data Management

Self-service BI for big data applications using Apache Drill

Next Gen Hadoop Gather around the campfire and I will tell you a good YARN

Big Data Advanced Analytics for Game Monetization. Kimberly Chulis

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Data Security in Hadoop

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

WHITE PAPER. Four Key Pillars To A Big Data Management Solution

Hadoop Ecosystem B Y R A H I M A.

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

HADOOP VENDOR DISTRIBUTIONS THE WHY, THE WHO AND THE HOW? Guruprasad K.N. Enterprise Architect Wipro BOTWORKS

Apache Hadoop: The Big Data Refinery

Cloudera Enterprise Data Hub in Telecom:

Unified Batch & Stream Processing Platform

Luncheon Webinar Series May 13, 2013

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Native Connectivity to Big Data Sources in MSTR 10

Peers Techno log ies Pv t. L td. HADOOP

Hadoop, the Data Lake, and a New World of Analytics

Apache Hadoop's Role in Your Big Data Architecture

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Self-service BI for big data applications using Apache Drill

Ganzheitliches Datenmanagement

Big data for the Masses The Unique Challenge of Big Data Integration

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Modernizing Your Data Warehouse for Hadoop

Please give me your feedback

Qsoft Inc

Getting Started with Hadoop. Raanan Dagan Paul Tibaldi

Apache Hadoop: Past, Present, and Future

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Talend Open Studio for Big Data. Release Notes 5.2.1

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Saving Millions through Data Warehouse Offloading to Hadoop. Jack Norris, CMO MapR Technologies. MapR Technologies. All rights reserved.

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Why Spark on Hadoop Matters

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Hortonworks Data Platform for Hadoop and SAP HANA

Apache Hadoop: The Pla/orm for Big Data. Amr Awadallah CTO, Founder, Cloudera, Inc.

Talend Big Data Sandbox

Hadoop 101. Lars George. NoSQL- Ma4ers, Cologne April 26, 2013

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

Data Integration Checklist

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

Data processing goes big

Community Driven Apache Hadoop. Apache Hadoop Basics. May Hortonworks Inc.

Hadoop Introduction. Olivier Renault Solution Engineer - Hortonworks

Hadoop Trends and Practical Use Cases. April 2014

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Big Data Management and Security

Market Overview: Big Data Integration

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Microsoft SQL Server 2012 with Hadoop

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

SQL on NoSQL (and all of the data) With Apache Drill

MySQL and Hadoop. Percona Live 2014 Chris Schneider

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer,

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

The Enterprise Data Hub and The Modern Information Architecture

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

The Next Wave of Data Management. Is Big Data The New Normal?

Bringing the Power of SAS to Hadoop. White Paper

Transcription:

Evalua&on von Apache Hadoop mit der #TalendSandbox for Big Data Julien Clarysse @whatdoesdatado @talend 2015 Talend Inc. 1

Connecting the Data-Driven Enterprise 2

Talend Overview Founded in 2006 BRAND AWARENESS 400 employees in 7 countries VIBRANT COMMUNITY CUSTOMER LOYALTY MONETIZATION Dual HQ in Los Altos, CA and Paris, France Open Core business model SubscripBon license Services & training 2007 2008 2009 2010 2011 2012 2013 2014 3

Key messages Talend helps data- driven companies get successful Easiest and fastest computa&on with native code genera&on Open source state- of- the- art technology standards Start small / think Big with future- proof unified architecture Predictable investment with lowest TCO 4

The Power of Hadoop 2015 Talend Inc. 5

Hadoop Strategy: Divide and Conquer Monolithic Parallel Hugely expensive Precious engineering Single points of failure Geographically isolated Commodity hardware Self- healing, redundant Geographically distributed 6

Example of Hadoop Use Cases Big Data - Velocity - Variety - Volume Analysis - Social media - Customer/Clickstream - Opera&onal/Server Logs - Fraud & Compliance - Sensor/Machine - Geographic - etc. value Page 7 7

Hadoop inser&on in Your IT- environment NoSQL Web Logs IOT ERP distro Metadata NoSQL Standard Reports Ad-hoc Query Tools Data Mining Data explosion Batch to Real-Time DBMS / EDW Legacy Systems MDD/ OLAP Analytical Applications Longer active data DWH/Data Marts 8

The Value of Talend 2015 Talend Inc. 9

The Hadoop ecosystem today Ambari, chukwa, DRILL, Flume, Ganglla, GIRAPH, hadoop, HBASE, HCatalog, HDFS, HIVE, MapReduce, mahout, oozie, PIG, Spark, sqoop, Storm, Whirr, YARN, Zookeeper source: j2eedev.org 10

Brief History of Hadoop and Talend Apache Project Established 2006 Enterprise Hadoop distribubon Vendors Hortonworks, Cloudera, Pivotal, 10Gen, 2008 2010 2012 New performance capabilibes 2014 Widely adopted concepts and technologies 1 st Open Source IntegraBon SoPware 2006 1 st on Hadoop HDFS + Map Reduce 2008 2010 2012 1 st on YARN, HIVE, Spark and Storm 2014 à Talend is matching and supporfng Hadoop ecosystem nafvely 2015 Preferred solufon for Big Data integrafon 11

Talend Big Data Integra&on Visual, Drag and Drop UI 800+ Pre-built connectors Generates MapReduce, Java or SQL Run at cluster scale Load balancing & failover Code optimization Supports Big Data management consoles Integrates with native security Centralized scheduling, monitoring and mgmt Shared repository Auto-documentation 101010101010 Design 10101101010101010101 01010101010101010101 Scale 11010110101010101010 Collaborate 01010101101010101010 1011010101010101 Manage 0101010110Deploy Zero Talend install on Hadoop Cleanse and enrich Native support for Kerberos 12

Zero to Big Data in 10 Minutes! 2015 Talend Inc. 13

Talend Big Data Sandbox à Free virtual image* including: A ready- to- run Pla[orm for Big Data 30 days evalua&on included A distribu&on of Apache Hadoop based on either Cloudera, Hortonworks, or MapR A step- by- step Big Data Insights Cookbook with four big data ready- to- run scenarios * Runs on Oracle VirtualBox 4.2+, VMware Fusion 5.0 + (Mac) or VMware Player (Win) 14

Talend Big Data Architecture NoSQL Web Logs Internet of Things ERP DBMS / EDW Legacy Systems Ingestion Develop and Test Studio Talend Big Data Map Profile Parse Match Cleanse Standardize Share Native Change Data Capture Operations Team Schedule Machine Learning Access NoSQL Standard Reports Ad-hoc Query Tools Data Mining MDD/ OLAP Analytical Applications Benefits Increased Productivity Lowest TCO Future Proof Architecture 15

Talend Big Data Scenarios Clickstream analysis Sen&ment analysis with social media data Log stream analysis using Apache weblogs ETL offloading with Hadoop 16

Example scenario: Sen&ment Analysis with social media data 2015 Talend Inc. 17

Sen&ment Analysis: Overview #Hashtag à Twitter API à Sentiment dictionary à Time zones à Google Geocharts 18

Sen&ment Analysis: Anatomy 19

Take away 2015 Talend Inc. 20

Key messages Talend helps data- driven companies get successful Easiest and fastest computa&on with native code genera&on Open source state- of- the- art technology standards Start small / think Big with future- proof unified architecture Predictable investment with lowest TCO 21

Do You like it? 22

Julien Clarysse PRE- SALES CONSULTANT Office +49(0)228 76 37 76 0 Mobile +49(0)170 5768201 Email jclarysse@talend.com Skype jclarysse_talend Twiper whatdoesdatado www.talend.com Talend Germany GmbH Serva&usstraße 53-53175 Bonn - Germany 23

Screenshot 24

Tweets with #CeBIT 25

Tweets with #IoT 26