Data Science in the Travel Industry

Size: px
Start display at page:

Download "Data Science in the Travel Industry"

Transcription

1 Data Science in the Travel Industry Real-world experience using current leading frameworks Paul Balm September 2015

2 Hello! _I am from Amsterdam, since 2005 in Madrid _ : Ph.D. in particle physics at Fermilab in Chicago _ : Data Processing projects for the European Space Agency _2014 present: Data Scientist at Amadeus Travel Intelligence Page 2

3 Agenda _ Travel Industry and Amadeus _ A User Story _ Experiences with Different Technologies _ Conclusions & Final Remarks All views expressed in this presentation are entirely my own Page 3

4 1 The Travel Industry and Amadeus

5 Amadeus in a few words 1 _ Introduction Amadeus is a technology company dedicated to the global travel industry. We are present in 195 countries with a worldwide team of more than 12,000 people. Our solutions help improve the business performance of travel agencies, corporations, airlines, airports, hotels, railways and more Amadeus IT Group SA Page 5

6 We started our journey in _ Amadeus history Page 6

7 A history of shaping the future of travel 2 _ Amadeus history 1987 Amadeus is founded 1992 First booking made through Amadeus 1995 World leader in number of travel agency locations 1996 e-commerce division launched MILLION bookings made on a single day for the first time 2000 Partnership with BA and Qantas to launch Amadeus Altéa our core Airline IT offering 2010 Amadeus diversifies into IT for hotel, rail and airport 2014 Contracts with Ryanair and Southwest Airlines and a strategic technology relationship with IHG 2015 Amadeus IT Group SA Page 7

8 Robust global operations 3 _ Amadeus today 1.6+ billion data requests processed per day 525+ million travel agency bookings processed in million Passengers Boarded (PBs) in % of the wrld s scheduled network airline seats 2015 Amadeus IT Group SA Page 8

9 Connecting the entire travel ecosystem Cruiselines Airlines Hotels Insurance companies Car rental Travel agencies Ground handlers Airports Ground transportation Ferry operators 2015 Amadeus IT Group SA Page 9

10 Supporting the entire traveller life cycle Post-trip Inspire On trip Search Pre-trip Buy/Purchase 2015 Amadeus IT Group SA Page 10

11 2015 Amadeus IT Group SA 2 A User Story

12 A Network Carrier _An airline operating a network with hubs AMS BER _Any risks or opportunities for this operation? FRA Page 12

13 Schedule Analysis Analyse global schedules to alert of any changes in capacity Schedule information may come from _Commercially provided feeds _Airline inventory data _Publicly available databases (EuroStats, US DoT) Page 13

14 Schedule Analysis Page 14

15 Schedule Analysis Batch processing of different formats of input data _Different formats, different sources and a priori unclear which sources will be useful or when _Store everything always within legal and moral boundaries _Central data site with large NAS (network attached storage) _HDFS based storage at hosted processing clusters _Maintain separation of different customer projects Page 15

16 A Network Carrier _An airline operating a network with hubs AMS BER _Schedule analysis shows competitor entering the market FRA _What s the risk? Page 16

17 Quality of Service QSI: Quality of Service Index The prime quality metric: Time to get from A to B In practice many rules to build QSI that require tuning and iteration Agility as a requirement Need for scalable processing capacity Page 17

18 A Network Carrier _An airline operating a network with hubs AMS BER _Schedule analysis shows competitor entering the market FRA _Risk of loss of market share assessed via QSI analysis _How to react? Page 18

19 Future revenue streams Information about future revenue comes in many forms _Tickets issued _Reservations (foreseen revenue) _Searches _Capacity changes _Sentiments on social networks, weather forecasts and the like _How are average fares developing? _How are passenger counts developing? Requires fast updates live if possible Page 19

20 Streaming? No hard truths here considerations of options only! _Response times of the order of seconds are not required in the presented use-case _We still require to process the large amount of historical data _We think processing the historical data is better done in batch Mature and reliable approach Can we reuse the streaming code for batch processing? _Flink: Suitability for batch processing to be demonstrated _Storm: Using Summingbird, combine with Scalding? _Spark Streaming: Combine with Spark for batches. Maturity? Page 20

21 A Network Carrier _An airline operating a network with hubs AMS BER _Schedule analysis shows competitor entering the market FRA _Risk of loss of market share assessed via QSI analysis _How to react? Page 21

22 3 Technologically

23 The Travel Intelligence Engine Scalable cloud-based Hadoop platform Amadeus Data (Altéa and other) Customer Private Data Industry Weather Demographics Social Media Etc. Visualization layer Applications Raw Data Storage * Batch and streaming processingoutput storage Data Warehouse (* All the raw data is kept after processing to allow for re-processing when required) Page 23

24 Lambda architecture Image reproduced from Page 24

25 Page 25

26 Page 26

27 Page 27 Scoobi

28 Page 28 Scoobi

29 Scoobi Scalding Page 29

30 Benchmarking Spark vs. Scalding Source: Box Engineering Blog Page 30

31 Scoobi Scaldin g Impala Page 31

32 Scoobi Scaldin g Impala Storm Kafka Page 32

33 Scoobi Scaldin g Kafka Impala Storm Web Services Visualization tools Page 33

34 Scoobi Scaldin g Kafka Impala Storm Web Services Python: Flask Scala: Spray Visualization tools Page 34

35 Scoobi Scaldin g Kafka Impala Storm Web Services Python: Flask Scala: Spray Visualization Tools HTML5/JS Page 35

36 Recap Data science opportunities in the travel sector How we are dealing with these opportunities at Amadeus Travel Intelligence An overview of our experience with different tools What s on our horizon for evaluation and why Page 36

37 Some final remarks Different technologies currently under evaluation In all areas (batch, streaming, visualizations) too many options to carefully evaluate all Also, development effort from the community is diluted Consolidation would be welcome! Even so, tools are advancing rapidly and within their limitations, many are of great value Page 37

38 Thank

Where should DMOs spend their marketing budget?

Where should DMOs spend their marketing budget? Where should DMOs spend their marketing budget? International Seminar on Knowledge Sharing for Tourism Destinations Sandro Cuzzolin [email protected] Francisco Santos [email protected]

More information

Cloud-based data warehousing to power aviation analytics

Cloud-based data warehousing to power aviation analytics Cloud-based data warehousing to power aviation analytics Big Data Workshop Transportation Research Board Annual Meeting January 12, 2014 Dr. Tulinda Larsen Vice President [email protected] +1 443.510.3566

More information

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014 Copyright 2014, Oracle and/or its affiliates. All

More information

How To Create A Data Visualization With Apache Spark And Zeppelin 2.5.3.5

How To Create A Data Visualization With Apache Spark And Zeppelin 2.5.3.5 Big Data Visualization using Apache Spark and Zeppelin Prajod Vettiyattil, Software Architect, Wipro Agenda Big Data and Ecosystem tools Apache Spark Apache Zeppelin Data Visualization Combining Spark

More information

Designing Agile Data Pipelines. Ashish Singh Software Engineer, Cloudera

Designing Agile Data Pipelines. Ashish Singh Software Engineer, Cloudera Designing Agile Data Pipelines Ashish Singh Software Engineer, Cloudera About Me Software Engineer @ Cloudera Contributed to Kafka, Hive, Parquet and Sentry Used to work in HPC @singhasdev 204 Cloudera,

More information

The Internet of Things and Big Data: Intro

The Internet of Things and Big Data: Intro The Internet of Things and Big Data: Intro John Berns, Solutions Architect, APAC - MapR Technologies April 22 nd, 2014 1 What This Is; What This Is Not It s not specific to IoT It s not about any specific

More information

Jeff Edwards Head of Global Hotels Group June 2013

Jeff Edwards Head of Global Hotels Group June 2013 Jeff Edwards Head of Global s Group June 2013 Agenda Why? Industry context and business opportunity Distribution strategy and business proposition IT Competitive environment Our strategy Our product offering

More information

BIG DATA ARCHITECTURE AND ANALYTICS BIG DATA STRATEGIES FOR BUSINESS GROWTH

BIG DATA ARCHITECTURE AND ANALYTICS BIG DATA STRATEGIES FOR BUSINESS GROWTH BIG DATA ARCHITECTURE AND ANALYTICS BIG DATA STRATEGIES FOR BUSINESS GROWTH Aaron Werman [email protected] or [email protected] if you are in the payments industry, LinkedIn group Big Data

More information

Are You Big Data Ready?

Are You Big Data Ready? ACS 2015 Annual Canberra Conference Are You Big Data Ready? Vladimir Videnovic Business Solutions Director Oracle Big Data and Analytics Introduction Introduction What is Big Data? If you can't explain

More information

A Case Study of Hadoop in Healthcare

A Case Study of Hadoop in Healthcare Leading a Healthcare Company to the Big Data Promised Land: A Case Study of Hadoop in Healthcare Mohammad Quraishi (IT Senior Principal - Cigna) [email protected] About me BS in Computer Science and Engineering

More information

SAP Travel Management with Amadeus. Sales & e-commerce

SAP Travel Management with Amadeus. Sales & e-commerce SAP Travel Management with Amadeus Sales & e-commerce Contents Seamless integration for a true end-to-end Travel Management Solution 4 For a truly integrated travel management programme 6 Return on Investment

More information

Ali Ghodsi Head of PM and Engineering Databricks

Ali Ghodsi Head of PM and Engineering Databricks Making Big Data Simple Ali Ghodsi Head of PM and Engineering Databricks Big Data is Hard: A Big Data Project Tasks Tasks Build a Hadoop cluster Challenges Clusters hard to setup and manage Build a data

More information

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Hadoop Ecosystem Overview CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Agenda Introduce Hadoop projects to prepare you for your group work Intimate detail will be provided in future

More information

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time? Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time? Kai Wähner [email protected] @KaiWaehner www.kai-waehner.de Disclaimer! These opinions are my own and do not necessarily

More information

Modern marketing trends - Ancillary Revenue

Modern marketing trends - Ancillary Revenue Modern marketing trends - Ancillary Revenue Ancillary revenue Ryanair ancillary services 22.1% in 2011 (breakdown) 11.10 per booked passenger Non-flight scheduled revenues including airport check-in, excess

More information

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: [email protected] Website: www.qburst.com

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: bdg@qburst.com Website: www.qburst.com Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...

More information

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

More information

Bringing Intergalactic Data Speak (a.k.a.: SQL) to Hadoop Martin Willcox [@willcoxmnk], Director Big Data Centre of Excellence (Teradata

Bringing Intergalactic Data Speak (a.k.a.: SQL) to Hadoop Martin Willcox [@willcoxmnk], Director Big Data Centre of Excellence (Teradata Bringing Intergalactic Data Speak (a.k.a.: SQL) to Hadoop Martin Willcox [@willcoxmnk], Director Big Data Centre of Excellence (Teradata International) 4 th June 2015 Agenda A (very!) short history of

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

Big Data at Spotify. Anders Arpteg, Ph D Analytics Machine Learning, Spotify

Big Data at Spotify. Anders Arpteg, Ph D Analytics Machine Learning, Spotify Big Data at Spotify Anders Arpteg, Ph D Analytics Machine Learning, Spotify Quickly about me Quickly about Spotify What is all the data used for? Quickly about Spark Hadoop MR vs Spark Need for (distributed)

More information

Big Data and Data Science. The globally recognised training program

Big Data and Data Science. The globally recognised training program Big Data and Data Science The globally recognised training program Certificate in Big Data Analytics Duration 5 days Big Data and Data Science enables value creation from data, through the use of calculative

More information

Big Data: Making sense of the numbers

Big Data: Making sense of the numbers EXPERT INSIGHTS Big Data: Making sense of the numbers Big Data is about collecting all the information that comes in from multiple sources, analysing it and using it to inform decision-making. This can

More information

Cross-Sell Ancillary Services Add to the flight, a car, insurance and a night

Cross-Sell Ancillary Services Add to the flight, a car, insurance and a night Cross-Sell Ancillary Services Add to the flight, a car, insurance and a night Amadeus Cross-Sell Ancillary Services is quite simply about making the planning and booking of a journey easier for your customers.

More information

Cloudera Enterprise Data Hub in Telecom:

Cloudera Enterprise Data Hub in Telecom: Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer

More information

Amadeus Media Solutions. Media Solutions. Set your objectives. & aim at your target

Amadeus Media Solutions. Media Solutions. Set your objectives. & aim at your target Amadeus Media Solutions Media Solutions Set your objectives & aim at your target 2 Media Solutions About Amadeus IT Group Amadeus IT Group is the leading technology partner to the world s travel industry.

More information

Unlocking the True Value of Hadoop with Open Data Science

Unlocking the True Value of Hadoop with Open Data Science Unlocking the True Value of Hadoop with Open Data Science Kristopher Overholt Solution Architect Big Data Tech 2016 MinneAnalytics June 7, 2016 Overview Overview of Open Data Science Python and the Big

More information

Accelerate your Big Data Strategy. Execute faster with Capgemini and Cloudera s Enterprise Data Hub Accelerator

Accelerate your Big Data Strategy. Execute faster with Capgemini and Cloudera s Enterprise Data Hub Accelerator Accelerate your Big Data Strategy Execute faster with Capgemini and Cloudera s Enterprise Data Hub Accelerator Enterprise Data Hub Accelerator enables you to get started rapidly and cost-effectively with

More information

The Top 10 7 Hadoop Patterns and Anti-patterns. Alex Holmes @

The Top 10 7 Hadoop Patterns and Anti-patterns. Alex Holmes @ The Top 10 7 Hadoop Patterns and Anti-patterns Alex Holmes @ whoami Alex Holmes Software engineer Working on distributed systems for many years Hadoop since 2008 @grep_alex grepalex.com what s hadoop...

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

More information

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations Beyond Lambda - how to get from logical to physical Artur Borycki, Director International Technology & Innovations Simplification & Efficiency Teradata believe in the principles of self-service, automation

More information

NOT IN KANSAS ANY MORE

NOT IN KANSAS ANY MORE NOT IN KANSAS ANY MORE How we moved into Big Data Dan Taylor - JDSU Dan Taylor Dan Taylor: An Engineering Manager, Software Developer, data enthusiast and advocate of all things Agile. I m currently lucky

More information

Big Data Success Step 1: Get the Technology Right

Big Data Success Step 1: Get the Technology Right Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation

More information

Big Data Use Case: Business Analytics

Big Data Use Case: Business Analytics Big Data Use Case: Business Analytics Starting point A telecommunications company wants to allude to the topic of Big Data. The established Big Data working group has access to the data stock of the enterprise

More information

The digital future and dealing with disruption

The digital future and dealing with disruption The digital future and dealing with disruption Dr Giles Nelson, Senior Vice President of Product Marketing and Strategy May 2015 1 BIG CHANGE due to digitization 2 2 billion internet users worldwide 40%

More information

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning

More information

How Companies are! Using Spark

How Companies are! Using Spark How Companies are! Using Spark And where the Edge in Big Data will be Matei Zaharia History Decreasing storage costs have led to an explosion of big data Commodity cluster software, like Hadoop, has made

More information

HiBench Introduction. Carson Wang ([email protected]) Software & Services Group

HiBench Introduction. Carson Wang (carson.wang@intel.com) Software & Services Group HiBench Introduction Carson Wang ([email protected]) Agenda Background Workloads Configurations Benchmark Report Tuning Guide Background WHY Why we need big data benchmarking systems? WHAT What is

More information

Cloud Big Data Architectures

Cloud Big Data Architectures Cloud Big Data Architectures Lynn Langit QCon Sao Paulo, Brazil 2016 About this Workshop Real-world Cloud Scenarios w/aws, Azure and GCP 1. Big Data Solution Types 2. Data Pipelines 3. ETL and Visualization

More information

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer Ericsson is a world-leading provider of telecommunications equipment & services to mobile & fixed network operators. Over 1,000 networks in more than 180 countries use Ericsson equipment, & more than 40

More information

Go with the flow: Transportation information management solutions from IBM.

Go with the flow: Transportation information management solutions from IBM. IBM Government Industry Solution Brief Smarter Transportation Go with the flow: Transportation information management solutions from IBM. Highlights: Extend the life of your transportation assets, optimize

More information

CRITEO INTERNSHIP PROGRAM 2015/2016

CRITEO INTERNSHIP PROGRAM 2015/2016 CRITEO INTERNSHIP PROGRAM 2015/2016 A. List of topics PLATFORM Topic 1: Build an API and a web interface on top of it to manage the back-end of our third party demand component. Challenge(s): Working with

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful

More information

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84 Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics

More information

Data Pipeline with Kafka

Data Pipeline with Kafka Data Pipeline with Kafka Peerapat Asoktummarungsri AGODA Senior Software Engineer Agoda.com Contributor Thai Java User Group (THJUG.com) Contributor Agile66 AGENDA Big Data & Data Pipeline Kafka Introduction

More information

Information-Driven Transformation in Retail with the Enterprise Data Hub Accelerator

Information-Driven Transformation in Retail with the Enterprise Data Hub Accelerator Introduction Enterprise Data Hub Accelerator Retail Sector Use Cases Capabilities Information-Driven Transformation in Retail with the Enterprise Data Hub Accelerator Introduction Enterprise Data Hub Accelerator

More information

TECHNOLOGY TRANSFER PRESENTS INTERNATIONAL. Rome, December 3-4 2015 Residenza di Ripetta Via di Ripetta, 231 CONFERENCE BIG DATA

TECHNOLOGY TRANSFER PRESENTS INTERNATIONAL. Rome, December 3-4 2015 Residenza di Ripetta Via di Ripetta, 231 CONFERENCE BIG DATA TECHNOLOGY TRANSFER PRESENTS Rome, December 3-4 2015 Residenza di Ripetta Via di Ripetta, 231 INTERNATIONAL CONFERENCE 2 0 1 5 BIG DATA A B O U T T H E S U M M I T The last twelve months can only be described

More information

Hyperion is Business Performance Management

Hyperion is Business Performance Management Hyperion is Business Performance Management Hyperion is a $0.5 Billion business performance management software company serving more than 6,000 customers worldwide. Annual Revenues ($ millions) EPS (diluted)

More information

Real Time Data Processing using Spark Streaming

Real Time Data Processing using Spark Streaming Real Time Data Processing using Spark Streaming Hari Shreedharan, Software Engineer @ Cloudera Committer/PMC Member, Apache Flume Committer, Apache Sqoop Contributor, Apache Spark Author, Using Flume (O

More information

Tap into Hadoop and Other No SQL Sources

Tap into Hadoop and Other No SQL Sources Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data

More information

Business Travel in Asia Pacific: Setting the Smart Course. The Essential Asia Pacific Corporate Travel Handbook

Business Travel in Asia Pacific: Setting the Smart Course. The Essential Asia Pacific Corporate Travel Handbook Business Travel in Asia Pacific: Setting the Smart Course The Essential Asia Pacific Corporate Travel Handbook 3 Table of Contents Foreword Executive summary All roads lead to Asia Pacific Technology

More information

Driving Growth in Insurance With a Big Data Architecture

Driving Growth in Insurance With a Big Data Architecture Driving Growth in Insurance With a Big Data Architecture The SAS and Cloudera Advantage Version: 103 Table of Contents Overview 3 Current Data Challenges for Insurers 3 Unlocking the Power of Big Data

More information

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015 Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document

More information

3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS

3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS . 3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS Deliver fast actionable business insights for data scientists, rapid application creation for developers and enterprise-grade

More information

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet Ema Iancuta [email protected] Radu Chilom [email protected] Buzzwords Berlin - 2015 Big data analytics / machine

More information

Enabling the Modern Business through IT. How Cloud ERP Can Help Meet Rapidly Evolving Business Needs

Enabling the Modern Business through IT. How Cloud ERP Can Help Meet Rapidly Evolving Business Needs Enabling the Modern Business through IT How Cloud ERP Can Help Meet Rapidly Evolving Business Needs As the pace of innovation accelerates, customer needs change and industries converge, many companies

More information

Introduction to Apache Kafka And Real-Time ETL. for Oracle DBAs and Data Analysts

Introduction to Apache Kafka And Real-Time ETL. for Oracle DBAs and Data Analysts Introduction to Apache Kafka And Real-Time ETL for Oracle DBAs and Data Analysts 1 About Myself Gwen Shapira System Architect @Confluent Committer @ Apache Kafka, Apache Sqoop Author of Hadoop Application

More information

Detecting Anomalous Behavior with the Business Data Lake. Reference Architecture and Enterprise Approaches.

Detecting Anomalous Behavior with the Business Data Lake. Reference Architecture and Enterprise Approaches. Detecting Anomalous Behavior with the Business Data Lake Reference Architecture and Enterprise Approaches. 2 Detecting Anomalous Behavior with the Business Data Lake Pivotal the way we see it Reference

More information

Travelport. Product Type(s) Contact Details. Company Information. Product Information. Air Reservations. Car Reservations. Global Distribution System

Travelport. Product Type(s) Contact Details. Company Information. Product Information. Air Reservations. Car Reservations. Global Distribution System Travelport Product Type(s) Global Distribution System Contact Details Travelport GDS, Axis Park 10 Hurricane Way Langley, Berkshire SL3 8AG United Kingdom Tel: offices worldwide see website Web: www.travelport.com

More information

Testing 3Vs (Volume, Variety and Velocity) of Big Data

Testing 3Vs (Volume, Variety and Velocity) of Big Data Testing 3Vs (Volume, Variety and Velocity) of Big Data 1 A lot happens in the Digital World in 60 seconds 2 What is Big Data Big Data refers to data sets whose size is beyond the ability of commonly used

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

CAPTURING & PROCESSING REAL-TIME DATA ON AWS

CAPTURING & PROCESSING REAL-TIME DATA ON AWS CAPTURING & PROCESSING REAL-TIME DATA ON AWS @ 2015 Amazon.com, Inc. and Its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent

More information

How to Use Boards for Competitive Intelligence

How to Use Boards for Competitive Intelligence How to Use Boards for Competitive Intelligence Boards are highly customized, interactive dashboards that ubervu via Hootsuite users can personalize to fit a specific task, job function or use case like

More information

Qantas cross-sell Case study

Qantas cross-sell Case study 1 Amadeus Qantas cross-sell case study Qantas cross-sell Case study Qantas cross-sells its way to new revenues and happy customers. A success story on the benefits of leveraging third party ancillary services

More information

[Hadoop, Storm and Couchbase: Faster Big Data]

[Hadoop, Storm and Couchbase: Faster Big Data] [Hadoop, Storm and Couchbase: Faster Big Data] With over 8,500 clients, LivePerson is the global leader in intelligent online customer engagement. With an increasing amount of agent/customer engagements,

More information

The Truth about Finding the Lowest Airfare

The Truth about Finding the Lowest Airfare The Truth about Finding the Lowest Airfare My employees tell me they can find lower airfares on the Internet 2011 Hurley Travel Experts, Inc. Reproduction Prohibited. This paper includes data that shall

More information

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, [email protected]

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, [email protected] Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache

More information

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Volker Markl [email protected] dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On

More information

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy Presented by: Jeffrey Zhang and Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop?

More information

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld Tapping into Hadoop and NoSQL Data Sources in MicroStrategy Presented by: Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop? Customer Case

More information

Big Data Analytics! Architectures, Algorithms and Applications! Part #3: Analytics Platform

Big Data Analytics! Architectures, Algorithms and Applications! Part #3: Analytics Platform Big Data Analytics! Architectures, Algorithms and Applications! Part #3: Analytics Platform Simon Wu! HTC (Prior: Twitter & Microsoft)! Edward Chang 張 智 威 HTC (prior: Google & U. California)! 1/26/2015

More information

Dansk IT Big Data i de største danske banker

Dansk IT Big Data i de største danske banker Dansk IT Big Data i de største danske banker How can we realize the benefits Presentation 7/4-2016 Jens Chr. Ipsen, head of Information Management & Data Warehouse The essence of Danske Bank Vision To

More information

2015 European business travel and expense analysis. Ten key insights into business travel and expense trends and corporate practices

2015 European business travel and expense analysis. Ten key insights into business travel and expense trends and corporate practices European business travel and expense analysis Ten key insights into business travel and expense trends and corporate practices Foreword This white paper reviews the highlights of an independent study conducted

More information

How to Run a Successful Big Data POC in 6 Weeks

How to Run a Successful Big Data POC in 6 Weeks Executive Summary How to Run a Successful Big Data POC in 6 Weeks A Practical Workbook to Deploy Your First Proof of Concept and Avoid Early Failure Executive Summary As big data technologies move into

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Six top tips for travel managers to create savings in 2015

Six top tips for travel managers to create savings in 2015 Six top tips for travel managers to create savings in 2015 E-Guide 2 Introduction Savings remain a key focal point for Travel Managers in 2015 and through regular reviews and analysis, using management

More information

Amadeus Cars Technology Solutions. Cars. Love cars. & love Amadeus

Amadeus Cars Technology Solutions. Cars. Love cars. & love Amadeus Amadeus Cars Technology Solutions Cars Love cars & love Amadeus Love cars, love Amadeus & seize new opportunities We re passionate about cars and we re passionate about you. Working together, we can build

More information

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/

More information

Apache Flink Next-gen data analysis. Kostas Tzoumas [email protected] @kostas_tzoumas

Apache Flink Next-gen data analysis. Kostas Tzoumas ktzoumas@apache.org @kostas_tzoumas Apache Flink Next-gen data analysis Kostas Tzoumas [email protected] @kostas_tzoumas What is Flink Project undergoing incubation in the Apache Software Foundation Originating from the Stratosphere research

More information

More Data in Less Time

More Data in Less Time More Data in Less Time Leveraging Cloudera CDH as an Operational Data Store Daniel Tydecks, Systems Engineering DACH & CE Goals of an Operational Data Store Load Data Sources Traditional Architecture Operational

More information

www.pwc.com Implementation of Big Data and Analytics Projects with Big Data Discovery and BICS March 2015

www.pwc.com Implementation of Big Data and Analytics Projects with Big Data Discovery and BICS March 2015 www.pwc.com Implementation of Big Data and Analytics Projects with Big Data Discovery and BICS Agenda Big Data Discovery Oracle Business Intelligence Cloud Services (BICS) Use Cases How to start and our

More information

Deploying an Operational Data Store Designed for Big Data

Deploying an Operational Data Store Designed for Big Data Deploying an Operational Data Store Designed for Big Data A fast, secure, and scalable data staging environment with no data volume or variety constraints Sponsored by: Version: 102 Table of Contents Introduction

More information

Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping. Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP

Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping. Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP Legal disclaimer The information in this presentation is confidential

More information

Airline Schedule Development

Airline Schedule Development Airline Schedule Development 16.75J/1.234J Airline Management Dr. Peter Belobaba February 22, 2006 Airline Schedule Development 1. Schedule Development Process Airline supply terminology Sequential approach

More information

BIG DATA TECHNOLOGY. Hadoop Ecosystem

BIG DATA TECHNOLOGY. Hadoop Ecosystem BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big

More information

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management Big Data and New Paradigms in Information Management Vladimir Videnovic Institute for Information Management 2 "I am certainly not an advocate for frequent and untried changes laws and institutions must

More information

Big Data. threat or opportunity? Philip Louis [email protected] CEO Vine Cube UK Singapore India

Big Data. threat or opportunity? Philip Louis philip@vine-cube.com CEO Vine Cube UK Singapore India Big Data threat or opportunity? Philip Louis [email protected] CEO Vine Cube UK Singapore India Depends on your vision and courage and it is coming your way! opportunity response It is not just BIG

More information

P&G. Harnessing the Power of Real-Time Information Helping P&G make better, faster decisions.

P&G. Harnessing the Power of Real-Time Information Helping P&G make better, faster decisions. P&G Harnessing the Power of Real-Time Information Helping P&G make better, faster decisions. Guy Peri 20 September, 2011 Agenda 1 About P&G/GBS 2 The Challenge 3 4 Going Digital Business Intelligence at

More information

The Challenge of the Low-cost Airlines

The Challenge of the Low-cost Airlines The Challenge of the Low-cost Airlines Air Transport Management Seminar Universidade Lusofona Lisbon 7th - 11th January 2008 Dr Keith Mason Director Business Travel Research Centre www.businesstravelresearch.com

More information

Getting Real Real Time Data Integration Patterns and Architectures

Getting Real Real Time Data Integration Patterns and Architectures Getting Real Real Time Data Integration Patterns and Architectures Nelson Petracek Senior Director, Enterprise Technology Architecture Informatica Digital Government Institute s Enterprise Architecture

More information

TDWI: BUSINESS INTELLIGENCE & DATA WAREHOUSING EDUCATION EUROPE

TDWI: BUSINESS INTELLIGENCE & DATA WAREHOUSING EDUCATION EUROPE TDWI: BUSINESS INTELLIGENCE & DATA WAREHOUSING EDUCATION EUROPE TDWI In-Depth Courses 1st Half 2016 In-Depth course: Data Visualization In-Depth course: Big Data In-Depth course: Hadoop CBIP Preparation

More information

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Accelerating Hadoop MapReduce Using an In-Memory Data Grid Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for

More information

How To Understand The Benefits Of Big Data

How To Understand The Benefits Of Big Data Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract

More information

The Future of Data Management with Hadoop and the Enterprise Data Hub

The Future of Data Management with Hadoop and the Enterprise Data Hub The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees

More information