A Perfect Storm. Oracle Big Data Science for Enterprise R and SAS Users. Marcos Arancibia, Consulting Product Manager marcos.arancibia@oracle.
|
|
- Catherine Caldwell
- 8 years ago
- Views:
Transcription
1 A Perfect Storm Oracle Big Data Science for Enterprise R and SAS Users Mark Hornick, Director, Advanced Analytics Marcos Arancibia, Consulting Product Manager marcos.arancibia@oracle.com
2 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle. 2
3 Perfect Storm: something that describes an actual phenomenon that happens to occur in such a confluence, resulting in an event of unusual magnitude - Wikipedia 3
4 Big Data Cloud Masters in Data Science 4
5 Events of unusual magnitude? Massive migrations A changing of the guard 5
6 Agenda What is R? Who is using R and why? Overview of Oracle R Technologies Global customer tour Demonstration 6
7 What is R? R is an Open Source scripting language and environment for statistical computing and graphics Started in 1994 as an Alternative to SAS, SPSS and other proprietary Statistical Environments The R environment R is an integrated suite of software facilities for data manipulation, calculation and graphical display Millions of R users worldwide Widely taught in Universities Many Corporate Analysts and Data Scientists know and use R Thousands of open sources packages to enhance productivity such as: Bioinformatics Spatial Statistics Financial Market Analysis Social Network Analysis
8 Why statisticians, data analysts, data scientists use R R environment is.. Powerful Extensible Graphical Extensive statistics OOTB functionality with many knobs but smart defaults Ease of installation and use Free R is a statistics language similar to Base SAS or SPSS statistics
9 R s Popularity Number of Google Scholar hits SPSS SAS R SPSS has a clear lead, but you can see that its dominance peaked in 2007 and its use is now in sharp decline. SAS never came close to SPSS level of dominance, and it peaked in Robert A. Muenchen
10 R s Popularity Number of Google Scholar Documents SAS and SPSS removed R the use of R is experiencing very rapid growth and is pulling away from the pack, solidifying its position in third place. Robert A. Muenchen
11 R s Popularity Job Trends R vs. SPSS R vs. SAS
12 KDNuggets Poll: Languages used for Analytics / Data Mining 12
13 Customer Pain Points with Advanced Analytics for example It takes too long to get my data or to get the right data I can t analyze or mine all of my data it has to be sampled Putting models and results into production is ad hoc and complex Recoding models into SQL, C, or Java takes time and is error prone Our company is concerned about data security, backup and recovery We need to build 10s of thousands of models fast to meet business objectives 13
14 Oracle R Technologies 14
15 Oracle R Distribution Ability to dynamically load Intel Math Kernel Library AMD Core Math Library Solaris Sun Performance Library Oracle Support An Oracle-Supported Redistribution of Open Source R Enhanced linear algebra performance via dynamically loaded libraries Improve scalability at client and database for embedded R execution Enterprise support for customers of Oracle Advanced Analytics option, Big Data Appliance, and Oracle Linux Free download Oracle contributes bug fixes and enhancements to open source R
16 ROracle Oracle Database ROracle R package enabling scalable and performant connectivity to Oracle Database Open source, publicly available on CRAN Oracle is maintainer Oracle Database Interface (DBI) for R Re-implemented and optimized driver based on OCI Execute SQL statements from R interface Enables transactional behavior for insert, update, and delete
17 Oracle R Enterprise Oracle Advanced Analytics Option to Oracle Database Eliminate memory constraint of client R engine Minimize or eliminate data movement latency Leverage Oracle Database as HPC environment Execute R scripts through database server machine for scalability and performance Leverage parallel, distributed in-database data mining algorithms Execute and manage R scripts via SQL Operationalize R scripts in production applications eliminate porting R code Avoid reinventing code to integrate R results into existing applications Client R Engine ORE packages Oracle Database User tables In-db stats SQL Interfaces SQL*Plus, SQLDeveloper, Database Server Machine 17
18 Database-centric architecture Smart meter scenario Oracle Database Data c1 c2 ci cn R Datastore R Script Repository f(dat,args, ) f(dat,args, ) f(dat,args, ) f(dat,args, ) 200K Households f(dat,args, ) { R Script build model Model c1 Model c2 Model ci Model cn }
19 Database-centric architecture Smart meter scenario Oracle Database Data c1 c2 ci cn R Datastore Model R Script Repository f(dat,args, ) f(dat,args, ) f(dat,args, ) f(dat,args, ) 200K Households f(dat,args, ) { } R Script score data scores c1 scores c2 scores ci scores cn
20 Build models and store in database, partition on CUST_ID ore.groupapply (CUST_USAGE_DATA, 14 lines CUST_USAGE_DATA$CUST_ID, function(dat, ds.name) { cust_id <- dat$cust_id[1] mod <- lm(consumption ~. -CUST_ID, dat) mod$effects <- mod$residuals <- mod$fitted.values <- NULL name <- paste("mod", cust_id,sep="") assign(name, mod) ds.name1 <- paste(ds.name,".",cust_id,sep="") ore.save(list=paste("mod",cust_id,sep=""), name=ds.name1, overwrite=true) TRUE }, ds.name="mydatastore", ore.connect=true, parallel=true ) 20
21 Score customers in database, partition on CUST_ID ore.groupapply(cust_usage_data_new, CUST_USAGE_DATA_NEW$CUST_ID, 16 lines function(dat, ds.name) { cust_id <- dat$cust_id[1] ds.name1 <- paste(ds.name,".",cust_id,sep="") ore.load(ds.name1) name <- paste("mod", cust_id,sep="") mod <- get(name) prd <- predict(mod, newdata=dat) prd[as.integer(rownames(prd))] <- prd res <- cbind(cust_id=cust_id, PRED = prd) data.frame(res) }, ds.name="mydatastore", ore.connect=true, parallel=true, FUN.VALUE=data.frame(CUST_ID=numeric(0), PRED=numeric(0)) ) 21
22 Execution (sec) Performance with DOP= Models Data: 26,280,000 rows Total build time: 65.2 seconds Total scoring time: 25.7 seconds (all data) 50,000 Models Data: 1,314,000,000 rows Total build time: minutes Total scoring time: 18 minutes (all data) 10,000 Models Data: 262,800,000 rows Total build time: 516 seconds Total scoring time: 217 seconds (all data) 1 Model/Customer # rows (millions) Build Time Score Time 22
23 Seconds (log scale) Oracle Advanced Analytics on Exadata X3-2 ¼ Rack Scalability of the new distributed ore.lm() Linear Regression Model Building using 30 numeric variables: Leading Vendor on a machine connected directly to the same Exadata box took 2+ hours for ETL+Exec on 34mi records 7, minutes Leading Vendor-34mi OAA-34mi OAA-180mi OAA-299mi OAA-2.99Bi Engine/Database size (records)
24 Oracle Advanced Analytics Option Fastest Way to Deliver Scalable Enterprise-wide Predictive Analytics Better Decisions with Deeper Insights & Predictive Analytics Understand and predict customer behavior for churn, fraud, cross-sell, and many other business problems Easy to Use Data analysts: Mining work flow GUI (part of SQL Developer) Data scientists: R and SQL languages supported DBA: SQL integration Comprehensive Analytics on a Simple Architecture Performance and scalability of the Oracle Database Lowest Total Cost of Ownership No need for separate analytical servers Components Oracle R Enterprise Oracle Data Mining
25 HCache Hadoop Abstraction Layer Oracle R Advanced Analytics for Hadoop Oracle Big Data Connectors option to Big Data Appliance ORD R Client R script {CRAN packages} Hadoop Job Mapper Reducer R HDFS R MapReduce R Hive R sqoop/olh Hadoop Cluster MapReduce Nodes {CRAN packages} HDFS Nodes ORD Transparent access to Hadoop Cluster from R Manipulate data in HDFS, Hive, database, and file system Write and execute MapReduce jobs with R Leverage CRAN R packages to work on HDFS-resident data Prepackaged parallel, distributed algorithms Oracle Database
26 Oracle Big Data Platform Oracle Big Data Appliance Optimized for Hadoop, R, and NoSQL Processing Oracle Big Data Connectors Oracle Exadata System of Record Optimized for DW/OLTP Oracle Exalytics Optimized for Analytics & In-Memory Workloads Hadoop Oracle R Distribution Oracle NoSQL Database Applications Oracle R Oracle Big Data Advanced Analytics Connectors for Hadoop + Oracle Data Integrator Oracle R Oracle Oracle Enterprise Advanced Advanced Analytics Analytics Oracle Data Mining Data Warehouse Oracle Database Oracle R Distribution Oracle Enterprise Performance Management Oracle Business Intelligence Applications Oracle Business Intelligence Tools Oracle Endeca Information Discovery Stream Acquire Organize Discover & Analyze
27 A Global Customer Tour 27
28 Quick Houston Facts: Most populous city in Texas Metropolitan area is the fifth-most populated in the U.S., with over 6 million people Leading in energy, manufacturing, aeronautics, transportation, health care sectors and building oilfield equipment Only New York City is home to more Fortune 500 Headquarters. Panoramic Houston skyline Oracle R Enterprise at Apache Oil: Segmentation of drilling problems to understand potential problems ahead of time Predictive maintenance of assets to prevent waiting a day for replacement of drill bits or other components helps optimize Revenues The space shuttle Challenger atop its Boeing 747 SCA, flying over Johnson Space Center,
29 Quick Guatemala Facts: 15.8 mi inhabitants Guatemala City is the Capital Spanish spoken by 93% of Population 21 Mayan and 2 Amerindian languages also spoken Service sector is largest component of GDP at 63%, followed by industry sector at 23.8% and agriculture sector at 13.2% (2010 est.) Mayan City of Tikal Oracle R Enterprise at TIGO: Customer Behavior of 5.5M customers with 1.8B transactions Generate 5 models per customer to understand mobility using Lat/Long of the Cell Tower of each transaction Evaluate 27.5M segmentation models in 25 minutes, or over 1M models/minute Guatemala City Today 29
30 Quick Cincinnati Facts: First major American city founded after American Revolution First major inland purely American city in country Cincinnati Reds have a storied history as being first professional club, hosting first night game, and dominating 1970s as the "Big Red Machine" American Sign Museum Oracle Advanced Analytics at dunnhumby: Very long ETL time eliminated with in-database Advanced Analytics Modeling behavior of millions of shoppers Coupon optimization for Retailers on Billions of transactions Insights and exploration on all data vs. samples Average Project time from 19 days to 3 days Music Hall Cincinnati Reds: Great American Ballpark
31 Quick Orlando Facts: Nicknamed "The City Beautiful" Symbol is the fountain at Lake Eola Theme Park Capital of the World 51+ million tourists a year, 3.6 million international Walt Disney World Resort: Magic Kingdom, Hollywood Studios, Epcot, Animal Kingdom Universal Studios Orlando SeaWorld Oracle R Enterprise at Olive Garden: Olive Garden, traditionally managing its 830 restaurants nationally, transitioned to a localized approach with the help of predictive analytics Evaluated 115 million transactions in just 5 percent the time required by previous BI tool Supporting Olive Garden s latest remodel campaign, continuing to uncover millions in profits by optimizing pricing and menu assortment TODAY, 2:00PM! Moscone South 308 CON2898
32 Quick Lima, Peru Facts: Capital and the largest city of Peru with 9M citizens Most populous metropolitan area of Peru Fifth largest city in the Americas (as defined by "city proper") Home to one of the oldest higher learning institutions in the New World National University of San Marcos, founded on May 12, 1551 Oracle R Enterprise at Financiera Uno: Reduce time to build credit scoring models to ensure their market relevancy Scale to handle big data volumes Rapidly deploy credit scoring models into production applications
33 Quick London Facts: One of world's leading financial centers Has fifth-or sixth-largest metropolitan area GDP in the world depending on measurement World cultural capital World's most-visited city as measured by international arrivals World's largest city airport system measured by passenger traffic Oracle R Enterprise at Major Financial Company: Earnings calculations reduced from 7 hours to 4 minutes Scoring on written premium reduced from 100 minutes to 7 minutes Scoring on earned premium reduced from 25 minutes to 8 minutes with added functionality Load time improved by factor of 15X 33
34 Quick Geneva Facts: Most populous city of Romandy, the Frenchspeaking part of Switzerland A financial center Worldwide center for diplomacy Headquarters of many of the agencies of the United Nations and the Red Cross Hosts highest number of international organizations in the world Oracle R Enterprise at CERN: Real time monitoring and anomaly detection of tens of thousands of events per second CERN Central Logging Service: complex in-database time series analysis and forecasting Electrical Consumption forecast for LHC and experiments 34
35 Quick Croatia Facts: Member of European Union (EU) and United Nations (UN) Tourism is a significant source of revenue during the summer Ranked 18th most popular tourist destination in the world Oracle R Enterprise at ZABA Bank: Historical Customer Behavior Analysis shortened from several months to 2 weeks Specialized Variable Clustering algorithm running in parallel to replace leading vendor solution Faster model development resulted in better model quality and increasing bottom line
36 Quick Korea Facts: Roughly half of the country's 50 million people reside in the metropolitan area surrounding its capital, Seoul Seoul Capital Area is the second largest in the world with over 25 million residents Eighth largest country in international trade A regional power with world's 10th largest defense budget Oracle R Enterprise at BISTEL: Oracle ORE enables BISTEL to perform analytics with much more data faster and enables them to gain more insight (root cause and prediction) With Oracle Exadata BISTEL can do enterprise advanced process control in Mega/Giga fabs in high-tech manufacturing
37 37
38 Video 39
39 Demonstration 40
40 compare 41
41 See these Advanced Analytics Talks at OOW 14 CON Developing Relevant Dining Visits with Oracle Advanced Analytics at Olive Garden CON Extending the Power of In-Database Analytics with Oracle Big Data Appliance CON Predictive Analytics with Oracle Data Mining CON Market Basket Analysis at Dunkin Brands CON Big Data and Predictive Analytics: Fiserv Data Mining Case Study 42
42 Learn More about Oracle s R Technologies Join us at our booth Moscone South SLD-115 Oracle Demogrounds 43
43 Oracle Confidential Internal/Restricted/Highly Restricted 44
44
Massive Predictive Modeling using Oracle R Technologies Mark Hornick, Director, Oracle Advanced Analytics
Massive Predictive Modeling using Oracle R Technologies Mark Hornick, Director, Oracle Advanced Analytics Safe Harbor Statement The following is intended to outline our general product direction. It is
More informationBig Data Analytics Scaling R to Enterprise Data user! 2013 Albacete Spain #user2013
Big Analytics Scaling R to Enterprise user! 2013 Albacete Spain #user2013 Luis Campos Mark Hornick 1 Big Solutions Lead, Oracle EMEA Director, Oracle base Advanced Analytics @luigicampos @MarkHornick 2
More informationOracle Big Data Handbook
ORACLG Oracle Press Oracle Big Data Handbook Tom Plunkett Brian Macdonald Bruce Nelson Helen Sun Khader Mohiuddin Debra L. Harding David Segleau Gokula Mishra Mark F. Hornick Robert Stackowiak Keith Laker
More informationStarting Smart with Oracle Advanced Analytics
Starting Smart with Oracle Advanced Analytics Great Lakes Oracle Conference Tim Vlamis Thursday, May 19, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationBig Data Are You Ready? Thomas Kyte http://asktom.oracle.com
Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationExecutive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform...
Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure Requirements... 5 Solution Spectrum... 6 Oracle s Big Data
More informationOracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>
s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline
More informationAdvanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationBig Data Use Cases Update
Big Data Use Cases Update Sanat Joshi Industry Solutions Manufacturing Industries Business Unit 1 Data Explosion Web & social networks experienced it first Infographic by Go-gulf.com 2 Number Of Connected
More informationAn Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
More informationExtend your analytic capabilities with SAP Predictive Analysis
September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics
More informationOracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
More informationTUT NoSQL Seminar (Oracle) Big Data
Timo Raitalaakso +358 40 848 0148 rafu@solita.fi TUT NoSQL Seminar (Oracle) Big Data 11.12.2012 Timo Raitalaakso MSc 2000 Work: Solita since 2001 Senior Database Specialist Oracle ACE 2012 Blog: http://rafudb.blogspot.com
More informationOracle Big Data Strategy Simplified Infrastrcuture
Big Data Oracle Big Data Strategy Simplified Infrastrcuture Selim Burduroğlu Global Innovation Evangelist & Architect Education & Research Industry Business Unit Oracle Confidential Internal/Restricted/Highly
More informationSafe Harbor Statement
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment
More informationAn Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture
An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP ESG Data Systems Architecture Big Data & Analytics as a Service Components Unstructured Data / Sparse Data of Value
More informationBig Data Are You Ready? Jorge Plascencia Solution Architect Manager
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something
More informationAn Oracle White Paper October 2011. Oracle: Big Data for the Enterprise
An Oracle White Paper October 2011 Oracle: Big Data for the Enterprise Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationOracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features
Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features Charlie Berger, MS Eng, MBA Sr. Director Product Management, Data Mining and Advanced Analytics charlie.berger@oracle.com www.twitter.com/charliedatamine
More informationOracle Big Data Building A Big Data Management System
Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following
More informationAGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
More informationAn Oracle White Paper June 2013. Oracle: Big Data for the Enterprise
An Oracle White Paper June 2013 Oracle: Big Data for the Enterprise Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure
More informationESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
More informationOracle Database 12c Plug In. Switch On. Get SMART.
Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.
More informationSAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics
SAP Brief SAP HANA Objectives Transform Your Future with Better Business Insight Using Predictive Analytics Dealing with the new reality Dealing with the new reality Organizations like yours can identify
More informationANALYTICS CENTER LEARNING PROGRAM
Overview of Curriculum ANALYTICS CENTER LEARNING PROGRAM The following courses are offered by Analytics Center as part of its learning program: Course Duration Prerequisites 1- Math and Theory 101 - Fundamentals
More informationOracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining
Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining R The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationIn-Memory Analytics for Big Data
In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...
More informationData Analysis with Various Oracle Business Intelligence and Analytic Tools
Data Analysis with Various Oracle Business Intelligence and Analytic Tools Session ID: 108680 Prepared by: Tim and Dan Vlamis Vlamis Software Solutions www.vlamis.com @TimVlamis Agenda What we will talk
More informationSAP Predictive Analytics: An Overview and Roadmap. Charles Gadalla, SAP @cgadalla SESSION CODE: 603
SAP Predictive Analytics: An Overview and Roadmap Charles Gadalla, SAP @cgadalla SESSION CODE: 603 Advanced Analytics SAP Vision Embed Smart Agile Analytics into Decision Processes to Deliver Business
More informationOracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining. Data Warehouse Global Leaders Winter 2013
Oracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining Data Warehouse Global Leaders Winter 2013 Dan Vlamis, Vlamis Software Solutions Tim Vlamis, Vlamis Software
More informationAn Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise
An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our
More informationAn Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
More informationOracle Big Data Essentials
Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 40291196 Oracle Big Data Essentials Duration: 3 Days What you will learn This Oracle Big Data Essentials training deep dives into using the
More informationImplement Hadoop jobs to extract business value from large and varied data sets
Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to
More informationMySQL and Hadoop: Big Data Integration. Shubhangi Garg & Neha Kumari MySQL Engineering
MySQL and Hadoop: Big Data Integration Shubhangi Garg & Neha Kumari MySQL Engineering 1Copyright 2013, Oracle and/or its affiliates. All rights reserved. Agenda Design rationale Implementation Installation
More informationAn Oracle White Paper September 2014. Oracle: Big Data for the Enterprise
An Oracle White Paper September 2014 Oracle: Big Data for the Enterprise Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform...
More informationArchitectures for Big Data Analytics A database perspective
Architectures for Big Data Analytics A database perspective Fernando Velez Director of Product Management Enterprise Information Management, SAP June 2013 Outline Big Data Analytics Requirements Spectrum
More informationBIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
More informationUp Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata
Up Your R Game James Taylor, Decision Management Solutions Bill Franks, Teradata Today s Speakers James Taylor Bill Franks CEO Chief Analytics Officer Decision Management Solutions Teradata 7/28/14 3 Polling
More informationI/O Considerations in Big Data Analytics
Library of Congress I/O Considerations in Big Data Analytics 26 September 2011 Marshall Presser Federal Field CTO EMC, Data Computing Division 1 Paradigms in Big Data Structured (relational) data Very
More informationDisrupt or be disrupted IT Driving Business Transformation
Disrupt or be disrupted IT Driving Business Transformation Gokula Mishra VP, Big Data & Advanced Analytics Business Analytics Product Group Copyright 2014 Oracle and/or its affiliates. All rights reserved.
More informationBig Data at Cloud Scale
Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For
More informationCost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
More informationPreview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.
Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any
More informationTax Fraud in Increasing
Preventing Fraud with Through Analytics Satya Bhamidipati Data Scientist Business Analytics Product Group Copyright 2014 Oracle and/or its affiliates. All rights reserved. 2 Tax Fraud in Increasing 27%
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationSession 1: Introduction to Oracle's R Technologies
Session 1: Introduction to Oracle's R Technologies Mark Hornick, Director, Oracle Advanced Analytics Development Oracle Advanced Analytics Topics What is R? Oracle R Enterprise motivation
More informationFrom Spark to Ignition:
From Spark to Ignition: Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 San Francisco, CA What s in Store For This Presentation? 1. MemSQL: A real-time database for
More informationBig Data: Are You Ready? Kevin Lancaster
Big Data: Are You Ready? Kevin Lancaster Director, Engineered Systems Oracle Europe, Middle East & Africa 1 A Data Explosion... Traditional Data Sources Billing engines Custom developed New, Non-Traditional
More informationData processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
More informationNew Clinical Research & Care Opportunities Through Big Data Informatics
New Clinical Research & Care Opportunities Through Big Data Informatics Gregory A. Jones Chief Technology Officer Health Sciences Global Business Unit September 2014 Safe Harbor Statement The following
More informationUNIFY YOUR (BIG) DATA
UNIFY YOUR (BIG) DATA ANALYTIC STRATEGY GIVE ANY USER ANY ANALYTIC ON ANY DATA Scott Gnau President, Teradata Labs scott.gnau@teradata.com t Unify Your (Big) Data Analytic Strategy Technology excitement:
More informationOracle Big Data Discovery Unlock Potential in Big Data Reservoir
Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014 Copyright 2014, Oracle and/or its affiliates. All
More informationSafe Harbor Statement
Defining a Roadmap to Big Data Success Robert Stackowiak, Oracle Vice President, Big Data 17 November 2015 Safe Harbor Statement The following is intended to outline our general product direction. It is
More informationIBM Netezza High Capacity Appliance
IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data
More informationHigh-Performance Analytics
High-Performance Analytics David Pope January 2012 Principal Solutions Architect High Performance Analytics Practice Saturday, April 21, 2012 Agenda Who Is SAS / SAS Technology Evolution Current Trends
More informationEMC Greenplum Driving the Future of Data Warehousing and Analytics. Tools and Technologies for Big Data
EMC Greenplum Driving the Future of Data Warehousing and Analytics Tools and Technologies for Big Data Steven Hillion V.P. Analytics EMC Data Computing Division 1 Big Data Size: The Volume Of Data Continues
More informationWHAT S NEW IN SAS 9.4
WHAT S NEW IN SAS 9.4 PLATFORM, HPA & SAS GRID COMPUTING MICHAEL GODDARD CHIEF ARCHITECT SAS INSTITUTE, NEW ZEALAND SAS 9.4 WHAT S NEW IN THE PLATFORM Platform update SAS Grid Computing update Hadoop support
More informationGetting Started with Oracle Data Miner 11g R2. Brendan Tierney
Getting Started with Oracle Data Miner 11g R2 Brendan Tierney Scene Setting This is not about DB log mining This is an introduction to ODM And how ODM can be included in OBIEE (next presentation) Domain
More informationBig Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014
Big Data Analytics An Introduction Oliver Fuchsberger University of Paderborn 2014 Table of Contents I. Introduction & Motivation What is Big Data Analytics? Why is it so important? II. Techniques & Solutions
More informationConstructing a Data Lake: Hadoop and Oracle Database United!
Constructing a Data Lake: Hadoop and Oracle Database United! Sharon Sophia Stephen Big Data PreSales Consultant February 21, 2015 Safe Harbor The following is intended to outline our general product direction.
More informationCustomized Report- Big Data
GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.
More informationArchitecting your Business for Big Data Your Bridge to a Modern Information Architecture
Architecting your Business for Big Data Your Bridge to a Modern Information Architecture Robert Stackowiak Vice President, Information Architecture & Big Data Oracle Safe Harbor Statement The following
More informationwhitepaper Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R
Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R Table of Contents 3 Predictive Analytics with TIBCO Spotfire 4 TIBCO Spotfire Statistics Services 8 TIBCO Enterprise Runtime
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationWHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics
WHITE PAPER Harnessing the Power of Advanced How an appliance approach simplifies the use of advanced analytics Introduction The Netezza TwinFin i-class advanced analytics appliance pushes the limits of
More informationPredictive Analytics: Turn Information into Insights
Predictive Analytics: Turn Information into Insights Pallav Nuwal Business Manager; Predictive Analytics, India-South Asia pallav.nuwal@in.ibm.com +91.9820330224 Agenda IBM Predictive Analytics portfolio
More informationGanzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
More informationBuilding and Deploying Customer Behavior Models
Building and Deploying Customer Behavior Models February 20, 2014 David Smith, VP Marketing and Community, Revolution Analytics Paul Maiste, President and CEO, Lityx In Today s Webinar About Revolution
More informationSunnie Chung. Cleveland State University
Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:
More informationHow to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationOracle BI Roadmap & Visual Analyzer Ljiljana Perica, Oracle Business Solution Leader Ljiljana.perica@oracle.com
Oracle BI Roadmap & Visual Analyzer Ljiljana Perica, Oracle Business Solution Leader Ljiljana.perica@oracle.com Copyright 2015, Oracle and/or its affiliates. All rights reserved. 1 Safe Harbor Statement
More informationConnecting Hadoop with Oracle Database
Connecting Hadoop with Oracle Database Sharon Stephen Senior Curriculum Developer Server Technologies Curriculum The following is intended to outline our general product direction.
More informationMike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.
Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,
More informationHigh Performance Data Management Use of Standards in Commercial Product Development
v2 High Performance Data Management Use of Standards in Commercial Product Development Jay Hollingsworth: Director Oil & Gas Business Unit Standards Leadership Council Forum 28 June 2012 1 The following
More informationPerformance and Scalability Overview
Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and
More informationHadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics
In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning
More informationUsing OBIEE for Location-Aware Predictive Analytics
Using OBIEE for Location-Aware Predictive Analytics Jean Ihm, Principal Product Manager, Oracle Spatial and Graph Jayant Sharma, Director, Product Management, Oracle Spatial and Graph, MapViewer Oracle
More informationArchitecting for the Internet of Things & Big Data
Architecting for the Internet of Things & Big Data Robert Stackowiak, Oracle North America, VP Information Architecture & Big Data September 29, 2014 Safe Harbor Statement The following is intended to
More informationUsing Data Mining and Machine Learning in Retail
Using Data Mining and Machine Learning in Retail Omeid Seide Senior Manager, Big Data Solutions Sears Holdings Bharat Prasad Big Data Solution Architect Sears Holdings Over a Century of Innovation A Fortune
More informationHadoop & SAS Data Loader for Hadoop
Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle
More informationHow Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization
More informationUsing MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationThis Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
More informationThe Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
More informationNative Connectivity to Big Data Sources in MSTR 10
Native Connectivity to Big Data Sources in MSTR 10 Bring All Relevant Data to Decision Makers Support for More Big Data Sources Optimized Access to Your Entire Big Data Ecosystem as If It Were a Single
More informationHarnessing the power of advanced analytics with IBM Netezza
IBM Software Information Management White Paper Harnessing the power of advanced analytics with IBM Netezza How an appliance approach simplifies the use of advanced analytics Harnessing the power of advanced
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationWhite Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics
White Paper Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics Contents Self-service data discovery and interactive predictive analytics... 1 What does
More informationHIGH PERFORMANCE ANALYTICS FOR TERADATA
F HIGH PERFORMANCE ANALYTICS FOR TERADATA F F BORN AND BRED IN FINANCIAL SERVICES AND HEALTHCARE. DECADES OF EXPERIENCE IN PARALLEL PROGRAMMING AND ANALYTICS. FOCUSED ON MAKING DATA SCIENCE HIGHLY PERFORMING
More informationForecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
More informationDeveloping Relevant Dining Visits with Oracle Advanced Analytics Olive Garden s transition toward tailoring guests experiences
Developing Relevant Dining Visits with Oracle Advanced Analytics Olive Garden s transition toward tailoring guests experiences Matt Fritz Senior Data Scientist Business Challenge Darden comprises several
More informationIn-Database Analytics
Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing
More information