Big Data & The Cloud
|
|
- Steven Waters
- 8 years ago
- Views:
Transcription
1 DAMA NY CHAPTER PRESENTATION Big Data & The Cloud Extreme Performance Data Warehousing Inside Of The Cloud Robert J. Abate, CBIP, CDMP Solutions Principal, EIM & Analytics Practice EMC CConsulting sut January 19 th,
2 DAMA NY CHAPTER PRESENTATION Big Data & The Cloud AGENDA Background & Definitions The Challenge Architectural t Solutions To Big Data It s A Brave New World Example Case Studies Open Discussion 2
3 Background & Definitions 3
4 Big data will represent a hugely disruptive force during the next five years enabling levels of insight that are currently unachievable through any other means Gartner May
5 We Are Awash In Data In the information age, every organization is in the data business Data is growing exponentially, so are the challenges Complexity is causing insight to be lost Source: IDC Digital Universe White Paper, Sponsored by EMC, May
6 Pictorial Representation Of Information 6
7 Big Data: More Than Just About Volume Consider: Master Data, Fidelity, Complexity, Validity, Perishability, Linking Data Structured Transactional Data: POS transactions, call detail records, credit card transactions, shipping updates, purchase orders, payments, shipments, account transactions Unstructured Data: Web logs, newsfeeds, social media, geolocation, mobile, consumer comments, claims, doctor s notes, clinical studies, images, video, audio Device-generated Data: RFID sensors, smart meters, smart grids, GPS spatial, micro-payments Social Sensor/ locationbased Web traffic Variety Industryspecific Velocity Documents Transactional Data Video Volume Text Complexity Audio Images Smart Grid 7
8 The Typical BI/DW Environment Today 8
9 Big Data s Potential For Actionable Insight Today s Situation Less than 10% of the enterprise s data Rear-view mirror reporting, dashboards d and analysis Weeks, months, or even quarters old Incomplete, inaccurate, and disjointed data Architectures and methods that take 6 to 18 months to exploit Big Data Ramifications Vast majority of available sources and external data Forward looking or Windshield-view i predictions with recommendations Real-time near real-time Correlated, high confidence, governed data Vastly accelerated time to market 9
10 Time Really is Money! Value The Time Value Curve THE TIME VALUE CURVE Dr. Richard Hackathorn, Bolder Technology, Inc., All Rights Reserved. Used with Permission. Business Event Value Lo ost Capture Latency Data Ready For Analysis Analysis Latency Information Delivered Decision Latency Action Time Action Taken Time Data Lifecycle 10
11 Data Is Coming At Us Faster In a recent TDWI survey of 450 CIO s 17% have a real time data warehouse 90% plan on having a real time warehouse 75% will replace to get to a real-time solution REAL TIME IS A RAPIDLY BECOMING A NECESSARY FOUNDATION TO A DATA SOLUTION AND WITHOUT ARCHITECTURE THERE IS CHAOS! 11
12 Data Is Coming From All Directions Data is now commonly entering into the enterprise from external sources Government (Census, Revenues, ) Neilson, NPD Group (Sales) Bloomberg, NYSE (Financial Position) Experian, TransUnion, Equifax (Credit Reporting) Google Maps, MapInfo (Geospatial, ) Radian 6, Biz360, (Client Trend Data) Etc. 12
13 Need For Data Trust Compliance with laws Revenue Canada, Sarbanes Oxley [SOX], BASIL II, HIPAA, etc. Lack of confidence in the data Reports utilizing same data do not report same totals or computations Data not defined d and readily available Multiple sources of data have to be rationalized at each project start-up thereby wasting valuable time & $ on every yproject Data timeliness Manual process to collect, analyze and provide results Data integrityit Unknown filters, varying calculation/computations, fields used for data not indicative of field names, data passed along from one person to another to another to another.. 13
14 Summation Of Challenges We Are Observing Business mandate to obtain more value out of the data (get answers) Variety of sources, amounts, types and granularity of data that customers want to integrate is growing exponentially Need to shrink the latency between the business event and the data availability for analysis and decision-making Advancing agility of information is key Need for Data trust and Compliance with regulations 14
15 The Challenge Of Big Data 15
16 Old Journey To Information Maturity [EIM] Data Chaos Same type of data means different things in different systems Ex: AT&T is the same as AT&T Inc PROCESSES Master Data Publish and Subscribe to master data Ex: Single view of customer across all information Data Discovery Data systems Governance Data Integration Data Mining Data Analytics Analyzing the data. Looking for trends and correlations Data Chaos Defined Data Master Data Integrated Information Data Analytics Business Optimization TOOLS Data Discover Metadata ETL Suite BI / DW / OLAP Defined Data Integrated Predictive Define common meanings. Ex: Determine the sources, types, and properties of grouped (i.e.: customer) records Information Bring metadata together with information for reporting (BI) and warehousing (drilling and hierarchies). Information Using the analyzed data to optimize operations Wiki Type Sharing Of Self- Provisioned Environments Atomic Data Analytics 16
17 The Information Issue Is Too many organizations are not using information to its full advantage: 1 in 3 business leaders frequently make critical decisions without the information they need 1 in 2 business leaders do not have access to the information across their organization needed to do their jobs. 3 in 4 business leaders say more predictive information would drive better decisions Source: IBM Institute for Business Value, March
18 Information Trust & Business Alignment Harris Interactive recently polled 23,000 U.S. employees and found Only 37% said they have a clear understanding of what their organization is trying to achieve and why Only one in five was enthusiastic ti about their team and the organization s / corporation s goals Only one in five said they have a clear line of sight between their tasks and their team and organization s goals Only 15% felt that their organization fully enables them to execute key goals Only 20% fully trusted the organization they work for 18
19 Viewed Using An Seasonal Analogy If a football team had these players on the field: Only 4 of the 11 players on the field would know which goal is theirs Only 6 of the 11 would care Only 3 of the 11 would know what position they play and what they are supposed to do 9 players out of 11 would, in some way, be competing against their own team rather than the opponent 19
20 Perceived Complicated Landscape BI/DW is perceived as not enabling the business Inhibitor to corporate progress IT systems cannot be changed fast enough to meet market demands, seize opportunity or comply with a new requirement. Weak alignment between IT and business strategy Marked by an intractable language barrier. Business not always sure what Information or Dimensions they want or need How can IT provide without requirements? BI/DW is not known as the source of innovations The complexity of systems has caused BI/DW to be reactive rather than proactive Silo d solutions, db s and applications with trapped business rules Multiple sources of information and no single truth No Architectural Blueprints to the enterprise 20
21 The Business Intelligence Maturity Model 21
22 Advancing The Maturity Of Information 22
23 The big data impacts to both business and IT are significant; early adopters will fundamentally change their industries Business Expectations IT Ramifications More agile, more real-time, more accurate decision-making Predict and spot changes in dynamic and volatile markets Deeper understanding of customer preferences and behavior Greater fidelity in risk assessment and compliance enforcement Enhanced user experience that delivers insights to any device Operationalization of data scientists and analytic insights Tools and processes for data quality, governance, and security Cloud for self-service, collaboration, agility, and cost reduction Big data poses a major opportunity for CIOs to drive added value for the business, by deriving insights and identifying patterns from the huge amounts of data available Through 2015, organizations integrating high value, diverse new information sources and types into a coherent information management infrastructure will outperform industry peers financially by more than 20% Source: Gartner "The New Value Integrator," Insights from the Global Chief Financial Officer Study July
24 Architectural Solutions For Big Data 24
25 Big Data Requires Change Consider 100 GB would store the entire US Census DB basic information set for every living human being on the planet: Age, Sex, Income, Ethnicity, Language, Religion, Housing Status, Location into a 128 bit set That equates to about 6.75 millions rows of about 10 collumns Consider the Large Hadron Collinder at CERN Expected to produce 150,000 times as much raw data each year 25
26 The Big Change In Technologies Consider that Relational technologies were invented to get data in and organized, not designed nor organized to get it out RDBMS s were designed for efficient transactions processing on large data sets Adding, Updating Searching for & retrieving small amounts of data [2] Source: ACM Website The Pathologies of Big Data, Adam Jacobs, 7/6/09 26
27 Data Warehouses Were An Answer DW was classically ll designed d as copy of transaction data specifically structured for query and analysis General approach is bulk ETL into a DB designed for queries Big data changes the answer Traditional RDBMS-based dimensional modeling and cube-based based OLAP turns out to be to slow or to limited to support asking the really interesting questions of warehoused data [2] To achieve acceptable performance for highly order-dependent queries on truly large data, one must be willing to consider abandoning the purely relational database model [2] [2] Source: ACM Website The Pathologies of Big Data, Adam Jacobs, 7/6/09 27
28 Voluminous Data Sets What makes large data sets are repeated observations over time/space Web log has M s visits over handful pages Retailer has 10K products, M custs, but B trans Hi-Res Scientific like fmri 1K GB per view Large datasets t Spatial or Temporal dim s Cardinalities (distinct observations) is usually small with regard to total # of observations 28
29 Technology Solutions Appeared 29
30 Lets Talk Technical Solutions Sequential and/or Distributed File-Based Solutions Oracle Exadata, Hadoop, etc. Columnar (compression) )/ Multi-Level Tables Solves challenge of retrieving entire row Par-Excel, Vertica, Sybase, etc. Distributed MPP Teradata, Greenplum, etc. Polymorphic Combination of Columnar & MPP 30
31 Finding Answers Sequentially With OLTP Random access is slower than sequential The advantage gained by doing all data access in sequential order is often 4x 10x Many orders of magnitude! [2] Source: ACM Website The Pathologies of Big Data, Adam Jacobs, 7/6/09 31
32 Distributed File: Partitioning With OLTP Partitioning can solve challenges of data growth, but true distributed processing utilizing MPP is best (author s opinion) 32
33 Distributed File: Partitioning Viewed Q: What was the total transactions (sales) amount for May 20 and May ? Sales Table 5/17 Select sum(sales_amount) From SALES Where sales_date between to_date( 05/20/2009, MM/DD/YYYY ) And to_date( 05/22/2009, MM/DD/YYYY ); Only the 2 relevant partitions are read 5/18 5/19 5/20 5/21 5/22 Source: Extreme Performance With Oracle Data Warehousing 33
34 Distributed File: Open Source (Hadoop) Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license. It enables applications to work with thousands of nodes and petabytes of data Hadoop was inspired by Google's MapReduce and Google File System (GFS) papers. Hadoop is a top-level Apache project being built and used by a global community of contributors using the Java programming language. Yahoo! has been the largest contributor to the project, and uses Hadoop extensively across its businesses. Source: Wikipedia Hadoop 34
35 Distributed File: Hash-Based Distribution In a hash-based data distribution, the data is distributed across multiple platforms for parallelism li of queries 35
36 Columnar: Storage In a table with say 256 columns, a lookup will retrieve all the data in the row (disk bound) Columnar storage reduces this I/O bandwidth by storing column data using compression State (50 combinations stored) Master (compressed) table has pointers to State Source: Vertica Website 36
37 Columnar: Multi-Level Table Partitioning In multi-level table partitioning, data distribution occurs across multiple platforms in segmented tables for distribution of columnar queries This reduces the amount of work performed by each platform 37
38 MPP Shared Nothing Architectures Extreme scalability Elastic Expansion & Self-Healing Fault-Tolerance Unified Analytics Source: Greenplum Database 4.0: Critical Mass Innovation, White Paper, August
39 MPP Shared Nothing Architectures Source: Greenplum Database 4.0: Critical Mass Innovation, White Paper, August
40 The Ideal MPP Shared Nothing Poly-Morphic Storage Tabular, Columnar, NoSQL, etc. 40
41 It s A Brave New World 41
42 From the Old Stack to a New Ecosystem: Drivers for Change Many new data sources (organic growth, data services, M&A) Impractical to add new data sources because of tightly coupled pipeline More unstructured t data, including social media Lack of access to unstructured data; need analytics and classifiers that operate on it Less up front data integration Can t assume data is pre-integrated have to be able to locate and to query federated sources of data and content More need to track and leverage metadata Metadata is fragmented, jailed and inconsistent need agile, community approach Need for flexible, agile data structures Current structures are too rigid, and too close to the sources or the business reports More emphasis son dynamic views for purpose pose Need dynamic planning, creation and structuring of views that support analytics Information governance and management in a federated, regulated world Need flexible policy expression and enforcement, not just at point of access 42
43 An Information Platform with New DNA To Promote Agility, Business Value and Community 1. Coordinated ingestion of diverse information, changes, events 2. Metadata driven processing and management 3. Nuanced optimization on demand, multi-source, matching information needs 4. Broader reach of query contextual search, federation, materialization 5. Freedom from imposed information structure roll your own structure! 6. Navigation through information contextual, faceted, multi-dimensional 7. Visualization of information heat, clouds, clusters, flows 8. New data paths engendered by patterned consumption of entities 9. Reasoning about data set location, derivation, freshness, and obligations 10. User empowerment collaboration and talent development 43
44 Businesses Want Integrated, Timely Information for Purpose Area Latency Enrichment Query Federation Source Revolution Microbatch is the new Batch Tagging is the new Transformation Query is the new ETL Query Director is the new Query Optimizer Purposeful View is the new Master 44
45 Some Of The Newer Trends In Big Data Powerful Analytics What if, What will happen next, Self-service analytics? Build your own sandbox of data a Data Cloud Surrounded Warehouse Data Virtualization Abstracting the data from the systems, it complements existing data warehouses Many times the size of structured warehouse Provides for rapid analytic iterations 45
46 When You Link Structured & Unstructured Information You Get 46
47 Powerful Analytical Engines What is the best price to sell my product? 47
48 How Do I Do This? 48
49 How Do I Do This #2? 49
50 How Do I Do This #3? 50
51 Visualize The Information 51
52 Analytics: A Picture Is Worth A 1,000 Words 52
53 Data Virtualization Example 53
54 Data Virtualization In Practice 54
55 Enterprise Big Data Cloud 55
56 The Future Of Data Warehousing? The Ideal Abate Enterprise Data Cloud Truly Virtualized Data Environment Extreme Scale, Elastic Expansion Automated Metadata Discovery, Classification & Tagging Linearly Scalable Add 1x and get 2x performance Self Service Provisioning Single Point Of Management Resource utilization optimization Secure, Unified Data Access Single Point of Entry Portal based sharing of data sandboxes (wiki-type) Reduce TCO By Eliminating Excessive Licensing Fees Use of open source community to improve solution 56
57 Example Case Studies 57
58 BIG DATA ANALYTICS USE CASE Telecomm eco Provider Learns A Lesson Before investing $M of dollars on infrastructure, a provider learned where to invest their monies that would payoff Challenge 100TB Traditional EDW, Single Source Of Truth Operational Reporting & Financial Consolidation Heavy Governance And Control Unable To Support Critical Business Initiatives Customer Loyalty And Churn The #1 Business Initiative From The CEO Enterprise Data Cloud Architecture-Based Solution Extracted Data From EDW & Other Sources Generated Social Graph From Call Detail And Subscriber Data Within 2 Weeks Found Connected Subscribers 7X More Likely To Churn Than Average Users Now Deploying 1PB Production 58
59 BIG DATA ANALYTICS USE CASE Drive Multi-channel Campaign Optimization Retailer increases in-flight multi-channel effectiveness with customer and product insights HIGH Likeliho ood Of Conversi ion Legacy System Advanced Analytics Monitor crosschannel product sales effectiveness Big Data Analytics Integrate t customer behavioral data with social media sentiment data to yield new market, product and campaign insights LOW 59
60 BIG DATA ANALYTICS USE CASE Innovate With Big Data Analytics Big Data Analytics Accelerate Health Care 2.0 for Evidence-based Care Provider HIGH Legacy System BI Reporting Advanced Analytics Quality of Care Delivering 10 Years Of Data In Seconds Associative Rule Mining and User Clustering Improves Pathways Big Data Analytics External Data Sources Enable Personalized Medicine LOW Treatment Pathways ays on Summary Data Treatment Pathways ays on All the Data TRADITIONAL DATA LEVERAGED BIG DATA LEVERAGED 60
61 Open Exchange Of Ideas Speaker Contact Information: Robert J. Abate, CBIP, CDMP (201)
62 Credits To Quoted Authors Adam Jacobs is senior software engineer at 1010data Inc., where, among other roles, he leads the continuing development of Tenbase, the company s ultra-high-performance analytical database engine. He has more than 10 years of experience with distributed processing of big datasets, starting in his earlier career as a computational neuroscientist at Weill Medical College of Cornell University (where he holds the position of Visiting Fellow) and at UCLA. He holds a Ph.D. in neuroscience from UC Berkeley and a B.A. in linguistics from Columbia University. (QUOTED FROM: The Pathologies of Big Data, 7/6/09) Bill Schmarzo has over two decades of experience in data warehousing, BI and analytic applications (Metaphor Computers, 1984). Bill authored the Business Benefits Analysis methodology that links an organization s strategic business initiatives with their supporting data and analytic requirements, and coauthored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse Institute faculty as the head of the analytic applications i curriculum. Bill was VP of Analytics at Yahoo where he was responsible for the development of Yahoo s Advertiser and Web Site analytics products, including the delivery of actionable insights through a holistic user experience. For Business Objects, Bill oversaw the Analytic Applications business unit including the development, marketing and sales of Business Objects industry-leading analytic applications. Donald Sutton has over 20 years experience in Data Architecture, Analysis, Modeling, ETL, Implementation and Integration in the areas of Data Entry (OLTP) or ERP and 3rd Party COTS Applications, Operational Data Store (ODS), Master Data Store (MDS), Data Warehouse (DW) and Data Marts (DM) while providing Business Intelligence (BI) from multiple sources above. Passionate and motivated about sound design of data structures in all different data layers and the representation and transformation ti of data with the accounting and governance of data throughout h t all data layers while Providing Business Intelligence (BI) and analytics with Key Performance Indicators (KPI) along with business modeling in translating business requirements to data requirements. (QUOTED FROM: Current Warehousing Environment & Analytics Visualizations) 62
63
Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.
Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their
More informationArchitecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
More informationAdvanced In-Database Analytics
Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationGetting Started Practical Input For Your Roadmap
Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson
More informationThe Enterprise Data Hub and The Modern Information Architecture
The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader
More informationThe Lab and The Factory
The Lab and The Factory Architecting for Big Data Management April Reeve DAMA Wisconsin March 11 2014 1 A good speech should be like a woman's skirt: long enough to cover the subject and short enough to
More informationUNIFY YOUR (BIG) DATA
UNIFY YOUR (BIG) DATA ANALYTIC STRATEGY GIVE ANY USER ANY ANALYTIC ON ANY DATA Scott Gnau President, Teradata Labs scott.gnau@teradata.com t Unify Your (Big) Data Analytic Strategy Technology excitement:
More informationAn Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise
An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationVIEWPOINT. High Performance Analytics. Industry Context and Trends
VIEWPOINT High Performance Analytics Industry Context and Trends In the digital age of social media and connected devices, enterprises have a plethora of data that they can mine, to discover hidden correlations
More informationBig Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
More informationBig Data and Your Data Warehouse Philip Russom
Big Data and Your Data Warehouse Philip Russom TDWI Research Director for Data Management April 5, 2012 Sponsor Speakers Philip Russom Research Director, Data Management, TDWI Peter Jeffcock Director,
More informationApache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com
Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache
More informationEMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT
EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT Leveraging analytics for actionable insight ESSENTIALS Put your Big Data to work for you Pick the best-fit, priority business opportunity and
More informationData Virtualization A Potential Antidote for Big Data Growing Pains
perspective Data Virtualization A Potential Antidote for Big Data Growing Pains Atul Shrivastava Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and
More information5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
More informationAligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
More informationSAP Real-time Data Platform. April 2013
SAP Real-time Data Platform April 2013 Agenda Introduction SAP Real Time Data Platform Overview SAP Sybase ASE SAP Sybase IQ SAP EIM Questions and Answers 2012 SAP AG. All rights reserved. 2 Introduction
More informationEMC/Greenplum Driving the Future of Data Warehousing and Analytics
EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,
More informationBig Data Are You Ready? Jorge Plascencia Solution Architect Manager
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something
More informationDatenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationIntegrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April 9 2013
Integrating Hadoop Into Business Intelligence & Data Warehousing Philip Russom TDWI Research Director for Data Management, April 9 2013 TDWI would like to thank the following companies for sponsoring the
More informationThe Business Analyst s Guide to Hadoop
White Paper The Business Analyst s Guide to Hadoop Get Ready, Get Set, and Go: A Three-Step Guide to Implementing Hadoop-based Analytics By Alteryx and Hortonworks (T)here is considerable evidence that
More informationAchieving Business Value through Big Data Analytics Philip Russom
Achieving Business Value through Big Data Analytics Philip Russom TDWI Research Director for Data Management October 3, 2012 Sponsor 2 Speakers Philip Russom Research Director, Data Management, TDWI Brian
More informationNEWLY EMERGING BEST PRACTICES FOR BIG DATA
2000-2012 Kimball Group. All rights reserved. Page 1 NEWLY EMERGING BEST PRACTICES FOR BIG DATA Ralph Kimball Informatica October 2012 Ralph Kimball Big is Being Monetized Big data is the second era of
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationOracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
More informationBig Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
More informationRamesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com
Challenges of Handling Big Data Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com Trend Too much information is a storage issue, certainly, but too much information is also
More informationThe Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
More informationANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
More informationManaging Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationThis Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationHDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
More informationBig Data Success Step 1: Get the Technology Right
Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation
More informationDAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY
Big Data Analytics DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Tom Haughey InfoModel, LLC 868 Woodfield Road Franklin Lakes, NJ 07417 201 755 3350 tom.haughey@infomodelusa.com
More informationA TECHNICAL WHITE PAPER ATTUNITY VISIBILITY
A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY Analytics for Enterprise Data Warehouse Management and Optimization Executive Summary Successful enterprise data management is an important initiative for growing
More informationHow to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
More informationBig Data Patterns. Ron Bodkin Founder and President, Think Big
Big Data Patterns Ron Bodkin Founder and President, Think Big 1 About Me Ron Bodkin Founder and President, Think Big I have 9 years experience working with Big Data and Hadoop. In 2010, I founded Think
More informationEnd to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
More informationHP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica
HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety
More informationSpeeding ETL Processing in Data Warehouses White Paper
Speeding ETL Processing in Data Warehouses White Paper 020607dmxwpADM High-Performance Aggregations and Joins for Faster Data Warehouse Processing Data Processing Challenges... 1 Joins and Aggregates are
More informationExploiting Data at Rest and Data in Motion with a Big Data Platform
Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags
More informationTraditional BI vs. Business Data Lake A comparison
Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses
More informationArchitecting your Business for Big Data Your Bridge to a Modern Information Architecture
Architecting your Business for Big Data Your Bridge to a Modern Information Architecture Robert Stackowiak Vice President, Information Architecture & Big Data Oracle Safe Harbor Statement The following
More informationTrends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms
More informationTiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley
Tiber Solutions Understanding the Current & Future Landscape of BI and Data Storage Jim Hadley Tiber Solutions Founded in 2005 to provide Business Intelligence / Data Warehousing / Big Data thought leadership
More informationIl mondo dei DB Cambia : Tecnologie e opportunita`
Il mondo dei DB Cambia : Tecnologie e opportunita` Giorgio Raico Pre-Sales Consultant Hewlett-Packard Italiana 2011 Hewlett-Packard Development Company, L.P. The information contained herein is subject
More informationThe Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
More informationThe Potential of Big Data in the Cloud. Juan Madera Technology Consultant juan.madera.jimenez@accenture.com
The Potential of Big Data in the Cloud Juan Madera Technology Consultant juan.madera.jimenez@accenture.com Agenda How to apply Big Data & Analytics What is it? Definitions, Technology and Data Science
More informationTRANSFORM BIG DATA INTO ACTIONABLE INFORMATION
TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION Make Big Available for Everyone Syed Rasheed Solution Marketing Manager January 29 th, 2014 Agenda Demystifying Big Challenges Getting Bigger Red Hat Big
More informationBig Data at Cloud Scale
Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For
More informationQLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
More informationBig Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.
Big Data Technology ดร.ช ชาต หฤไชยะศ กด Choochart Haruechaiyasak, Ph.D. Speech and Audio Technology Laboratory (SPT) National Electronics and Computer Technology Center (NECTEC) National Science and Technology
More informationBig Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
More informationCollaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.
Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!
More informationBig Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA
Big Data: What You Should Know Mark Child Research Manager - Software IDC CEMA Agenda Market Dynamics Defining Big Data Technology Trends Information and Intelligence Market Realities Future Applications
More informationEvolution to Revolution: Big Data 2.0
Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents
More informationTHE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS
THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS WHITE PAPER Successfully writing Fast Data applications to manage data generated from mobile, smart devices and social interactions, and the
More informationTapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru
Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy Presented by: Jeffrey Zhang and Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop?
More informationIntegrating a Big Data Platform into Government:
Integrating a Big Data Platform into Government: Drive Better Decisions for Policy and Program Outcomes John Haddad, Senior Director Product Marketing, Informatica Digital Government Institute s Government
More informationCisco IT Hadoop Journey
Cisco IT Hadoop Journey Srini Desikan, Program Manager IT 2015 MapR Technologies 1 Agenda Hadoop Platform Timeline Key Decisions / Lessons Learnt Data Lake Hadoop s place in IT Data Platforms Use Cases
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationHortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
More informationWell packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances
INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA
More informationMaster Data Management and Data Warehousing. Zahra Mansoori
Master Data Management and Data Warehousing Zahra Mansoori 1 1. Preference 2 IT landscape growth IT landscapes have grown into complex arrays of different systems, applications, and technologies over the
More informationIII JORNADAS DE DATA MINING
III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE
More informationBig Data Challenges and Success Factors. Deloitte Analytics Your data, inside out
Big Data Challenges and Success Factors Deloitte Analytics Your data, inside out Big Data refers to the set of problems and subsequent technologies developed to solve them that are hard or expensive to
More informationApplied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA
Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges
More informationModern Data Warehouse
1 Modern Data Warehouse Are you ready for Big Data? Does your DWH / BI roadmap contain all the necessary components? IDG: Big data technologies describe a new generation of technologies and architectures,
More informationBig Data and Trusted Information
Dr. Oliver Adamczak Big Data and Trusted Information CAS Single Point of Truth 7. Mai 2012 The Hype Big Data: The next frontier for innovation, competition and productivity McKinsey Global Institute 2012
More informationExtend your analytic capabilities with SAP Predictive Analysis
September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics
More informationHow the oil and gas industry can gain value from Big Data?
How the oil and gas industry can gain value from Big Data? Arild Kristensen Nordic Sales Manager, Big Data Analytics arild.kristensen@no.ibm.com, tlf. +4790532591 April 25, 2013 2013 IBM Corporation Dilbert
More informationExecutive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform...
Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure Requirements... 5 Solution Spectrum... 6 Oracle s Big Data
More informationMike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.
Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,
More informationUSING BIG DATA FOR INTELLIGENT BUSINESSES
HENRI COANDA AIR FORCE ACADEMY ROMANIA INTERNATIONAL CONFERENCE of SCIENTIFIC PAPER AFASES 2015 Brasov, 28-30 May 2015 GENERAL M.R. STEFANIK ARMED FORCES ACADEMY SLOVAK REPUBLIC USING BIG DATA FOR INTELLIGENT
More informationHow to Leverage Big Data in the Cloud to Gain Competitive Advantage
How to Leverage Big Data in the Cloud to Gain Competitive Advantage James Kobielus, IBM Big Data Evangelist Editor-in-Chief, IBM Data Magazine Senior Program Director, Product Marketing, Big Data Analytics
More informationOracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics. An Oracle White Paper October 2013
An Oracle White Paper October 2013 Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics Introduction: The value of analytics is so widely recognized today that all mid
More informationA Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities
A Whole New World Big Data Technologies Big Discovery Big Insights Endless Possibilities Dr. Phil Shelley Query Execution Time Why Big Data Technology? Days EDW Hours Hadoop Minutes Presto Seconds Milliseconds
More informationHadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard
Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop
More informationData Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here
Data Virtualization for Agile Business Intelligence Systems and Virtual MDM To View This Presentation as a Video Click Here Agenda Data Virtualization New Capabilities New Challenges in Data Integration
More informationTop Ten Data Management Trends
Top Ten Data Management Trends September, 2013 Raj Gill Founder and President, Scalability Experts Executive Summary The amount of data that companies need to manage is doubling every couple of years and
More informationNavigating Big Data business analytics
mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what
More informationIndustry Impact of Big Data in the Cloud: An IBM Perspective
Industry Impact of Big Data in the Cloud: An IBM Perspective Inhi Cho Suh IBM Software Group, Information Management Vice President, Product Management and Strategy email: inhicho@us.ibm.com twitter: @inhicho
More informationBIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
More informationUsing Tableau Software with Hortonworks Data Platform
Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data
More informationUsing Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
More informationBIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
More informationBIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.
More informationGanzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
More informationTransforming the Telecoms Business using Big Data and Analytics
Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe
More informationDeploying Big Data to the Cloud: Roadmap for Success
Deploying Big Data to the Cloud: Roadmap for Success James Kobielus Chair, CSCC Big Data in the Cloud Working Group IBM Big Data Evangelist. IBM Data Magazine, Editor-in- Chief. IBM Senior Program Director,
More informationBIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization
BIG DATA STRATEGY Rama Kattunga Chair at American institute of Big Data Professionals Building Big Data Strategy For Your Organization In this session What is Big Data? Prepare your organization Building
More informationBIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
More informationBig Data and Its Impact on the Data Warehousing Architecture
Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research
More information