Big Data Defined Introducing DataStack 3.0
|
|
- Anna Cobb
- 8 years ago
- Views:
Transcription
1 Big Data Big Data Defined Introducing DataStack 3.0 Inside: Executive Summary... 1 Introduction... 2 Emergence of DataStack DataStack 1.0 to DataStack 2.0 Refined for Large Data & Analytics... 4 DataStack 3.0: Confluence of New Technologies & DataStack Conclusion... 7 Executive Summary The technology of data warehouse and data mining has evolved over the years. Increasingly businesses are implementing these technologies to get new insights into their business. However, businesses are also dealing with a deluge of data, structured and unstructured, from various sources customers, partners, employees. The technology community has coined the term Big Data to describe this new era of data and data management. However, there is a lot of confusion around exactly what Big Data is and more importantly, how enterprises should think about Big Data within their organizations. Enterprises can think about Big Data as the third data stack in the organization. We refer to this as DataStack 3.0. DataStack 3.0, gives enterprises a framework to think about how Big Data fits into their data infrastructure architecture. This paper defines DataStack 3.0 in the context of historical trends in data and enterprises existing data infrastructure. This paper aims to help CIOs, MIS managers and functional heads of businesses take advantage of DataStack 3.0 in their organizations Persistent Systems Ltd. All rights reserved. 1
2 Introduction The technology of data warehousing and data mining has evolved over the years. Increasingly, businesses are implementing these technologies to get new insights into their business. However, businesses are also dealing with a deluge of data, structured and unstructured, from various sources customers, partners, employees. Unless this stream of data is tapped in a meaningful way, chances are that the insights companies are looking for will remain elusive. The technology community has coined the term Big Data to describe this new era of data and data management. However, there is a lot of confusion around exactly what Big Data is and more importantly, how enterprises should think about Big Data within their organizations. How do we take advantage of Big Data in the context of our current data infrastructure? Do we have to throw out existing data infrastructure to take advantage of Big Data? Does Big Data only apply to unstructured data or does it refer to structure data as well? Persistent has introduced the term DataStack 3.0 as a way to help enterprises think about how Big Data fits into their organization s data infrastructure. This paper defines DataStack 3.0 in the context of historical trends in data and an enterprise s existing data infrastructure. This paper aims to help CIOs, MIS managers and functional heads take advantage of Big Data within their organizations. DataStack New technology stack capable of handling Big Data in terms of volume, data source and type of data (structured, semistructured and unstructured) With the evolution of technologies that gather and analyze data, there are three distinct inflexion points that are discernible. We have tried to capture these technologies and the data in three different generations of what we term as the Data Stacks. The characteristics of the three data stacks differ in what type of data is collected and how it is processed. We introduce these terms below that will be elaborated later in the paper. DataStack 1.0 DataStack 2.0 DataStack 3.0 Application specific data typically used to store and record transactional data of a specific application Aggregation of application data, creating dimensional views and analytics on structured data New technology stack capable of handling Big Data both in terms of volume, data source and type of data (structured, semi-structured and unstructured). The majority of this paper will focus on DataStack 3.0. We elaborate on DataStack 1.0 and DataStack 2.0 separately on page 4 to give context to DataStack Persistent Systems Ltd. All rights reserved. 2
3 Emergence of DataStack 3.0 As data warehouses matured there was a mind-set change emerging with internet companies such as Google, Yahoo and Facebook. With tens of millions of users and more than a billion page views every day, these companies started collecting massive amounts of data without knowing what they were looking for. This led to a fundamental shift in thinking and the need for DataStack 3.0, which gives businesses the ability to leverage the data that is generated without knowing precisely how one plans to use it. One of the challenges that companies have faced is developing a scalable way of storing and processing all these bytes. Companies understand that using this historical data is a large part of how they can improve the user experience. The emergence of Big Data frameworks such as NoSql, Hadoop and related technologies, which provide a framework for large scale parallel processing using a distributed file system and the map-reduce programming paradigm, enabled developers to start interesting projects that were previously impossible due to their massive computational requirements. Some of these early projects have matured into publicly released features (i.e. the Facebook Lexicon) or are being used in the background to improve user experience on Facebook, Yahoo and Google. The list of projects that are using Big Data infrastructure has proliferated - from those generating mundane statistics about site usage, to others being used to fight spam and determine application quality. 80% of new information growth is unstructured content with 90% of that unmanaged. IDC One other trend that emerged in parallel is that a number of non-relational or NoSql databases such as Cassandra offer the ability to process vast amounts of data in a clustered environment. Traditional SQL servers allow users to create a schema and define the structure of that schema which often has very rigid rules. NoSql databases attempt to provide flexibility and scalability for web-scale data, new applications such as social apps and frequent schema changes due to changing data. The key motivators that are driving the adoption of DataStack 3.0 are: Shared Nothing Architecture to increase parallelism Reducing Infrastructure Cost by using commodity hardware Linear Scalability ability to add incremental capacity to storage systems with minimal overhead and no downtime. In some cases, the system should automatically balance load and adjust utilization across the new hardware High Write Throughput most of the applications store (and optionally index) tremendous amounts of data and require high aggregate write throughput High Availability & Disaster Recovery Need to provide a service with very high uptime to users that cover both planned and unplanned events such as software upgrades and unplanned failure of hardware Unstructured & Structured Data While DataStack 1.0 and DataStack 2.0 looked mostly at structured tabular data, DataStack 3.0 considers unstructured data as well. In fact, most of the explosion in data has been in unstructured data such as text files, documents, weblogs, , etc Persistent Systems Ltd. All rights reserved. 3
4 DataStack 1.0 to 2.0 Businesses have harnessed enterprise software such as Another trend that happened side by side was the change Online Transaction Processing (OLTP) systems through in the nature of data itself. Initially data was application the last two decades to make their businesses more driven and largely structured. We characterize this as efficient. From pure OLTP systems, the software has also DataStack 1.0. With the following explosion of data, a new evolved to add capabilities such as Data Warehouses to set of applications, which looked at aggregated data, help businesses get key insights into their customers and came into the fold. These applications allowed the meaningful reports on their data. OLTP systems have their enterprises to aggregate data, form dimensional views origin in the eighties and most enterprises such as banks and conduct analytics. This is what we call DataStack 2.0. used them for key transactions. With the emergence of Businesses driven by demand for more sophisticated UNIX and Windows server systems in the nineties, vendors queries on ever increasing data sets pushed the such as Oracle, IBM, and Microsoft introduced databases envelope of DataStack 1.0. As the workload increased, it for enterprises. As OLTP systems proliferated, SQL was evident that techniques had to evolve to tackle this became the standard method of implementation of OLTP workload. This led to DataStack 2.0, which addressed systems. SQL also enabled businesses to query some of the scalability and volume of data issues. transactional data to produce operational reports. For example, the transactional system to implement sales transactions could also produce reports on the sales by geographies or product codes. DataStack 2.0 Refined for Large Data & Analytics In the last few years data warehouses continued to refine. Softwares such as Cognos, Business Objects and Hyperion appeared in the market which enabled businesses to slice and dice their data, form dimensional views and produce dashboards of key metrics for their businesses. For example a telecommunications company could load their summarized call data records onto a data warehouse through a process known as ETL (Extract, Transform & Load) and use tools such as Cognos to view KPIs such as: 1) calls by region 2) calls by time of day and 3) voice vs. data calls. This data enabled the telecommunication companies to focus their marketing efforts, refine promotions and adjust pricing plans. non-sql algorithms to be easily embedded in the processing elements of its MPP streams without the typical intricacies of parallel or grid programming. The ability to run analytics of any complexity on stream against huge data volumes, eliminates the delays and costs of moving data to separate hardware. It also accelerates performance by orders of magnitude, making these data warehouse appliances the ideal platform for the convergence of data warehousing and advanced analytics. This evolution in data warehouse platforms and applications, which we term as DataStack 2.5, was driven mainly by the inadequacy of DataStack 2.0 to handle large volumes of data for analytics. Another phenomenon that has been taking place in data warehouse solutions is the emergence of customized hardware/software marketed as appliances such Oracle Exadata and IBM Netezza. These data warehouse appliance hardware components and intelligent system software are closely intertwined. The software is designed to fully exploit the hardware capabilities of the appliance and incorporate numerous innovations to offer exponential performance gains, whether for simple inquiries, complex ad-hoc queries or deep analytics. Systems such as Exadata and Netezza, which have brought the principles of MPP and data processing close to the source, are suited for advanced analytics on large data sets. These data warehouse appliances allow complex Some of the new technologies which became the foundation of Data Stack 2.5 are: Massively Parallel Processing (MPP) architectures enabled parallel execution of queries and eased the load on the processors by shifting some of the load to the storage path Column oriented Databases DBMS that stores its content by column rather than row. This has advantages in certain data warehouses, where aggregates are computed over large numbers of similar data items In memory Databases speeding up queries through caching of the database in memory 2012 Persistent Systems Ltd. All rights reserved. 4
5 A Comparison of DataStack 1.0, 2.0 and 3.0 DataStack 1.0 DataStack 2.0 DataStack 3.0 Relational Database Systems for Operational Store Enterprise Data Warehouse for Decision Support Integrated Platform for structured, semi-structured & unstructured data from any source Business Case/Need Record business events Support for Decision making Tap all data sources for insights Data Arrangement Highly normalized data Un-normalized dimensional model Schema less approach Data Horizon Short Couple of years Multi-year Data Quality Extremely high Not as essential as DataStack 1.0 Not a key requirement Size of Data GBs TBs PBs End user Access Through enterprise apps Through reporting systems Currently directly Language of Access SQL SQL/MDX MapReduce Type of Data Structured Structured Structured, semi-structured & unstructured 2012 Persistent Systems Ltd. All rights reserved. 5
6 DataStack 3.0: Confluence of New Technologies & DataStack 2.0 As businesses explore Big Data, they need to keep in mind that DataStack 3.0 is a confluence of new technologies for Big Data such as Cassandra, Hbase, Hadoop, MapReduce with the existing DataStack 2.0. The implementation of DataStack 3.0 has to integrate with existing DataStack 2.0. If DataStack 3.0 is not a complimentary extension to the existing data infrastructure, benefits will be nebulous. From the planning phases for DataStack 3.0, integration should be mapped out. When an organization is planning to rollout DataStack 3.0, various stakeholders across the organization should be involved to make it a successful. The various functional groups that need to participate are: IT Administration Plan the deployment of and configure DataStack 3.0. This needs to happen with participation of resources skilled in DataStack 3.0 deployment and planning Compliance & Audit departments Just like data for key transactional applications, companies should consider the compliance needs for the data collected, privacy requirements and the enforcement of those requirements ETL Administration Consider all data sources necessary for DataStack 3.0 Data Provider Identifies the data sources and is responsible for providing access to the technology team Data Architect Expert in modeling database schemas, knowledgeable in database implementation best practices and familiar with the company s particular database schema Data Scientist Acts as the bridge between the business and the technical team, helping to define the business insights that need to be extracted Reporting/Dashboards Developer Expert in development of reports and dashboards with one of the reporting/dashboarding tools such as IBM Cognos, Pentaho, MSTR, etc. If DataStack 3.0 is not a complimentary extension to the existing data infrastructure, benefits will be nebulous. Early in its maturity, DataStack 3.0 is primarily used for discovering use cases, and it is not uncommon for the results from the discovery exercise to be pushed back to DataStack 2.0 so as to enhance the analysis. All of the above personnel have to work closely to plan and execute how DataStack 3.0 will be leveraged to get the desired end results. They need to review the technology available, capabilities of resources (and service providers, if applicable) and bring together the right team. This figure below describes how the three data stacks can co-exist in an Enterprise. DataStack 3.0 stores historical information for analysis, just like DataStack 2.0. The difference is that DataStack 3.0 will handle the data sources which cannot be handled by DataStack 2.0. Early in its maturity, DataStack 3.0 is primarily used for discovering use cases, and it is not uncommon for the results from the discovery exercise to be pushed back to DataStack 2.0 so as to enhance the analysis. Essentially, all three data stacks will co-exist in the enterprise and the chance of Big Data replacing any one of them will be low for the foreseeable future. All three data stacks will co-exist in the enterprise and the chance of Big Data replacing any one of them will be low for the foreseeable future. Figure 1:DataStack Persistent Systems Ltd. All rights reserved. 6
7 Conclusion Data warehouse solutions are moving to what can be termed as DataStack 3.0. This has been pioneered by internet companies such Facebook, Google and Yahoo. The technology, incubated in those companies, is being rapidly adopted by mainstream enterprises. When it is adopted as part of a business, attention must be paid to how to integrate with DataStack 2.0 to get the desired return on investment. The traditional Enterprise Data Warehouses (EDWs) are evolving with the need to deal with petabyte scale data, which includes unstructured data.however, as enterprises rethink their EDW solutions, they will have to keep in mind that mere introduction of core Hadoop technologies such as MapReduce, HDFS, Hive will not necessarily give them greater insight into their business. All of the investments in traditional DWs, data marts, data hubs, operational data stores, etc. are reasonably safe from obsolescence. The reality is that the EDW is evolving into a platform in which all of these database architectures can and will co-exist. We see these requirements coming directly from CTOs and other senior decision-makers in large organizations who are driving convergence of investments across all of these formerly separate technology domains. Vendors are racing to address this convergence in their product portfolios. We believe that DataStack 3.0 will be a confluence of various technologies including traditional EDWs and they will co-exist. The enterprises who manage this confluence well, will benefit most from the right blend of established and emerging technologies. About Persistent Systems Established in 1990, Persistent Systems (BSE & NSE: PERSISTENT) is a global company specializing in software product development services. For more than two decades, Persistent has been an innovation partner for the world s largest technology brands, leading enterprises and pioneering start-ups. With a global team of 6,600+ employees, Persistent has 350+ customers spread across North America, Europe, and Asia. Today, Persistent focuses on developing bestin-class solutions in four key next-generation technology areas: Cloud Computing, Mobility, BI & Analytics, Collaboration across technology, telecommunications, life sciences, consumer packaged goods, banking & financial services and healthcare verticals. For more information, please visit:. India Persistent Systems Limited Bhageerath, 402, Senapati Bapat Road Pune Tel: +91 (20) Fax: +91 (20) USA Persistent Systems, Inc Laurelwood Road, Suite 210 Santa Clara, CA Tel: +1 (408) Fax: +1 (408) info@persistentsys.com DISCLAIMER: The trademarks or trade names mentioned in this paper are property of their respective owners and are included for reference only and do not imply a connection or relationship between Persistent Systems and these companies Persistent Systems Ltd. All rights reserved. 7
How to Enhance Traditional BI Architecture to Leverage Big Data
B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...
More informationManaging Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationOffload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper
Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)
More informationQLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
More informationBig Data at Cloud Scale
Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For
More informationBig Data Integration: A Buyer's Guide
SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology
More informationHarnessing the power of advanced analytics with IBM Netezza
IBM Software Information Management White Paper Harnessing the power of advanced analytics with IBM Netezza How an appliance approach simplifies the use of advanced analytics Harnessing the power of advanced
More informationIBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!
The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader
More informationAlexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
More informationA Tour of the Zoo the Hadoop Ecosystem Prafulla Wani
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationInteractive data analytics drive insights
Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has
More informationHow To Turn Big Data Into An Insight
mwd a d v i s o r s Turning Big Data into Big Insights Helena Schwenk A special report prepared for Actuate May 2013 This report is the fourth in a series and focuses principally on explaining what s needed
More informationBig Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
More informationwww.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationArchitecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
More informationUsing Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
More informationBusiness Intelligence / Big Data Consulting Service
Business Intelligence / Big Data Consulting Service DATASHEET Business Problem Enterprises and IT businesses have been accumulating an enormous amount of data for years (according to IDC data is growing
More informationSQL Maestro and the ELT Paradigm Shift
SQL Maestro and the ELT Paradigm Shift Abstract ELT extract, load, and transform is replacing ETL (extract, transform, load) as the usual method of populating data warehouses. Modern data warehouse appliances
More informationNavigating the Big Data infrastructure layer Helena Schwenk
mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining
More informationGetting Started Practical Input For Your Roadmap
Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson
More informationHadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard
Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop
More informationCustomized Report- Big Data
GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.
More informationLuncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
More informationBig Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth
MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager steve.gonzales@thinkbiganalytics.com
More informationEMC/Greenplum Driving the Future of Data Warehousing and Analytics
EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,
More informationKeywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning
More informationAligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
More informationAdvanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
More informationImplement Hadoop jobs to extract business value from large and varied data sets
Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to
More informationWell packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances
INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA
More informationBig Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
More informationWHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics
WHITE PAPER Harnessing the Power of Advanced How an appliance approach simplifies the use of advanced analytics Introduction The Netezza TwinFin i-class advanced analytics appliance pushes the limits of
More informationBig Data and Analytics 21 A Technical Perspective Abhishek Bhattacharya, Aditya Gandhi and Pankaj Jain November 2012
Big Data and Analytics 21 A Technical Perspective Abhishek Bhattacharya, Aditya Gandhi and Pankaj Jain November 2012 Between the dawn of civilization and 2003, the human race created 5 exabytes of data
More informationBig Data and Your Data Warehouse Philip Russom
Big Data and Your Data Warehouse Philip Russom TDWI Research Director for Data Management April 5, 2012 Sponsor Speakers Philip Russom Research Director, Data Management, TDWI Peter Jeffcock Director,
More informationDatenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
More informationA Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel
A Next-Generation Analytics Ecosystem for Big Data Colin White, BI Research September 2012 Sponsored by ParAccel BIG DATA IS BIG NEWS The value of big data lies in the business analytics that can be generated
More informationESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
More informationBig Data on the Open Cloud
Big Data on the Open Cloud Rackspace Private Cloud, Powered by OpenStack, Helps Reduce Costs and Improve Operational Efficiency Written by Niki Acosta, Cloud Evangelist, Rackspace Big Data on the Open
More informationBringing Big Data into the Enterprise
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
More informationExtending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012
Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster Nov 7, 2012 Who I Am Robert Lancaster Solutions Architect, Hotel Supply Team rlancaster@orbitz.com @rob1lancaster Organizer of Chicago
More informationThe 3 questions to ask yourself about BIG DATA
The 3 questions to ask yourself about BIG DATA Do you have a big data problem? Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation.
More informationNative Connectivity to Big Data Sources in MSTR 10
Native Connectivity to Big Data Sources in MSTR 10 Bring All Relevant Data to Decision Makers Support for More Big Data Sources Optimized Access to Your Entire Big Data Ecosystem as If It Were a Single
More informationApache Hadoop Patterns of Use
Community Driven Apache Hadoop Apache Hadoop Patterns of Use April 2013 2013 Hortonworks Inc. http://www.hortonworks.com Big Data: Apache Hadoop Use Distilled There certainly is no shortage of hype when
More informationBIG DATA-AS-A-SERVICE
White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers
More informationBIG DATA IS MESSY PARTNER WITH SCALABLE
BIG DATA IS MESSY PARTNER WITH SCALABLE SCALABLE SYSTEMS HADOOP SOLUTION WHAT IS BIG DATA? Each day human beings create 2.5 quintillion bytes of data. In the last two years alone over 90% of the data on
More informationEvaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing
Evaluating NoSQL for Enterprise Applications Dirk Bartels VP Strategy & Marketing Agenda The Real Time Enterprise The Data Gold Rush Managing The Data Tsunami Analytics and Data Case Studies Where to go
More informationArchitecting for the Internet of Things & Big Data
Architecting for the Internet of Things & Big Data Robert Stackowiak, Oracle North America, VP Information Architecture & Big Data September 29, 2014 Safe Harbor Statement The following is intended to
More informationTraditional BI vs. Business Data Lake A comparison
Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses
More informationThe Principles of the Business Data Lake
The Principles of the Business Data Lake The Business Data Lake Culture eats Strategy for Breakfast, so said Peter Drucker, elegantly making the point that the hardest thing to change in any organization
More informationNavigating Big Data business analytics
mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what
More informationBIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
More informationChapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationBig Data Zurich, November 23. September 2011
Institute of Technology Management Big Data Projektskizze «Competence Center Automotive Intelligence» Zurich, November 11th 23. September 2011 Felix Wortmann Assistant Professor Technology Management,
More informationBIG DATA AND MICROSOFT. Susie Adams CTO Microsoft Federal
BIG DATA AND MICROSOFT Susie Adams CTO Microsoft Federal THE WORLD OF DATA IS CHANGING Cloud What s making this possible? Electrical efficiency of computers doubles every year and ½. Laptops and mobile
More informationOracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>
s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline
More informationUnderstanding the Value of In-Memory in the IT Landscape
February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to
More informationHow Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization
More informationData warehouse and Business Intelligence Collateral
Data warehouse and Business Intelligence Collateral Page 1 of 12 DATA WAREHOUSE AND BUSINESS INTELLIGENCE COLLATERAL Brains for the corporate brawn: In the current scenario of the business world, the competition
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process
ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced
More informationAGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
More informationAchieving Business Value through Big Data Analytics Philip Russom
Achieving Business Value through Big Data Analytics Philip Russom TDWI Research Director for Data Management October 3, 2012 Sponsor 2 Speakers Philip Russom Research Director, Data Management, TDWI Brian
More informationMicrosoft Analytics Platform System. Solution Brief
Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal
More informationBIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP
BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP Business Analytics for All Amsterdam - 2015 Value of Big Data is Being Recognized Executives beginning to see the path from data insights to revenue
More informationEvolving Data Warehouse Architectures
Evolving Data Warehouse Architectures In the Age of Big Data Philip Russom April 15, 2014 TDWI would like to thank the following companies for sponsoring the 2014 TDWI Best Practices research report: Evolving
More informationForecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
More informationBig Data: Business Insight for Power and Utilities
Big Data: Business Insight for Power and Utilities A Look at Big Data By now, most enterprises have encountered the term Big Data. What they encounter less is an understanding of what Big Data means for
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationWhy DBMSs Matter More than Ever in the Big Data Era
E-PAPER FEBRUARY 2014 Why DBMSs Matter More than Ever in the Big Data Era Having the right database infrastructure can make or break big data analytics projects. TW_1401138 Big data has become big news
More informationBig Data and Healthcare Payers WHITE PAPER
Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other
More informationAffordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale
WHITE PAPER Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale Sponsored by: IBM Carl W. Olofson December 2014 IN THIS WHITE PAPER This white paper discusses the concept
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationDriving Peak Performance. 2013 IBM Corporation
Driving Peak Performance 1 Session 2: Driving Peak Performance Abstract We know you want the fastest performance possible for your deployments, and yet that relies on many choices across data storage,
More informationBig Data Can Drive the Business and IT to Evolve and Adapt
Big Data Can Drive the Business and IT to Evolve and Adapt Ralph Kimball Associates 2013 Ralph Kimball Brussels 2013 Big Data Itself is Being Monetized Executives see the short path from data insights
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationApache Hadoop: The Big Data Refinery
Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data
More informationBIG DATA THE NEW OPPORTUNITY
Feature Biswajit Mohapatra is an IBM Certified Consultant and a global integrated delivery leader for IBM s AMS business application modernization (BAM) practice. He is IBM India s competency head for
More informationW H I T E P A P E R. Building your Big Data analytics strategy: Block-by-Block! Abstract
W H I T E P A P E R Building your Big Data analytics strategy: Block-by-Block! Abstract In this white paper, Impetus discusses how you can handle Big Data problems. It talks about how analytics on Big
More informationReference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
More informationBig Data Success Step 1: Get the Technology Right
Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation
More informationP4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland
P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland IBM Center of Excellence for Data Science, Cognitive
More informationATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate
More informationProtecting Big Data Data Protection Solutions for the Business Data Lake
White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With
More informationBig Data Explained. An introduction to Big Data Science.
Big Data Explained An introduction to Big Data Science. 1 Presentation Agenda What is Big Data Why learn Big Data Who is it for How to start learning Big Data When to learn it Objective and Benefits of
More informationHow to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
More informationWEBAPP PATTERN FOR APACHE TOMCAT - USER GUIDE
WEBAPP PATTERN FOR APACHE TOMCAT - USER GUIDE Contents 1. Pattern Overview... 3 Features 3 Getting started with the Web Application Pattern... 3 Accepting the Web Application Pattern license agreement...
More informationBig Data: Moving Beyond the Buzzword
by Michael Garzone Solutions Director, Technology Solutions 972-530-5755 michael.garzone@ctghs.com Big Data seems to have become the latest marketing buzzword. While there is a lot of talk about it, do
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationThe Potential of Big Data in the Cloud. Juan Madera Technology Consultant juan.madera.jimenez@accenture.com
The Potential of Big Data in the Cloud Juan Madera Technology Consultant juan.madera.jimenez@accenture.com Agenda How to apply Big Data & Analytics What is it? Definitions, Technology and Data Science
More informationBIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
More informationVirtualizing Apache Hadoop. June, 2012
June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING
More informationChapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:
Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationConverged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities
Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling
More informationApplied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA
Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges
More informationBIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting
BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with
More informationPerformance and Scalability Overview
Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and
More information