The 3 questions to ask yourself about BIG DATA
- Margery Allison
- 10 years ago
Transcription
Do you have a big data problem?

Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation. But if they focus on asking the right questions to solve the right problems, the journey can be much smoother, with better results.

Gartner predicts that until 2016, confusion around the term "big data" will inhibit growth and adoption of these tools. According to their report, "Recent Gartner surveys show that only 30 percent of organizations have invested in big data, of which only a quarter (eight percent of the total) have made it into production."

In the world of big data, there's a sort of enterprise equivalent of keeping up with the Joneses. While many claim to "do big data," few have truly adopted enterprise-wide solutions in the way that Google, Facebook, Twitter, and only a handful of other companies have.

A lot of companies think they're being left behind by not getting into big data. The first question they seem to be asking is, "How can I get started with big data?" But the real question is, "Can I derive value from my big data?"

One of the biggest things companies overlook is figuring out whether they even have a big data problem at all. What is typically missed amid all the marketing and buzz is that big data is a different kind of data that requires a different kind of solution.

Note that the original use case for big data was to solve a rather unique problem. It started with Google's white papers on MapReduce and the Google File System (GFS), written to solve the problem of storing the contents of every webpage on the Internet and then figuring out how they all link to each other (PageRank). That's not a problem most companies have.

What are your real analytics challenges?

Big data should be considered once the existing analytics solution can't accommodate the new data analysis needs.
Companies should ensure that they are using the right approach to develop a big data solution by first understanding the analytical challenges they face. Virtually all companies share a common problem when it comes to building an analytics solution: it's very difficult. Bringing together people, processes, and data from disparate parts of an organization is no simple task. If performing analytics were easy, we could predict the stock market and be rich.

If we use building a house as an analogy for building an analytics solution, there are a few equivalent problem areas:

Lack of vision results in a mess of disintegrated solutions

[Figure: Right Tools + Right Intent + Wrong Vision = Resulting Solution]

Without a coherent, long-term, and encompassing vision, the individual components of the house simply don't fit very well together.

Poor execution results in integrated tools that are unusable
[Figure: Right Tools + Wrong Intent + Right Vision = Resulting Solution]

Even with the right vision, not executing properly will result in something that may look somewhat like a house but clearly isn't what was intended.

Having the wrong tools results in components not working together

[Figure: Wrong Tools + Right Intent + Right Vision = Resulting Solution]

While this doesn't tend to happen very often (in fact, most organizations have too many tools), having the wrong tools will make it impossible to even start building your house.

When companies run into these problems with their analytical solutions, they're forced to perform manual and time-intensive tasks that are prone to human error.

Defining Big Data

As we mentioned, the first thing companies need to determine before implementing a big data solution is whether or not they have a big data problem. The three V's (Volume, Variety, and Velocity) are the key to defining and understanding the need for big data.

Volume, or the amount of data addressed by big data, is beyond that of a traditional database. There are no set limits to the volume, but it's generally accepted to be in the terabyte to petabyte range. Companies that employ big data solutions will generally look to preserve all data even if there is no immediate use for it. For companies challenged with handling such large volumes, a big data solution will scale horizontally, seamlessly adding more capacity to meet the increasing load.

Variety, or the range of data types and sources in big data, is endless. Big data solutions process a variety of multi-structured data, such as social media, text, server logs, geo-location, etc. A big data solution can store data in a schema-less architecture, which lets a company decide how to interpret a variety of data when it is extracted for analysis rather than when it is stored. In other words, the schema is applied at extract time instead of at load time.

Velocity is not only the speed with which the data goes in and out, but also the rate with which it changes.
Linking varying continuous streams of data from different sources, such as web, logs, and monitoring tools, can be performed by a big data solution. Companies that need to capture, store, and access data in real time, but not necessarily process it in real time, can turn to big data.
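The idea described under Variety, storing data without a schema and applying one only at extract time, can be illustrated with a minimal Python sketch. The record layout, the field names, and the `extract` helper are assumptions invented for illustration; they do not come from any particular big data product.

```python
import json

# Hypothetical raw records, stored exactly as they arrived.
# No schema is enforced at load time, so malformed input can land here too.
RAW_STORE = [
    '{"source": "web", "user": "a1", "url": "/home", "ts": "2014-03-01T10:00:00"}',
    '{"source": "social", "user": "b2", "text": "great service", "ts": "2014-03-01T10:00:05"}',
    '{"source": "web", "user": "c3", "url": "/pricing", "ts": "2014-03-01T10:00:09"}',
    "not valid json at all",
]


def extract(raw_lines, fields, source=None):
    """Apply a schema at extract time.

    Keeps only records that parse as JSON objects, match the optional
    source filter, and carry every requested field; each kept record is
    projected onto the requested fields.
    """
    rows = []
    for line in raw_lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # malformed input stays in the raw store but is skipped here
        if not isinstance(record, dict):
            continue
        if source is not None and record.get("source") != source:
            continue
        if all(field in record for field in fields):
            rows.append({field: record[field] for field in fields})
    return rows


# Two different "schemas" read from the same raw store.
page_views = extract(RAW_STORE, fields=("user", "url"), source="web")
mentions = extract(RAW_STORE, fields=("user", "text"), source="social")
```

The same raw store serves both extracts, which is the trade-off of this approach: loading is cheap and nothing is thrown away, but every consumer has to handle missing fields and malformed records on its own.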
Generally, a well-executed traditional analytics solution with the right tools, right execution, and a strong vision is sufficient for most companies. So, big data tools should be considered once a company's existing solution is no longer able to effectively store and process valuable data.

[Figure: Right Tools + Right Intent + Right Vision = Resulting Solution; once that solution is no longer effective, a new solution is built using new tools]

Exploring The Misconceptions

There are a number of common misconceptions about big data, created in part by marketing campaigns and media hype. One is that big data is synonymous with a lot of data, that is, only the Volume aspect of big data. The reality is that companies like IBM and Teradata have been creating solutions for dealing with large data volumes for over 30 years. When choosing a solution to solve a big data problem, all three V's should be taken into consideration.

It's important to keep in mind that big data solutions are not meant to replace existing technologies. While there is some overlap in capabilities between existing solutions and the new open source big data tools, they are meant to complement existing architectures to solve a new kind of problem.

It's also important to note that a big data solution is not stand-alone. Unless it can integrate into the rest of your analytics and application stacks, its value will be severely limited.

Some of our clients have made the mistake of using the term "big data" as a synonym for a single technology. This is another misconception: big data problems usually cannot be solved with a single technology, such as Hadoop.

Another popular misconception is that big data technologies can perform analytics in real time. This is a common misunderstanding of the velocity of data. The reality is that while these tools can easily scale to support capturing large volumes of data in real time, analyzing the data in a meaningful way (beyond simple counters or with limited data) can actually be quite slow. The general thinking is that data analysis queries that used to take many hours or days can be completed in about 30 minutes to a few hours.

Is big data a problem worth solving?

Once a company has established that it does have a big data problem, the next question to ask is, "Is it worth it to solve my big data problem?"

Many of our clients are looking for ways to reduce costs associated with storing and processing data, and look to big data tools as a possible answer, partly because tools like Hadoop and NoSQL datastores are open source with relatively cheap commercial support. But in reality, building and maintaining big data solutions is expensive. Big data tools like Hadoop require maintaining clusters of servers and hiring developers to leverage their platforms. They are also usually command-line driven, which requires the end user to understand things like the Linux command line. So while some money might be saved on licensing costs, there are still hardware, operational, development, and complexity costs to consider.

Before pursuing a big data solution, companies need to assess their analytical capabilities. If they struggle to properly analyze their non-big data assets, adding a big data tool will only lead to more complexity. Companies should be at maturity level 3 or higher before exploring big data tools. In many cases, simply putting a focus on fixing or enhancing existing analytics capabilities would provide the most value.

Another question is whether or not there is value within the big data they are planning to analyze. Simply having big data doesn't mean its value would outweigh the expense of analyzing it. Companies should look to execute a small proof of concept on their big data to see what insights can be determined. Examples range from manually looking through samples of log files to leveraging a cloud-based solution to rapidly prototype against a static set of data.

With this done, an organization can also start to determine the best approach to solving its big data problem. There are many options available, such as on-premises, cloud-based, custom-built, or a packaged industry solution.
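As a rough illustration of the smallest kind of proof of concept mentioned above, looking through samples of log files, the following Python sketch draws a random sample of log lines and counts what appears in it. The log layout ("timestamp level message"), the sample data, and the helper names are all hypothetical; real logs would need their own parsing.

```python
import random
from collections import Counter

# Hypothetical log lines in an invented "timestamp level message" layout.
LOG_LINES = [
    "2014-03-01T10:00:00 INFO user login",
    "2014-03-01T10:00:01 ERROR payment timeout",
    "2014-03-01T10:00:02 INFO page view",
    "2014-03-01T10:00:03 WARN slow query",
    "2014-03-01T10:00:04 ERROR payment timeout",
    "2014-03-01T10:00:05 INFO page view",
]


def sample_lines(lines, k, seed=0):
    """Return up to k lines chosen at random (seeded so runs are repeatable)."""
    lines = list(lines)
    rng = random.Random(seed)
    return rng.sample(lines, min(k, len(lines)))


def summarize(lines):
    """Count log levels and messages, skipping lines that do not fit the layout."""
    levels = Counter()
    messages = Counter()
    for line in lines:
        parts = line.split(" ", 2)
        if len(parts) < 3:
            continue
        _, level, message = parts
        levels[level] += 1
        messages[message] += 1
    return levels, messages


levels, messages = summarize(sample_lines(LOG_LINES, k=6))
for message, count in messages.most_common(2):
    print(count, message)
```

Even a throwaway script like this can show whether the data contains a signal worth paying to analyze at scale, before any cluster is provisioned.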
Analytics maturity levels:

1) Gut Driven
- No central data/reporting
- Operational reports/spreadsheets
- Manual analysis

2) Theory Driven
- Warehouse/marts
- Basic reporting capabilities (canned dashboards)
- No cohesive strategy or governance

3) Reactive Driven
- Data governance and strategy
- Well-defined business KPIs
- Ad-hoc data analysis

4) Factually Driven
- Engaging in exploratory analytics
- Beginning to leverage advanced analytics, basic predictive models

5) Strategy Driven
- Analytics at core of virtually all business decisions
- Advanced models able to predict business
- New SQL: in-memory storage; structured data; fast analytics; gigabytes to terabytes
- MPP: mixed storage; structured data; data warehouses; terabytes to low petabytes
- NoSQL: mixed storage; semi-structured/unstructured data; supports applications; terabytes+
- Hadoop: disk storage; semi-structured/unstructured data; batch processing; many petabytes+

Solving The Big Data Problem

When solving big data problems, companies need to have a full grasp of the problem at hand. There is a wide variety of data solutions available, each with its own advantages and disadvantages; however, there is no one solution that can effectively solve all problems. In order to choose the right technology, you must know what data needs to be captured, where it is going, who is using it, and why they are using it. Many factors need to be taken into account when choosing a technology to solve a big data problem, including cost, performance needs, and the underlying structure of the data.

The companies that pioneered big data leverage a wide variety of platforms to solve their data problems. For example, companies may use some of these tools in the following ways:

- A scalable in-memory solution to quickly analyze metrics based on a few days' worth of data in order to identify quickly changing trends and address them.
- An enterprise data warehouse for analyzing big data that has been properly processed, structured, and cleansed, alongside traditional data, for calculating KPIs, creating visualizations, etc.
- A NoSQL tool such as HBase to serve real-time counters, basic metrics, and data signals.
- Hadoop for storing the most detailed log data with its entire history for both ad-hoc and batch analysis of atomic data.
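The platform characteristics compared at the start of this section can be turned into a simple shortlist helper. The Python sketch below is only one way to encode them: the `Platform` class, the `shortlist` function, and the idea of filtering on data structure and workload alone are assumptions made for illustration. A real selection would also weigh cost, volume, and performance needs, as the text notes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Platform:
    name: str
    storage: str
    structures: frozenset  # kinds of data the platform is suited to
    workload: str          # primary use named in the comparison
    volume: str            # typical volume, kept as descriptive text


# Characteristics transcribed from the comparison in this section.
PLATFORMS = (
    Platform("New SQL", "in memory", frozenset({"structured"}),
             "fast analytics", "gigabytes to terabytes"),
    Platform("MPP", "mixed", frozenset({"structured"}),
             "data warehouses", "terabytes to low petabytes"),
    Platform("NoSQL", "mixed", frozenset({"semi-structured", "unstructured"}),
             "supports applications", "terabytes+"),
    Platform("Hadoop", "disk", frozenset({"semi-structured", "unstructured"}),
             "batch processing", "many petabytes+"),
)


def shortlist(structure, workload=None):
    """Return names of platforms suited to the data structure and,
    if given, whose primary workload matches exactly."""
    return [
        p.name
        for p in PLATFORMS
        if structure in p.structures and (workload is None or p.workload == workload)
    ]


print(shortlist("structured"))
print(shortlist("unstructured", "batch processing"))
```

An empty result, for example for structured data with a batch processing workload, is itself informative: it suggests the requirement does not map cleanly onto one category and that a combination of platforms may be needed.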
[Figure: reference architecture. Sources (operational systems, raw data) feed a data storage layer made up of an in-memory database (quick analytics on smaller data sets), massively parallel processing (store and process large amounts of operational data), Hadoop, and NoSQL. The storage layer feeds data virtualization / information delivery, which in turn feeds analytics (business intelligence, advanced analytics).]

Big data tools must be able to integrate into existing architectures to get maximum value.

Companies that have successfully adopted big data solutions build them alongside their traditional architectures. They will typically perform ad-hoc and exploratory analytics directly on their big data platforms. Scheduled extracts are also created to get the valuable data into a reportable format and into a platform that is easier for tools and people to leverage. The most important part of analyzing big data is that you can use it alongside your existing data tools. This is why virtually all companies that have adopted big data solutions use an enterprise data warehouse as a downstream platform for big data analysis.
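The scheduled extract described above, moving valuable data from the big data platform into a reportable format downstream, might look roughly like the following Python sketch. An in-memory SQLite database stands in for the enterprise data warehouse, and the event fields, the `daily_page_stats` table, and the `run_extract` function are hypothetical names chosen for illustration.

```python
import sqlite3
from collections import defaultdict

# Hypothetical raw events as they might sit on a big data platform.
RAW_EVENTS = [
    {"day": "2014-03-01", "page": "/home", "ms": 120},
    {"day": "2014-03-01", "page": "/home", "ms": 80},
    {"day": "2014-03-01", "page": "/pricing", "ms": 200},
    {"day": "2014-03-02", "page": "/home", "ms": 100},
]


def run_extract(events, conn):
    """Aggregate raw events per (day, page) and load the summary table."""
    totals = defaultdict(lambda: [0, 0])  # (day, page) -> [views, total_ms]
    for event in events:
        key = (event["day"], event["page"])
        totals[key][0] += 1
        totals[key][1] += event["ms"]

    conn.execute(
        "CREATE TABLE IF NOT EXISTS daily_page_stats ("
        "day TEXT, page TEXT, views INTEGER, avg_ms REAL, "
        "PRIMARY KEY (day, page))"
    )
    # INSERT OR REPLACE keeps the job safe to re-run for the same day.
    conn.executemany(
        "INSERT OR REPLACE INTO daily_page_stats VALUES (?, ?, ?, ?)",
        [(day, page, views, total / views)
         for (day, page), (views, total) in totals.items()],
    )
    conn.commit()


# SQLite in memory stands in for the downstream warehouse.
conn = sqlite3.connect(":memory:")
run_extract(RAW_EVENTS, conn)
rows = conn.execute(
    "SELECT day, page, views, avg_ms FROM daily_page_stats ORDER BY day, page"
).fetchall()
```

The detailed events stay on the big data platform for ad-hoc exploration, while reporting tools only ever see the small, structured summary table.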
Implementing a big data platform tends to be somewhat different from implementing traditional enterprise applications. In general, different big data tools solve a wide variety of data problems. Because some of these technologies are so new and are changing so rapidly, companies must be willing to experiment, fail, and try a different solution to solve their big data problems. And even once the platform is live, there should be continuous development and experimentation. The area of big data is still very new, and each company's data problem is unique. A high-level approach to big data might look like the following:

Analyzing big data can be a daunting task for companies that are just beginning to approach it. If not focused on the right things, it can quickly spiral out of control. By asking the right questions and looking beyond the hype and marketing, companies can determine if analyzing big data is right for them.
About Slalom Consulting

Slalom Consulting brings together business and technology expertise to help companies drive enterprise performance, accelerate innovation, enhance the customer experience, and increase employee productivity. The firm delivers award-winning solutions in areas such as information management and analytics, sales and marketing, organizational effectiveness, CFO advisory, mobility, and cloud through a national network of local offices and major alliance partners, including Microsoft, Salesforce.com, and Amazon Web Services. Founded in 2001 and based in Seattle, WA, Slalom has organically grown to more than 2,200 consultants. The company has been ranked as a "Top 10 Best Firms to Work For" by Consulting magazine four times, and earned recognition from Microsoft as a Partner of the Year five times. For more information, visit slalom.com.

Chris Schrader
Solution Principal
[email protected]
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
Next-Generation Cloud Analytics with Amazon Redshift
Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional
Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Big Data Integration: A Buyer's Guide
SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology
INTRODUCTION TO CASSANDRA
INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open
Ten Mistakes to Avoid
EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Software October 2010 TABLE OF CONTENTS INTRODUCTION... 3 BUSINESS AND IT DRIVERS... 4 NOSQL DATA STORES LANDSCAPE...
The Internet of Things and Big Data: Intro
The Internet of Things and Big Data: Intro John Berns, Solutions Architect, APAC - MapR Technologies April 22 nd, 2014 1 What This Is; What This Is Not It s not specific to IoT It s not about any specific
Navigating the Big Data infrastructure layer Helena Schwenk
mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining
Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: [email protected] Website: www.qburst.com
Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...
The Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014
Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014 Defining Big Not Just Massive Data Big data refers to data sets whose size is beyond the ability of typical database software tools
GigaSpaces Real-Time Analytics for Big Data
GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and
The Principles of the Business Data Lake
The Principles of the Business Data Lake The Business Data Lake Culture eats Strategy for Breakfast, so said Peter Drucker, elegantly making the point that the hardest thing to change in any organization
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
WA2192 Introduction to Big Data and NoSQL EVALUATION ONLY
WA2192 Introduction to Big Data and NoSQL Web Age Solutions Inc. USA: 1-877-517-6540 Canada: 1-866-206-4644 Web: http://www.webagesolutions.com The following terms are trademarks of other companies: Java
Why Big Data in the Cloud?
Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data
Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
Large scale processing using Hadoop. Ján Vaňo
Large scale processing using Hadoop Ján Vaňo What is Hadoop? Software platform that lets one easily write and run applications that process vast amounts of data Includes: MapReduce offline computing engine
Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012
Big Data Buzzwords From A to Z By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords Big data is one of the, well, biggest trends in IT today, and it has spawned a whole new generation
Big Data Readiness. A QuantUniversity Whitepaper. 5 things to know before embarking on your first Big Data project
A QuantUniversity Whitepaper Big Data Readiness 5 things to know before embarking on your first Big Data project By, Sri Krishnamurthy, CFA, CAP Founder www.quantuniversity.com Summary: Interest in Big
Hadoop implementation of MapReduce computational model. Ján Vaňo
Hadoop implementation of MapReduce computational model Ján Vaňo What is MapReduce? A computational model published in a paper by Google in 2004 Based on distributed computation Complements Google s distributed
White Paper: Datameer s User-Focused Big Data Solutions
CTOlabs.com White Paper: Datameer s User-Focused Big Data Solutions May 2012 A White Paper providing context and guidance you can use Inside: Overview of the Big Data Framework Datameer s Approach Consideration
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
Evolution to Revolution: Big Data 2.0
Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents
Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.
Big Data Technology ดร.ช ชาต หฤไชยะศ กด Choochart Haruechaiyasak, Ph.D. Speech and Audio Technology Laboratory (SPT) National Electronics and Computer Technology Center (NECTEC) National Science and Technology
ANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...
NextGen Infrastructure for Big DATA Analytics.
NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures
Microsoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
HARNESS IT. An introduction to business intelligence solutions. THE SITUATION THE CHALLENGES THE SOLUTION THE BENEFITS
HARNESS IT. An introduction to business intelligence solutions. THE SITUATION THE CHALLENGES THE SOLUTION THE BENEFITS THE SITUATION Data is growing exponentially in size and complexity. Traditional analytics
PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP
PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP Your business is swimming in data, and your business analysts want to use it to answer the questions of today and tomorrow. YOU LOOK TO
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES
COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY The business world is abuzz with the potential of data. In fact, most businesses have so much data that it is difficult for them to process
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
BIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to
How To Scale Out Of A Nosql Database
Firebird meets NoSQL (Apache HBase) Case Study Firebird Conference 2011 Luxembourg 25.11.2011 26.11.2011 Thomas Steinmaurer DI +43 7236 3343 896 [email protected] www.scch.at Michael Zwick DI
Big Data Analytics - Accelerated. stream-horizon.com
Big Data Analytics - Accelerated stream-horizon.com StreamHorizon & Big Data Integrates into your Data Processing Pipeline Seamlessly integrates at any point of your your data processing pipeline Implements
Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12
Hadoop http://hadoop.apache.org/ What Is Apache Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using
Using Tableau Software with Hortonworks Data Platform
Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data
OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT
WHITEPAPER OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT A top-tier global bank s end-of-day risk analysis jobs didn t complete in time for the next start of trading day. To solve
Transforming the Telecoms Business using Big Data and Analytics
Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe
Parallel Data Warehouse
MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
BIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015
Step by Step: Big Data Technology Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015 Data Sources IT Infrastructure Analytics 2 B y 2015, 20% of Global 1000 organizations
THE STATE OF THE DATA WAREHOUSE
March 2015 Sponsored by Introduction As the volume and types of business data have increased at a phenomenal pace, and the cost to store that data has plummeted, businesses have looked to data analytics
Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper
Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)
How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.
What is this course about? This course is an overview of Big Data tools and technologies. It establishes a strong working knowledge of the concepts, techniques, and products associated with Big Data. Attendees
InfiniteGraph: The Distributed Graph Database
A Performance and Distributed Performance Benchmark of InfiniteGraph and a Leading Open Source Graph Database Using Synthetic Data Objectivity, Inc. 640 West California Ave. Suite 240 Sunnyvale, CA 94086
Big Data Explained. An introduction to Big Data Science.
Big Data Explained An introduction to Big Data Science. 1 Presentation Agenda What is Big Data Why learn Big Data Who is it for How to start learning Big Data When to learn it Objective and Benefits of
Big Data and Transactional Databases Exploding Data Volume is Creating New Stresses on Traditional Transactional Databases
Big Data and Transactional Databases Exploding Data Volume is Creating New Stresses on Traditional Transactional Databases Introduction The world is awash in data and turning that data into actionable
Getting Started Practical Input For Your Roadmap
Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson
The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg
The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.
Welcome to The Future of Analytics In Action. 2015 IBM Corporation
Welcome to The Future of Analytics In Action Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss examples
THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS
THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS WHITE PAPER Successfully writing Fast Data applications to manage data generated from mobile, smart devices and social interactions, and the
How To Turn Big Data Into An Insight
mwd a d v i s o r s Turning Big Data into Big Insights Helena Schwenk A special report prepared for Actuate May 2013 This report is the fourth in a series and focuses principally on explaining what s needed
BIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
IBM: An Early Leader across the Big Data Security Analytics Continuum Date: June 2013 Author: Jon Oltsik, Senior Principal Analyst
ESG Brief IBM: An Early Leader across the Big Data Security Analytics Continuum Date: June 2013 Author: Jon Oltsik, Senior Principal Analyst Abstract: Many enterprise organizations claim that they already
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: [email protected]
Beyond Web Application Log Analysis using Apache™ Hadoop. A Whitepaper by Orzota, Inc.
Beyond Web Application Log Analysis using Apache™ Hadoop A Whitepaper by Orzota, Inc. 1 Web Applications As more and more software moves to a Software as a Service (SaaS) model, the web application has
SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM
David Chappell SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM A PERSPECTIVE FOR SYSTEMS INTEGRATORS Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Business
Data Modeling for Big Data
Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com
Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
Chapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014
Big Data Analytics An Introduction Oliver Fuchsberger University of Paderborn 2014 Table of Contents I. Introduction & Motivation What is Big Data Analytics? Why is it so important? II. Techniques & Solutions
Luncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
Integrated Big Data: Hadoop + DBMS + Discovery for SAS High Performance Analytics
Paper 1828-2014 Integrated Big Data: Hadoop + DBMS + Discovery for SAS High Performance Analytics John Cunningham, Teradata Corporation, Danville, CA ABSTRACT SAS High Performance Analytics (HPA) is a
WHY IT ORGANIZATIONS CAN'T LIVE WITHOUT QLIKVIEW
WHY IT ORGANIZATIONS CAN'T LIVE WITHOUT QLIKVIEW A QlikView White Paper November 2012 qlikview.com Table of Contents Unlocking The Value Within Your Data Warehouse 3 Champions to the Business Again: Controlled
PEPPERDATA IN MULTI-TENANT ENVIRONMENTS
PEPPERDATA IN MULTI-TENANT ENVIRONMENTS technical whitepaper June 2015 SUMMARY OF WHAT'S WRITTEN IN THIS DOCUMENT If you are short on time and don't want to read the
Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd's 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83–84
Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21–22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104–105 Application architecture, big data analytics
Machine Data Analytics with Sumo Logic
Machine Data Analytics with Sumo Logic A Sumo Logic White Paper Introduction Today, organizations generate more data in ten minutes than they did during the entire year in 2003. This exponential growth
Three Open Blueprints For Big Data Success
White Paper: Three Open Blueprints For Big Data Success Featuring Pentaho's Open Data Integration Platform Inside: Leverage open framework and open source Kickstart your efforts with repeatable blueprints
Big data: Unlocking strategic dimensions
Big data: Unlocking strategic dimensions By Teresa de Onis and Lisa Waddell Dell Inc. New technologies help decision makers gain insights from all types of data from traditional databases to high-visibility
SQL Server 2012 Parallel Data Warehouse. Solution Brief
SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...
Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere
Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere Agenda 1. Introductions & Objectives 2. Tableau Overview 3. Tableau Products 4. Tableau Architecture 5. Why Tableau? 6.
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT
EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT Leveraging analytics for actionable insight ESSENTIALS Put your Big Data to work for you Pick the best-fit, priority business opportunity and
Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA
Big Data: What You Should Know Mark Child Research Manager - Software IDC CEMA Agenda Market Dynamics Defining Big Data Technology Trends Information and Intelligence Market Realities Future Applications
BIG DATA TOOLS. Top 10 open source technologies for Big Data
BIG DATA TOOLS Top 10 open source technologies for Big Data We are in an ever expanding marketplace!!! With shorter product lifecycles, evolving customer behavior and an economy that travels at the speed
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
