Big Data Big Deal? Salford Systems

Size: px
Start display at page:

Download "Big Data Big Deal? Salford Systems www.salford-systems.com"

Transcription

1 Big Data Big Deal? Salford Systems

2 Copyright Salford Systems Big Data Is The New In Thing Google trends as of September 24, 2015 Difficult to read trade press without encountering Big Data stories

3 Copyright Salford Systems Zoom In On Last 5 Years Measured At January of Each Year Series More than tripling of searches between 2010 and Last year s growth only about 5% suggesting saturation or maturity

4 Copyright Salford Systems More Trends: Predictive Analytics and Machine Learning Predictive analytics has seen a fairly steady increase in searches Machine Learning went from 73 (March 2004) to 32 (March 2009) and back to 70 (March 2015)

5 Salford Systems 2015 Influential High Tech Marketing Book 1991 revised 2015 Incisive review of how new technologies evolve from startups to mainstream Traces successes and failures and looks for common themes Three editions have covered 3 different decades (80s, 90s, 2000s) all with the same conclusions Big Data is moving quickly with earliest components of the technology being about 10 years old

6 Life Cycle of Diffusion of Innovation Big Data In the beginning of Early Majority Phase Internet captured image

7 Copyright Salford Systems Pressure Is On for Big Data Capability Every serious corporation is expected to have a Big Data strategy IT and main lines of business have no choice other than to introduce and deploy Big Data projects and solutions Ideal situation for vendors Buyers do not really know what they are buying or why Buyers have a budget that must be spent A replay of the data warehouse revolution that gathered such strength in the 1990s Requires new hardware (for in-house solutions) Requires new software (some open source, some proprietary) Requires new IT skills and possibly extensive consulting Substantial investment up front with rewards unknown

8 Observation on 2013 Deployments Everyone's Doing It, No One Knows Why According to a recent Gartner report, 64% of enterprises surveyed indicate that they're deploying or planning Big Data projects. Yet even more acknowledge that they still don't know what to do with Big Data. Have the inmates officially taken over the Big Data asylum? The same enterprises that seem most confused about Big Data seem to be the ones launching Big Data projects. What gives? Gartner On Big Data: Everyone's Doing It, No One Knows Why 18/09/2013

9 Observation on 2013 Deployments Everyone's Doing It, No One Knows Why Gartner On Big Data: Everyone's Doing It, No One Knows Why 18/09/2013

10 Copyright Salford Systems Competitive Bragging Rights My Big Data is Bigger than your Big Data My Hadoop Cluster is bigger than your Hadoop Cluster No one is likely to beat Google at this game (although Google uses other Google-only software, not Hadoop) Obviously the question to answer is: what added value comes from all of this Big Data investment A reasonable cluster would have from 20 to 100 nodes At today s prices of $7,000 per node the costs are modest Roughly from $100K to $1million for a starter cluster Of course number of servers might need to go much higher Each server with 16TB storage, 64 GB RAM, 16 cores a solid component for such a cluster (priced at Dell.com)

11 Copyright Salford Systems Using Amazon AWS Servers Bloggers discuss some options which range in cost from $10,000 to $90,000 per month for a 10+ node cluster Cost for one year similar to our estimates for in-house hardware In other words, a relatively serious investment

12 Salford Systems 2015 Is Big Data Really New? VLDB Started in 1974 More than 40 years ago Understanding that data can and will exceed the capacity of one server or a few servers has been evident since the first VLDB conference in 1975 VLDB remained a relatively obscure sub-specialty

13 Copyright Salford Systems KDD Conference 1995 Knowledge Discovery In Data Bringing machine learning and large data bases together

14 What Enabled the new Big Data? Google introduced new technology MapReduce Facilitating massively parallel computation Yahoo developed open source software Hadoop Yahoo developers founded new companies (Cloudera, Hortonworks) Today there is a growing ecosystem of companies extending the initial capabilities Source: Glenn Klockwood,

15 Copyright Salford Systems What Exactly is Special About Big Data? Volume of data is in and of itself not very interesting or useful IF all you have is more and more of exactly the same data as before Data becomes more interesting when it is broadened to include greater variety For example, some new style peer-to-peer lenders leverage information about potential borrowers from social media and from applicant s online behavior Growing interest in text mining New age of Big Data has made it far more practical to unify multiple sources of diverse data into a single useful repository Unification might be virtual and managed by software

16 Copyright Salford Systems What Exactly Is Special About Hadoop Hadoop can function as a universal data store. You can throw anything and everything into it (spreadsheets, video, audio, conventional data tables) You don t have to plan before you store (unlike a traditional database which requires careful planning before you can do anything) Hadoop has been called the new tape. You just write to it. Hadoop data has also been described as write once read never meaning it becomes a data cemetery

17 What Exactly Is Special About Hadoop Hadoop allows you to defer all the work that normally is required for databases If you don t do the work before you wont be able to use the data later until that work is done (create schemas) Source: Glenn Klockwood,

18 Analytics on Hadoop Flood of new interest in making Hadoop useful Original definitive use-case: counting something If your data is so big that it must be distributed across possibly many machines (e.g. 1,000 servers) Each server can be organized to count something of interest in its local data store (doing anything on a single server is easy) We can then collect the subtotals and get a grand total Such a project can be expected to yield fast results no matter how many servers are involved Usual first example of working with Hadoop and MapReduce For more complex analytics Hadoop alone has been found to be intolerably slow Copyright Salford Systems

19 Copyright Salford Systems Romance of Big Data Most non-analytics professionals hated their mandatory university statistics courses and learned little In spite of the power of advanced analytics having been demonstrated time again since at least the 1980s it has been difficult to excite managers about the topic Once the topic of analytics was inextricably linked with Big Data the trade press and the popular press became enthralled Wikileaks and NSA phone call monitoring fueled the notion of Big Data as being all knowing and all powerful Suddenly analytics was seen as a source of power & control

20 Romance of Big Data Wikileaks and NSA phone call monitoring fueled the notion of Big Data as being all knowing and all powerful Suddenly analytics was seen as a source of power & control Source: Srikant Sastri,

21 Copyright Salford Systems Analytics Reality While we might now have access to huge repositories of data it is not the case we require or will use all of this data Impressive predictive models often are constructed from a relatively small number of core predictors Until recently major bank credit risk models might only leverage about 15 essential predictors New age modeling techniques might expand the set of predictors into the several hundreds We might have access to a huge number of predictors but for any given predictive project we will typically end up using a very small fraction of those predictors Once we have narrowed our focus we can continue analysis on the relevant subset of predictors

22 Mathematics of Rare Events Many analytical tasks focus on the prediction of rare events (fraud in a credit card transaction, conversion for an internet ad someone actually makes a purchase) Suppose a rare event occurs 1 in 1,000 chances One million chances will generate just 1,000 events We know that to optimally analyze 1,000 rare events we require all of the data of the rare events and a small sample of the common event (say 10,000) Even if we start with 1 million rows of extensive data we almost surely only need about 11,000 of these rows for first rate modeling In other words, the Big Data problem quickly becomes a small data problem Copyright Salford Systems

23 Value of added GOODs in small samples of BADS Sample Variance as 0:1 Ratio Increases (N1=250) Variance of discriminator as 0:1 ratio increases (N 1 =250) N 0 = Factor*N 1 Variance of Discrminator Vs 0:1 Ratio Series Assume there is a single relevant predictor X and we want to measure difference in mean of X between GOOD and BAD samples. Variance is reduced by 95% when factor is 20

24 Validation ROC Value of Increased Good/Bad Ratio Surprisingly low ratios sufficient Validation ROC by Good/Bad Ratio Bads 491 Bads 942 Bads 2495 Bads Good/Bad Ratio 24 Each curve has a given number of BADs and varies the ratio of GOODs Starting with more BADs yields better results as expected

25 Copyright Salford Systems Single Server Capacity Take a modest 64GB RAM server Can effectively work with a data set of about 2 million rows by 3000 columns Using learning machines that must hold the training data in RAM RAM is also needed for workspace which is why the entire RAM is not dedicated to data storage If this is not enough can easily scale up to 512GB RAM which accommodate 16 million rows instead (for example) Very few predictive modeling problems for which this capacity is far more than sufficient

26 Copyright Salford Systems What We Recommend When it comes to Big Data your organization has no choice Your organization will have to make the investments in hardware, software, and personnel (or get involved in the Cloud) Once the Big Data team has actually pulled something together determine what information they have been able to organize that is not typically available for your analytics Request access to that data or obtain an extract from the big data store that you can comfortably work with on a single server Use modern advanced analytics to ascertain the value (if any) of that added data

27 Copyright Salford Systems Technologies You Need to Know About Gradient Boosting Random Forests Many textbooks available, training videos These can be found in the offerings of all major vendors and in open source Technologies were first brought to the market by Salford Systems

Building your Big Data Architecture on Amazon Web Services

Building your Big Data Architecture on Amazon Web Services Building your Big Data Architecture on Amazon Web Services Abhishek Sinha @abysinha sinhaar@amazon.com AWS Services Deployment & Administration Application Services Compute Storage Database Networking

More information

Data processing goes big

Data processing goes big Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,

More information

BIG DATA MARKETING: THE NEXUS OF MARKETING, ANALYSTS, AND IT

BIG DATA MARKETING: THE NEXUS OF MARKETING, ANALYSTS, AND IT BIG DATA MARKETING: THE NEXUS OF MARKETING, ANALYSTS, AND IT The term Big Data is definitely a leading contender for the marketing buzz-phrase of 2012. On November 11, 2011, a Google search on the phrase

More information

How To Improve Your Profit With Optimized Prediction

How To Improve Your Profit With Optimized Prediction Higher Business ROI with Optimized Prediction Yottamine s Unique and Powerful Solution Forward thinking businesses are starting to use predictive analytics to predict which future business events will

More information

Hybrid: The Next Generation Cloud Interviews Among CIOs of the Fortune 1000 and Inc. 5000

Hybrid: The Next Generation Cloud Interviews Among CIOs of the Fortune 1000 and Inc. 5000 Hybrid: The Next Generation Cloud Interviews Among CIOs of the Fortune 1000 and Inc. 5000 IT Solutions Survey Wakefield Research 2 EXECUTIVE SUMMARY: Hybrid The Next Generation Cloud M ost Chief Information

More information

While a number of technologies fall under the Big Data label, Hadoop is the Big Data mascot.

While a number of technologies fall under the Big Data label, Hadoop is the Big Data mascot. While a number of technologies fall under the Big Data label, Hadoop is the Big Data mascot. Remember it stands front and center in the discussion of how to implement a big data strategy. Early adopters

More information

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate

More information

RevoScaleR Speed and Scalability

RevoScaleR Speed and Scalability EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution

More information

Big Data, Big Traffic. And the WAN

Big Data, Big Traffic. And the WAN Big Data, Big Traffic And the WAN Internet Research Group January, 2012 About The Internet Research Group www.irg-intl.com The Internet Research Group (IRG) provides market research and market strategy

More information

Big Data Integration: A Buyer's Guide

Big Data Integration: A Buyer's Guide SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology

More information

Ubuntu and Hadoop: the perfect match

Ubuntu and Hadoop: the perfect match WHITE PAPER Ubuntu and Hadoop: the perfect match February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction In many fields of IT, there are always stand-out technologies. This is definitely

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

The Next Wave of Data Management. Is Big Data The New Normal?

The Next Wave of Data Management. Is Big Data The New Normal? The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management

More information

Operational Analytics

Operational Analytics Operational Analytics Version: 101 Table of Contents Operational Analytics 3 From the Enterprise Data Hub to the Enterprise Application Hub 3 Operational Intelligence in Action: Some Examples 4 Requirements

More information

In-Database Analytics

In-Database Analytics Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Delivering Real-World Total Cost of Ownership and Operational Benefits

Delivering Real-World Total Cost of Ownership and Operational Benefits Delivering Real-World Total Cost of Ownership and Operational Benefits Treasure Data - Delivering Real-World Total Cost of Ownership and Operational Benefits 1 Background Big Data is traditionally thought

More information

DDN updates object storage platform as it aims to break out of HPC niche

DDN updates object storage platform as it aims to break out of HPC niche DDN updates object storage platform as it aims to break out of HPC niche Analyst: Simon Robinson 18 Oct, 2013 DataDirect Networks has refreshed its Web Object Scaler (WOS), the company's platform for efficiently

More information

BIG DATA USING HADOOP

BIG DATA USING HADOOP + Breakaway Session By Johnson Iyilade, Ph.D. University of Saskatchewan, Canada 23-July, 2015 BIG DATA USING HADOOP + Outline n Framing the Problem Hadoop Solves n Meet Hadoop n Storage with HDFS n Data

More information

Big Data Readiness. A QuantUniversity Whitepaper. 5 things to know before embarking on your first Big Data project

Big Data Readiness. A QuantUniversity Whitepaper. 5 things to know before embarking on your first Big Data project A QuantUniversity Whitepaper Big Data Readiness 5 things to know before embarking on your first Big Data project By, Sri Krishnamurthy, CFA, CAP Founder www.quantuniversity.com Summary: Interest in Big

More information

Big Data Analytics: Today's Gold Rush November 20, 2013

Big Data Analytics: Today's Gold Rush November 20, 2013 Copyright 2013 Vivit Worldwide Big Data Analytics: Today's Gold Rush November 20, 2013 Brought to you by Copyright 2013 Vivit Worldwide Hosted by Bernard Szymczak Vivit Leader Ohio Chapter TQA SIG Copyright

More information

Big Data at Cloud Scale

Big Data at Cloud Scale Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For

More information

Agile Business Intelligence Data Lake Architecture

Agile Business Intelligence Data Lake Architecture Agile Business Intelligence Data Lake Architecture TABLE OF CONTENTS Introduction... 2 Data Lake Architecture... 2 Step 1 Extract From Source Data... 5 Step 2 Register And Catalogue Data Sets... 5 Step

More information

Big Data Processing: Past, Present and Future

Big Data Processing: Past, Present and Future Big Data Processing: Past, Present and Future Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM

More information

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud Laurence Liew General Manager, APAC Economics Is Driving Big Data Analytics to the Cloud Big Data 101 The Analytics Stack Economics of Big Data Convergence of the 3 forces Big Data Analytics in the Cloud

More information

DEFINITELY. GAME CHANGER? EVOLUTION? Big Data

DEFINITELY. GAME CHANGER? EVOLUTION? Big Data Big Data EVOLUTION? GAME CHANGER? DEFINITELY. EMC s Bill Schmarzo and consultant Ben Woo weigh in on whether Big Data is revolutionary, evolutionary, or both. by Terry Brown EMC+ In a recent survey of

More information

Hadoop-based Open Source ediscovery: FreeEed. (Easy as popcorn)

Hadoop-based Open Source ediscovery: FreeEed. (Easy as popcorn) + Hadoop-based Open Source ediscovery: FreeEed (Easy as popcorn) + Hello! 2 Sujee Maniyam & Mark Kerzner Founders @ Elephant Scale consulting and training around Hadoop, Big Data technologies Enterprise

More information

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale The Power of Pentaho and Hadoop in Action Demonstrating MapReduce Performance at Scale Introduction Over the last few years, Big Data has gone from a tech buzzword to a value generator for many organizations.

More information

Cloud Computing Backgrounder

Cloud Computing Backgrounder Cloud Computing Backgrounder No surprise: information technology (IT) is huge. Huge costs, huge number of buzz words, huge amount of jargon, and a huge competitive advantage for those who can effectively

More information

Scalable Forensics with TSK and Hadoop. Jon Stewart

Scalable Forensics with TSK and Hadoop. Jon Stewart Scalable Forensics with TSK and Hadoop Jon Stewart CPU Clock Speed Hard Drive Capacity The Problem CPU clock speed stopped doubling Hard drive capacity kept doubling Multicore CPUs to the rescue!...but

More information

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA

More information

Customized Report- Big Data

Customized Report- Big Data GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.

More information

Chapter 1. Contrasting traditional and visual analytics approaches

Chapter 1. Contrasting traditional and visual analytics approaches Chapter 1 Understanding Big Data Analytics In This Chapter Defining Big Data Understanding Big Data Analytics Contrasting traditional and visual analytics approaches The era of Big Data is upon us. The

More information

Why Big Data in the Cloud?

Why Big Data in the Cloud? Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data

More information

The big data revolution

The big data revolution The big data revolution Friso van Vollenhoven (Xebia) Enterprise NoSQL Recently, there has been a lot of buzz about the NoSQL movement, a collection of related technologies mostly concerned with storing

More information

How to pick ediscovery software

How to pick ediscovery software How to pick ediscovery software WWW.CSDISCO.COM How to pick ediscovery software Here, from most important to least, are the factors you should consider in picking ediscovery software: 1 SPEED The most

More information

The State of Unassisted Support 2014

The State of Unassisted Support 2014 The State of Unassisted Support 2014 By John Ragsdale, Vice President, Technology and Social Research, TSIA TECHNOLOGY INSIGHT FIELD SERVICES MANAGED SERVICES SUPPORT SERVICES TSIA-TI-14-024 June 26, 2014

More information

Big Data Defined Introducing DataStack 3.0

Big Data Defined Introducing DataStack 3.0 Big Data Big Data Defined Introducing DataStack 3.0 Inside: Executive Summary... 1 Introduction... 2 Emergence of DataStack 3.0... 3 DataStack 1.0 to 2.0... 4 DataStack 2.0 Refined for Large Data & Analytics...

More information

Expert Reference Series of White Papers. Cloud Computing: What It Is and What It Can Do for You

Expert Reference Series of White Papers. Cloud Computing: What It Is and What It Can Do for You Expert Reference Series of White Papers Cloud Computing: What It Is and What It Can Do for You info@globalknowledge.net www.globalknowledge.net Cloud Computing: What It Is and What It Can Do for You Steve

More information

Big Data With Hadoop

Big Data With Hadoop With Saurabh Singh singh.903@osu.edu The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials

More information

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

More information

Best practices for managing the data warehouse to support Big Data

Best practices for managing the data warehouse to support Big Data E-Guide Best practices for managing the data warehouse to support Big Data The new challenge for IT and data warehousing teams is how to leverage existing technology investments along with emerging tools

More information

# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis

# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis Section 9 : Case Study # Objectives of this Session The Motivation For Hadoop What problems exist with traditional large-scale computing systems What requirements an alternative approach should have How

More information

Table of Contents. June 2010

Table of Contents. June 2010 June 2010 From: StatSoft Analytics White Papers To: Internal release Re: Performance comparison of STATISTICA Version 9 on multi-core 64-bit machines with current 64-bit releases of SAS (Version 9.2) and

More information

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof. CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing University of Florida, CISE Department Prof. Daisy Zhe Wang Cloud Computing and Amazon Web Services Cloud Computing Amazon

More information

Transcription. Founder Interview - Panayotis Vryonis Talks About BigStash Cloud Storage. Media Duration: 28:45

Transcription. Founder Interview - Panayotis Vryonis Talks About BigStash Cloud Storage. Media Duration: 28:45 Transcription Founder Interview - Panayotis Vryonis Talks About BigStash Cloud Storage Media Duration: 28:45 Feel free to quote any passage from this interview for your articles. Please reference cloudwards.net

More information

Big Data Success Step 1: Get the Technology Right

Big Data Success Step 1: Get the Technology Right Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation

More information

Ubuntu: helping drive business insight from Big Data

Ubuntu: helping drive business insight from Big Data WHITE PAPER Ubuntu: helping drive business insight from Big Data February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction For years, web giants such as Facebook, Google and ebay

More information

Speak<geek> Tech Brief. RichRelevance Distributed Computing: creating a scalable, reliable infrastructure

Speak<geek> Tech Brief. RichRelevance Distributed Computing: creating a scalable, reliable infrastructure 3 Speak Tech Brief RichRelevance Distributed Computing: creating a scalable, reliable infrastructure Overview Scaling a large database is not an overnight process, so it s difficult to plan and implement

More information

Integrating Big Data into Business Processes and Enterprise Systems

Integrating Big Data into Business Processes and Enterprise Systems Integrating Big Data into Business Processes and Enterprise Systems THOUGHT LEADERSHIP FROM BMC TO HELP YOU: Understand what Big Data means Effectively implement your company s Big Data strategy Get business

More information

Big Data and Apache Hadoop Adoption:

Big Data and Apache Hadoop Adoption: Expert Reference Series of White Papers Big Data and Apache Hadoop Adoption: Key Challenges and Rewards 1-800-COURSES www.globalknowledge.com Big Data and Apache Hadoop Adoption: Key Challenges and Rewards

More information

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required. What is this course about? This course is an overview of Big Data tools and technologies. It establishes a strong working knowledge of the concepts, techniques, and products associated with Big Data. Attendees

More information

White Paper: Big Data and the hype around IoT

White Paper: Big Data and the hype around IoT 1 White Paper: Big Data and the hype around IoT Author: Alton Harewood 21 Aug 2014 (first published on LinkedIn) If I knew today what I will know tomorrow, how would my life change? For some time the idea

More information

Expert Reference Series of White Papers. Cloud Computing: What It Is and What It Can Do for You

Expert Reference Series of White Papers. Cloud Computing: What It Is and What It Can Do for You Expert Reference Series of White Papers Cloud Computing: What It Is and What It Can Do for You 1-800-COURSES www.globalknowledge.com Cloud Computing: What It Is and What It Can Do for You Steve Baca, Global

More information

Big Data Comes of Age: Shifting to a Real-time Data Platform

Big Data Comes of Age: Shifting to a Real-time Data Platform An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for SAP April 2013 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents Introduction... 1 Drivers of Change...

More information

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution WHITEPAPER A Technical Perspective on the Talena Data Availability Management Solution BIG DATA TECHNOLOGY LANDSCAPE Over the past decade, the emergence of social media, mobile, and cloud technologies

More information

Investor Presentation. Second Quarter 2015

Investor Presentation. Second Quarter 2015 Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

NoSQL for SQL Professionals William McKnight

NoSQL for SQL Professionals William McKnight NoSQL for SQL Professionals William McKnight Session Code BD03 About your Speaker, William McKnight President, McKnight Consulting Group Frequent keynote speaker and trainer internationally Consulted to

More information

Google Lead Generation for Attorneys

Google Lead Generation for Attorneys 1 Google Lead Generation For Attorneys Leverage The Power Of AdWords To Grow Your Law Business FAST You re about to discover the secrets of fast legal practice success with Google AdWords. Google AdWords

More information

Applying Semantics to Unstructured Data (Big and Getting Bigger)

Applying Semantics to Unstructured Data (Big and Getting Bigger) Applying Semantics to Unstructured Data (Big and Getting Bigger) Wednesday, November 30, 2012 4:00 5:00 Bryan Bell Vice President, Enterprise Solutions, Expert System Lynda Moulton, Analyst & Consultant,

More information

White Paper: Hadoop for Intelligence Analysis

White Paper: Hadoop for Intelligence Analysis CTOlabs.com White Paper: Hadoop for Intelligence Analysis July 2011 A White Paper providing context, tips and use cases on the topic of analysis over large quantities of data. Inside: Apache Hadoop and

More information

Big Data. Fast Forward. Putting data to productive use

Big Data. Fast Forward. Putting data to productive use Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize

More information

Conducting a Successful Cloudmarket CIO

Conducting a Successful Cloudmarket CIO As companies emerge from challenging economic times and turn the corner to face tremendous opportunities, the CIO's role is more and more about strategy and optimizing business results. Today s complex

More information

Online Press Releases For The Offline Business

Online Press Releases For The Offline Business Online Press Releases For The Offline Business How To Get Page #1 Google Rankings In 24-72 Hours And Charge $250 To $1,000+ Submitting Online Press Releases For Brick And Mortar Businesses 2014 Andrew

More information

BUYER S GUIDE. The Unified Communications Buyer s Guide to Picking the Right Cloud Telephony Solution

BUYER S GUIDE. The Unified Communications Buyer s Guide to Picking the Right Cloud Telephony Solution BUYER S GUIDE The Unified Communications Buyer s Guide to Picking the Right Cloud Telephony Solution There s no better time for a business to embrace unified communications solutions. Actually, that s

More information

SOCIAL MEDIA ADVERTISING STRATEGIES THAT WORK

SOCIAL MEDIA ADVERTISING STRATEGIES THAT WORK SOCIAL MEDIA ADVERTISING STRATEGIES THAT WORK ABSTRACT» Social media advertising is a new and fast growing part of digital advertising. In this White Paper I'll present social media advertising trends,

More information

SEO Facts. Why do I need Search Engine Optimization (SEO)?... 2. How do you maintain quality and excellence in the delivery of your product?...

SEO Facts. Why do I need Search Engine Optimization (SEO)?... 2. How do you maintain quality and excellence in the delivery of your product?... Why do I need Search Engine Optimization (SEO)?... 2 Why SEO takes time and requires an on-going investment?... 2 Why is there such a price difference between SEO companies?... 4 Any lock-in Contracts!?...

More information

Ten Mistakes to Avoid

Ten Mistakes to Avoid EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects

More information

Big Data and Natural Language: Extracting Insight From Text

Big Data and Natural Language: Extracting Insight From Text An Oracle White Paper October 2012 Big Data and Natural Language: Extracting Insight From Text Table of Contents Executive Overview... 3 Introduction... 3 Oracle Big Data Appliance... 4 Synthesys... 5

More information

Mind Commerce. http://www.marketresearch.com/mind Commerce Publishing v3122/ Publisher Sample

Mind Commerce. http://www.marketresearch.com/mind Commerce Publishing v3122/ Publisher Sample Mind Commerce http://www.marketresearch.com/mind Commerce Publishing v3122/ Publisher Sample Phone: 800.298.5699 (US) or +1.240.747.3093 or +1.240.747.3093 (Int'l) Hours: Monday - Thursday: 5:30am - 6:30pm

More information

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012 Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster Nov 7, 2012 Who I Am Robert Lancaster Solutions Architect, Hotel Supply Team rlancaster@orbitz.com @rob1lancaster Organizer of Chicago

More information

Advanced Big Data Analytics with R and Hadoop

Advanced Big Data Analytics with R and Hadoop REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional

More information

Sunnie Chung. Cleveland State University

Sunnie Chung. Cleveland State University Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:

More information

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved. Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,

More information

How Liferay Is Improving Quality Using Hundreds of Jenkins Servers

How Liferay Is Improving Quality Using Hundreds of Jenkins Servers How Liferay Is Improving Quality Using Hundreds of Jenkins Servers James Min Sr. Consultant, Liferay, Inc. liferay.com Liferay Background Open Source Portal & Collaboration Java Platform 250 employees

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Big Data Technologies Compared June 2014

Big Data Technologies Compared June 2014 Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development

More information

5 WAYS TO DOUBLE YOUR WEB SITE S SALES IN THE NEXT 12 MONTHS

5 WAYS TO DOUBLE YOUR WEB SITE S SALES IN THE NEXT 12 MONTHS HIGH CONVERSION LAB 5 WAYS TO DOUBLE YOUR WEB SITE S SALES IN THE NEXT 12 MONTHS By Ryan Berg Head Profit Scientist at HighConversionLab.com The Struggle to Build a Thriving Online Business as the Internet

More information

White. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014

White. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014 White Paper EMC Isilon: A Scalable Storage Platform for Big Data By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst April 2014 This ESG White Paper was commissioned by EMC Isilon and is distributed

More information

How to Turn the Promise of the Cloud into an Operational Reality

How to Turn the Promise of the Cloud into an Operational Reality TecTakes Value Insight How to Turn the Promise of the Cloud into an Operational Reality By David Talbott The Lure of the Cloud In recent years, there has been a great deal of discussion about cloud computing

More information

Research Note What is Big Data?

Research Note What is Big Data? Research Note What is Big Data? By: Devin Luco Copyright 2012, ASA Institute for Risk & Innovation Keywords: Big Data, Database Management, Data Variety, Data Velocity, Data Volume, Structured Data, Unstructured

More information

Cloud Computing Paradigm

Cloud Computing Paradigm Cloud Computing Paradigm Julio Guijarro Automated Infrastructure Lab HP Labs Bristol, UK 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

More information

INTRODUCTION TO CASSANDRA

INTRODUCTION TO CASSANDRA INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open

More information

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with

More information

Fast Analytics on Big Data with H20

Fast Analytics on Big Data with H20 Fast Analytics on Big Data with H20 0xdata.com, h2o.ai Tomas Nykodym, Petr Maj Team About H2O and 0xdata H2O is a platform for distributed in memory predictive analytics and machine learning Pure Java,

More information

Outline. What is Big data and where they come from? How we deal with Big data?

Outline. What is Big data and where they come from? How we deal with Big data? What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,

More information

Using Cloud Services for Test Environments A case study of the use of Amazon EC2

Using Cloud Services for Test Environments A case study of the use of Amazon EC2 Using Cloud Services for Test Environments A case study of the use of Amazon EC2 Lee Hawkins (Quality Architect) Quest Software, Melbourne Copyright 2010 Quest Software We are gathered here today to talk

More information

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/

More information

Terms and Conditions

Terms and Conditions - 1 - Terms and Conditions LEGAL NOTICE The Publisher has strived to be as accurate and complete as possible in the creation of this report, notwithstanding the fact that he does not warrant or represent

More information

Distributed Systems. Lec 2: Example use cases: Cloud computing, Big data, Web services

Distributed Systems. Lec 2: Example use cases: Cloud computing, Big data, Web services Distributed Systems Lec 2: Example use cases: Cloud computing, Big data, Web services 1 Example Use Cases Cloud computing (today) What it means and how it began Big data (today) Role of distributed systems

More information

Big Data - Infrastructure Considerations

Big Data - Infrastructure Considerations April 2014, HAPPIEST MINDS TECHNOLOGIES Big Data - Infrastructure Considerations Author Anand Veeramani / Deepak Shivamurthy SHARING. MINDFUL. INTEGRITY. LEARNING. EXCELLENCE. SOCIAL RESPONSIBILITY. Copyright

More information

ANALYTICS BUILT FOR INTERNET OF THINGS

ANALYTICS BUILT FOR INTERNET OF THINGS ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that

More information

Costs of Data Warehousing & Business Intelligence for the Small to Midsize Business

Costs of Data Warehousing & Business Intelligence for the Small to Midsize Business i White Paper Costs of Data Warehousing & Business Intelligence for the Small to Midsize Business By Ted Mountzuris March 6, 2004 ii Introduction Everyone seems to agree that a Business Intelligence (BI)

More information

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Kanchan A. Khedikar Department of Computer Science & Engineering Walchand Institute of Technoloy, Solapur, Maharashtra,

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

CA Big Data Management: It s here, but what can it do for your business?

CA Big Data Management: It s here, but what can it do for your business? CA Big Data Management: It s here, but what can it do for your business? Mike Harer CA Technologies August 7, 2014 Session Number: 16256 Insert Custom Session QR if Desired. Test link: www.share.org Big

More information

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Big Data Analytics DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Tom Haughey InfoModel, LLC 868 Woodfield Road Franklin Lakes, NJ 07417 201 755 3350 tom.haughey@infomodelusa.com

More information

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved. Object Storage: A Growing Opportunity for Service Providers Prepared for: White Paper 2012 Neovise, LLC. All Rights Reserved. Introduction For service providers, the rise of cloud computing is both a threat

More information