DATAOPT SOLUTIONS. What Is Big Data?

Similar documents
How To Handle Big Data With A Data Scientist

Powerful analytics. and enterprise security. in a single platform. microstrategy.com 1

BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS

There s no way around it: learning about Big Data means

Data Refinery with Big Data Aspects

MicroStrategy Analytics Platform

Understanding the Value of In-Memory in the IT Landscape

Big Data Integration: A Buyer's Guide

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau

Navigating Big Data business analytics

How To Make Data Streaming A Real Time Intelligence

!!!!! BIG DATA IN A DAY!

Big Data and Transactional Databases Exploding Data Volume is Creating New Stresses on Traditional Transactional Databases

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

We are Big Data A Sonian Whitepaper

Big Data for the Rest of Us Technical White Paper

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Microsoft Big Data. Solution Brief

Il mondo dei DB Cambia : Tecnologie e opportunita`

Best Practices for Hadoop Data Analysis with Tableau

NoSQL for SQL Professionals William McKnight

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Using Tableau Software with Hortonworks Data Platform

NEWLY EMERGING BEST PRACTICES FOR BIG DATA

BIG DATA TRENDS AND TECHNOLOGIES

BIG Data. An Introductory Overview. IT & Business Management Solutions

How to Enhance Traditional BI Architecture to Leverage Big Data

Simple. Extensible. Open.

Hadoop Big Data for Processing Data and Performing Workload

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

How To Use Big Data Effectively

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Focus on the business, not the business of data warehousing!

Tap into Big Data at the Speed of Business

SQL Server 2012 Parallel Data Warehouse. Solution Brief

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

W H I T E P A P E R. Building your Big Data analytics strategy: Block-by-Block! Abstract

Big Data Processing: Past, Present and Future

How To Learn To Use Big Data

Harnessing the Value of Big Data Analytics

What happens when Big Data and Master Data come together?

Big Data on Microsoft Platform

NZ BI User Group Auckland 18 September, Big Data Analytics with PowerPivot and Power View

BIG Data Analytics Move to Competitive Advantage

How To Use Hp Vertica Ondemand

Tap into Hadoop and Other No SQL Sources

Oracle Cloud: Line of Business PaaS Services. Balaji Yelamanchili Senior Vice President Product Development

Hadoop implementation of MapReduce computational model. Ján Vaňo

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Taming Big Data. 1010data ACCELERATES INSIGHT

INTRODUCTION TO CASSANDRA

SQL Server 2012 Business Intelligence Boot Camp

How SafeVelocity Improves Network Transfer of Files

Reimagining Business with SAP HANA Cloud Platform for the Internet of Things

WHITE PAPER. Get Ready for Big Data:

Modernizing Your Data Warehouse for Hadoop

CitusDB Architecture for Real-Time Big Data

More Data in Less Time

Microsoft SQL Server 2012 with Hadoop

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

So What s the Big Deal?

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

HOW THE DATA LAKE WORKS

IBM Software Integrating and governing big data

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

5 Signs You Might Be Outgrowing Your MySQL Data Warehouse*

Cloud Computing and Advanced Relationship Analytics

SQLSaturday #399 Sacramento 25 July, Big Data Analytics with Excel

Data Discovery, Analytics, and the Enterprise Data Hub

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

BIG DATA THE NEW OPPORTUNITY

Information Architecture

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Driving Better Marketing Results with Big Data and Analytics David Corrigan, IBM, Director of Product Marketing

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

A Comprehensive Review of Self-Service Data Visualization in MicroStrategy. Vijay Anand January 28, 2014

Evolution to Revolution: Big Data 2.0

End Small Thinking about Big Data

Report Data Management in the Cloud: Limitations and Opportunities

A technical paper for Microsoft Dynamics AX users

Azure Data Lake Analytics

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Leveraging Machine Data to Deliver New Insights for Business Analytics

Transcription:

DATAOPT SOLUTIONS What Is Big Data?

WHAT IS BIG DATA? It s more than just large amounts of data, though that s definitely one component. The more interesting dimension is about the types of data. So Big Data is increasingly about more complex structures and how you go about capturing and analyzing transaction and interaction data. We also see a lot of different data sources such as image, text, voice, machine data and so on, whose structure can change depending on the analysis. Big Data is also about big analytics. It s about applying more sophisticated algorithms to get a deeper insight out of the data without compromising on the scale. At the end of the day, the most important thing about the data is what you can do with it. Unlike with other techniques such as BI, Big Data allows people to tap into much larger amounts of data on the fly. It s about using the data to discover new insights.

BUSINESS REQUIREMENT ON BIG DATA Because of the business requirement of analyzing vast amount of ever changing structured and unstructured Big Data almost instantaneously, companies will be hard pressed to do this on their own. But given the fact that Big Data stored in cloud can be accessed from anywhere the internet is available and can be analysed almost instantaneously by third party service providers, outsourcing companies can offer to their clients value added services in the area of Big Data analytics without heavy investments on the part of clients in specialized hardware and software as was the case with traditional data analytics. This will bring down significantly costs (especially fixed costs) associated with building and maintaining analytics infrastructure and solution center. I expect all major IT Services and Consulting companies to invest heavily in building delivery capability in the area of Big Data/Analytics to tap into this opportunity.

WHAT MAKES IT BIG DATA? Volume: The amount of data generated by companies and their customers, competitors, and partners continues to grow exponentially Velocity: Data continues changing at an increasing rate of speed, making it difficult for companies to capture and analyze Variety: It s no longer enough to collect just transactional data Analysts are increasingly interested in new data types. These data types add richness that supports more detailed analyses Complexity: With more details and sources, the data is more complex and difficult to analyze

CHARACTERISTICS OF BIG DATA

AN APPETITE OF DATA

BIG DATA SOURCES

NEW DATA CATEGORIES How does your enterprise s data suddenly balloon from gigabytes to hundreds of terabytes and then on to petabytes? One way is that you start working with entirely new classes of information. While much of this new information is relational in nature, much is not. In the past, most relational databases held records of complete, finalized transactions. In the world of Big Data, sub-transactional data plays a big part, too, and here are a few examples: Click trails through a website Shopping cart manipulation Tweets Feedback & Comments Text messages

WHY NOT REPLACE ANALYTICAL RELATIONAL DATABASES WITH HADOOP? Analytical relational databases were created for rapid access to large data sets by many concurrent users. Typical analytical databases support SQL and connectivity to a large ecosystem of analysis tools. They efficiently combine complex data sets, automate data partitioning and index techniques, and provide complex analytics on structured data. They also offer security, workload management, and service-level guarantees on top of a relational store. Thus, the database abstracts the user from the mundane tasks of partitioning data and optimizing query performance. Since Hadoop is founded on a distributed file system and not a relational database, it removes the requirement of data schema. Unfortunately, Hadoop also eliminates the benefits of an analytical relational database, such as interactive data access and a broad ecosystem of SQL compatible tools. Integrating the best parts of Hadoop with the benefits of analytical relational databases is the optimum solution for a big data analytics architecture.

WHAT YOU CAN DO WITH BIG DATA? Big Data has the potential to revolutionize the way you do business. It can provide new insights into everything about your enterprise, including the following: The way your customers locate and interact with you The way you deliver products and services to the marketplace The position of organization vs. your competitors Strategies you can implement to increase profitability And many more

THE CHALLENGES OF CONVERTING BIG DATA Addressing the multiple challenges posed by big data volumes is not easy. Unlike transactional data, which can be stored in a stable schema that changes infrequently, interactional data types are more dynamic. They require an evolving schema, which is defined dynamically often on-the-fly at query runtime. The ability to load data quickly, and evolve the schema over time if needed, is a tremendous advantage for analysts who want to reduce time to valuable insights. Some data formats may not fit well into a schema without heavy pre-processing or may have requirements for loading and storing in their native format. Dealing with this variety of data types efficiently can be difficult. As a result, many organizations simply delete this data or never bother to capture it at all.