Generating the Business Value of Big Data:
|
|
|
- Bruno Lane
- 10 years ago
- Views:
Transcription
1 Leveraging People, Processes, and Technology Generating the Business Value of Big Data: Analyzing Data to Make Better Decisions Authors: Rajesh Ramasubramanian, MBA, PMP, Program Manager, Catapult Technology Roberto Berezdivin, Ph.D. Systems Architect, Catapult Technology 11 Canal Center Plaza, Floor 2 Alexandria, VA
2 Introduction Big Data refers to large data sets whose size and disparity makes it difficult, if not impossible, for relational database software tools to capture, store, manage, and analyze the data. Relational databases, typical of structured data, cannot handle the scale and agility challenges that face modern applications, nor were they built to take advantage of the relatively inexpensive, cloud-based storage and processing power that is now available. The platforms, tools, and software available to store, process, and analyze the large datasets of unstructured data prevalent today are collectively known as Big Data technologies. As more and more companies incorporate efficient and scalable technology, data management and data storage is no longer the issue. Organizations generate constant data, through the use of the Internet, mobile applications, social media, internal documents, content and automated processes employed by the organization. The solutions available to the big Internet players Opportunity: 80-85% of global to process and analyze this voluminous data are data is unstructured. publicly available by open-source software communities. Meanwhile, the advent of cloud-based solutions has dramatically lowered the cost of storage and processing. Virtual file systems, either open source or vendor-specific, has helped transition from a managed infrastructure to a service-based approach. In addition, innovative designs for database management and cost-effective ways to support massively parallel processing have led to new products like nosql databases and the Apache Hadoop MapReduce platform. NoSQL was developed specifically to respond to the massive data of today, and improve upon the shortcomings of relational databases. Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on commodity hardware. According to a recent study, percent of global data that exists is unstructured, meaning that it has no pre-defined data model or is not organized in a pre-defined manner. It can come from such disparate sources as social media platforms (e.g., Facebook, Twitter); ; online purchases; online profiles; content management system footprint; and photos. Page 2
3 The large Internet players are already discovering great value in their data by identifying new customers, improving their products and service offerings, expanding their markets, and increasing profitability. The real questions for business now are: How do you put all this captured and stored data to valuable use? How do you analyze it to make better business decisions? The 3 V s The 3 V s that define the Big Data are: 1. Volume Currently there is exponential growth in data storage, as data is not just textual but comes in the form of videos, music, images, clickstream and blog content, often through social media channels. It has been recently projected that every individual is predicted to generate over 20 petabytes of data over the course of his/her lifetime. A recent projection by Paypal cites that every individual is predicted to generate over 20 petabytes of data over the course of his/her lifetime. (For context, a terabyte is bytes of digital information; a petabyte is bytes of digital information.) According to International Data Corporation (IDC), the digital universe will grow to 35 zettabytes (i.e. 35 trillion terabytes) globally by The point is, data is exploding. The response to this data boom, as well as the ubiquity of the cloud, will be a significant decrease in a your IT capital expenditure, as many organizations invest in data virtualization. At the same time, there will be an increase in operating expenditure as organizations move towards the use and exploitation of that data using cloud-based storage and processing solutions. 2. Velocity The explosion of data is happening almost in real time, as people turn to social media for updates about what is occurring in the world around them. No one waits for news anymore; the speed with which we are informed has literally become fractions of seconds. An interesting example was the earthquake of 2011 southwest of Washington D.C.; the first news of it arrived via Twitter minutes before the tremor was felt. Page 3
4 As more and more data is produced, it must be collected in shorter timeframes. Therefore, organizations require tools and platforms for real-time processing of data in order to achieve, and maintain, a competitive advantage in the marketplace. 3. Variety In the real world, data comes in different formats, from structured, data typically data contained in relational databases and spreadsheets with specific classification to unstructured data, which can t be as neatly classified (e.g., videos, images, SMS, social media content, PDFs, etc.) Veracity (Value) The accuracy, truthfulness, and quality of data are the most important aspect that fuels new insights into your organization and provides high value. The data that organizations collect is all about supporting the decisions that can have a major impact on the organization as a whole. Businesses are going to want as much quality information as possible to support the business case. Establishing trust in Big Data solutions probably presents the biggest challenge; but once overcome, it will introduce a solid foundation for successful decision-making within your organization. There is more data than ever from which business decisions can be made. According to a study done by Avanade, Inc., 46 percent of companies report they have made an inaccurate business decision as a result of bad or outdated data. In many cases, useful and necessary data to make business decisions are not collected and well-meaning managers end up guessing. It is therefore critical for organizations to address this issue and position itself to react quickly to fast-changing business conditions. For example, a user posts something like, I am interested in buying a new smart phone for my wife on her birthday on social media. A smart phone manufacturer s data engineers who analyze this unstructured data can infer information about the shopper s interest, such as: 1. He is married; 2. He is looking for a smart phone; and 3. The phone will be used by his wife. Page 4
5 In addition, if he is a previous or current customer, the phone manufacturer can pull his profile and better target the individual with various options compared to competitors. Harnessing this kind of unstructured data will help increase the the phone manufacturer s sales and revenue and target the customer with better products. Imagine that this kind of information is posted by users in various social media. The volume of information that is available for organizations to analyze and better target their customers help companies increase 46% of organizations report their market reach. While organizations can acquire a negative online reputation, data can be leveraged as a corrective. For example, a passenger is traveling from one city to another by bus. The bus breaks down on the way to its destination. The passenger takes pictures of the incapacitated bus and tweets those images with complaints about the bus breaking down. Smart data mining from the bus line s data analysis team could provide this information to their customer service department in the form of alerts. Customer service can then return a tweet that apologizes for the inconvenience, ensures fast repair, and promises better services By offering a free ticket back from the trip s original destination, or some other accommodation, the bus line can also rebuild goodwill and fortify its customer retention strategy. The bus line s Big Data solution has mined unstructured data to return an actionable solution. Currently, the challenge that businesses face is to transform raw data into meaningful information and provide actionable insights for better business decision-making. Basically, organizations that mine their data warehouses, transactional systems, and the social media footprints of their customers can benefit by discovering the preferences of their customers. They can establish a meaningful relationship between customer segments and product segments with a higher degree of correlation. they have made an inaccurate business decision as a result of bad or outdated data. Page 5
6 The diagram below encompasses Big Data Management, the technology used, and the benefits for an enterprise: Technology Implementation: A Case Study A Catapult customer implemented a new web portal and wanted to answer basic questions, such as How many people visited the portal? and On average, how much time did people spend on the portal? Catapult leveraged Apache Hadoop, the open-source platform that is applied to Big Data. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage (Apache Hadoop ). In order to answer the customer s business questions, Catapult leveraged web server access logs and an http requests log. Catapult used the Hadoop Distributed File System (HDFS), which breaks down data into smaller pieces for easier processing, and wrote a MapReduce program to identify the unique values based on the IP address. (MapReduce basically breaks down individual data elements, thus reducing the size of a data set. The reduce job takes the output from a map as input and combines those data segments into a still smaller data set.) The session ID plays a critical role in the mining of the web logs. This session information provided the vital information of how long visitors spent on the portal. The MapReduce was used to compute this in fully distributed modes of the cluster. Page 6
7 Finally, the parsed log files were stored as text file in HDFS. This parsed log file was loaded to a Hive data warehouse. (Built on top of Hadoop, Hive provides data summarization, query, and analysis). By writing nosql queries, Catapult got answers to the above business questions. With this information, the customer targeted the user base with appropriate application design and better user experience, which led to more quality information. Security and Privacy both a technical and sociologi- The privacy of data is another huge concern, and cal issue; a solution should be one that increases in the context of Big Data. Organizations should understand that managing data tives. addressed from both perspec- privacy is both a technical and sociological issue and a solution should be addressed from both perspectives. As enterprises become more and more dependent on data to drive business decisions (whether the data is available publicly or through internal collection processes), they face the risk of inaccurate, incomplete, and fraudulently manipulated data. In order to avoid these risks, organizations need to verify and validate all the data sources from which they analyze and use tools and processes to check for vulnerabilities. Enterprises should have a proper Big Data governance process in order to avoid misleading data and additional unexpected costs associated with it. Implementing adequate controls through the governance process ensures that the information that businesses depend on is accurate, consistent, and good quality. In addition, data governance must be measured at three distinct levels: 1. At the program level, at which the organization identifies and highlights the qualitative level and the impact the data governance process delivers; 2. At the operational level, at which the organization monitors on how data is behaving against the companies set policy and baseline; 3. And at the quantitative level, at which the organization measures the effectiveness and efficiency of data management results, assessing quantitative business values like revenue growth, cost savings, risk reduction, internal processes, and customer retention. For example, as part of a data analysis contract with Department of Transportation (DOT) Pipeline and Hazardous Materials Safety Administration (PHMSA), Catapult provided data management activities Managing data privacy is Page 7
8 aligned to the agency s data management policy. The policy identifies roles and responsibilities for data owners, stewards, and managers, as well as rulemaking impacts on collected data set and data management procedures. As part of the agency s data governance effort, Catapult contractors developed comprehensive data policies, standards, and procedures and monitored and enforced conformance with those data policies, standards, and architecture. In addition, Catapult contractors manage and resolve data related issues and communicate and promote the value of data assets within the agency. Conclusion Through Big Data analytics, the potential has never been greater to optimize business processes, to drive product and service innovation, and to enable enterprise controls. By leveraging Big Data analytics, Catapult Technology can help your organization: Measure the incremental cost of managing and analyzing unstructured data sets against the incremental benefits gained over and above what can be achieved using structured data sets. Develop a data culture in which the management, employees, and strategic partners are active participants in managing a meaningful data lifecycle. Harness new sources of information and take responsibility over accurate data creation, dissemination, data governance, quality and maintenance Enable businesses to turn data from information into actionable insights. Catapult s Big Data consultants are adept at: Collecting, cleaning, and integrating unstructured data from multiple sources, while creating a road map that helps organizations realize their business value by deriving greater insights from their data. Developing a migration strategy, creating prototypes, and engaging in full-fledged deployment of Big Data solutions. Page 8
9 Accommodating privacy, security, and data governance aspects of Big Data. Translating Big Data analytical findings into appropriate risk management and marketing strategies that drive business value. Hadoop/HDFS, MapReduce, HBase, Pig, NoSQL data stores (Cassandra, MongoDB). New businesses are emerging based on harvesting Big Data and by combining data and analytics services. Disruptive change is being implemented across industries both horizontally and vertically. Contact Catapult Technology so we can help your organization take advantage of Big Data technologies and build a culture that infuses analytics everywhere! Call or [email protected] References: McKinsey Global Institute: Big data: The next frontier for innovation, competition, and productivity The White House Big Data and Privacy Review Report: default/files/docs/big_data_privacy_report_5.1.14_final_print.pdf Daniel Austin, Principal Architect at PayPal: Apache Hadoop : Avanade Inc., Global Survey: The Business Impact of Big Data, 2010: com/en-us/approach/research/pages/big-data.aspx# (International Data Corporation) Page 9
10 11 Canal Center Plaza, Floor 2 Alexandria, VA [email protected] 09/02/14 QP
The Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
Complex, true real-time analytics on massive, changing datasets.
Complex, true real-time analytics on massive, changing datasets. A NoSQL, all in-memory enabling platform technology from: Better Questions Come Before Better Answers FinchDB is a NoSQL, all in-memory
Big Data Analytics for Space Exploration, Entrepreneurship and Policy Opportunities. Tiffani Crawford, PhD
Big Analytics for Space Exploration, Entrepreneurship and Policy Opportunities Tiffani Crawford, PhD Big Analytics Characteristics Large quantities of many data types Structured Unstructured Human Machine
Ubuntu and Hadoop: the perfect match
WHITE PAPER Ubuntu and Hadoop: the perfect match February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction In many fields of IT, there are always stand-out technologies. This is definitely
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics
BIG DATA & ANALYTICS Transforming the business and driving revenue through big data and analytics Collection, storage and extraction of business value from data generated from a variety of sources are
There s no way around it: learning about Big Data means
In This Chapter Chapter 1 Introducing Big Data Beginning with Big Data Meeting MapReduce Saying hello to Hadoop Making connections between Big Data, MapReduce, and Hadoop There s no way around it: learning
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014
Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014 Defining Big Not Just Massive Data Big data refers to data sets whose size is beyond the ability of typical database software tools
Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12
Hadoop http://hadoop.apache.org/ What Is Apache Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Transforming the Telecoms Business using Big Data and Analytics
Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe
Big Data. Fast Forward. Putting data to productive use
Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
BIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012
Big Data Buzzwords From A to Z By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords Big data is one of the, well, biggest trends in IT today, and it has spawned a whole new generation
Big Data on Microsoft Platform
Big Data on Microsoft Platform Prepared by GJ Srinivas Corporate TEG - Microsoft Page 1 Contents 1. What is Big Data?...3 2. Characteristics of Big Data...3 3. Enter Hadoop...3 4. Microsoft Big Data Solutions...4
How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6
Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...
You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.
What is this course about? This course is an overview of Big Data tools and technologies. It establishes a strong working knowledge of the concepts, techniques, and products associated with Big Data. Attendees
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: [email protected]
Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
Apache Hadoop: The Big Data Refinery
Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data
White Paper: Datameer s User-Focused Big Data Solutions
CTOlabs.com White Paper: Datameer s User-Focused Big Data Solutions May 2012 A White Paper providing context and guidance you can use Inside: Overview of the Big Data Framework Datameer s Approach Consideration
Big Data. White Paper. Big Data Executive Overview WP-BD-10312014-01. Jafar Shunnar & Dan Raver. Page 1 Last Updated 11-10-2014
White Paper Big Data Executive Overview WP-BD-10312014-01 By Jafar Shunnar & Dan Raver Page 1 Last Updated 11-10-2014 Table of Contents Section 01 Big Data Facts Page 3-4 Section 02 What is Big Data? Page
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Kanchan A. Khedikar Department of Computer Science & Engineering Walchand Institute of Technoloy, Solapur, Maharashtra,
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data
Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
Open source Google-style large scale data analysis with Hadoop
Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: [email protected] Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical
Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges
Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Prerita Gupta Research Scholar, DAV College, Chandigarh Dr. Harmunish Taneja Department of Computer Science and
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
Big Data and Hadoop for the Executive A Reference Guide
Big Data and Hadoop for the Executive A Reference Guide Overview The amount of information being collected by companies today is incredible. Wal- Mart has 460 terabytes of data, which, according to the
The 3 questions to ask yourself about BIG DATA
The 3 questions to ask yourself about BIG DATA Do you have a big data problem? Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation.
Bringing Together ESB and Big Data
Bringing Together ESB and Big Data Bringing Together ESB and Big Data Table of Contents Why ESB and Big Data?...3 Exploring the Promise of Big Data and ESB... 4 Moving Forward With ESB and Big Data...5
BIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
Data Mining in the Swamp
WHITE PAPER Page 1 of 8 Data Mining in the Swamp Taming Unruly Data with Cloud Computing By John Brothers Business Intelligence is all about making better decisions from the data you have. However, all
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
How To Scale Out Of A Nosql Database
Firebird meets NoSQL (Apache HBase) Case Study Firebird Conference 2011 Luxembourg 25.11.2011 26.11.2011 Thomas Steinmaurer DI +43 7236 3343 896 [email protected] www.scch.at Michael Zwick DI
Testing Big data is one of the biggest
Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing
Big Data: Beyond the Hype
Big Data: Beyond the Hype Why Big Data Matters to You WHITE PAPER Big Data: Beyond the Hype Why Big Data Matters to You By DataStax Corporation October 2011 Table of Contents Introduction...4 Big Data
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
A Brief Outline on Bigdata Hadoop
A Brief Outline on Bigdata Hadoop Twinkle Gupta 1, Shruti Dixit 2 RGPV, Department of Computer Science and Engineering, Acropolis Institute of Technology and Research, Indore, India Abstract- Bigdata is
Datenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg
The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.
Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect
on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Search and Real-Time Analytics on Big Data
Search and Real-Time Analytics on Big Data Sewook Wee, Ryan Tabora, Jason Rutherglen Accenture & Think Big Analytics Strata New York October, 2012 Big Data: data becomes your core asset. It realizes its
We are Big Data A Sonian Whitepaper
EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It s of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed
ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
BIG DATA TOOLS. Top 10 open source technologies for Big Data
BIG DATA TOOLS Top 10 open source technologies for Big Data We are in an ever expanding marketplace!!! With shorter product lifecycles, evolving customer behavior and an economy that travels at the speed
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
Cloudera Enterprise Data Hub in Telecom:
Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK OVERVIEW ON BIG DATA SYSTEMATIC TOOLS MR. SACHIN D. CHAVHAN 1, PROF. S. A. BHURA
Microsoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
Comprehensive Analytics on the Hortonworks Data Platform
Comprehensive Analytics on the Hortonworks Data Platform We do Hadoop. Page 1 Page 2 Back to 2005 Page 3 Vertical Scaling Page 4 Vertical Scaling Page 5 Vertical Scaling Page 6 Horizontal Scaling Page
Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.
Beyond Web Application Log Analysis using Apache TM Hadoop A Whitepaper by Orzota, Inc. 1 Web Applications As more and more software moves to a Software as a Service (SaaS) model, the web application has
INTRODUCTION TO CASSANDRA
INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open
Blazent IT Data Intelligence Technology:
Blazent IT Data Intelligence Technology: From Disparate Data Sources to Tangible Business Value White Paper The phrase garbage in, garbage out (GIGO) has been used by computer scientists since the earliest
Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 BIG DATA: A New Technology Farah DeebaHasan Student, M.Tech.(IT) Anshul Kumar Sharma Student, M.Tech.(IT)
Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:
Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
BIG DATA FUNDAMENTALS
BIG DATA FUNDAMENTALS Timeframe Minimum of 30 hours Use the concepts of volume, velocity, variety, veracity and value to define big data Learning outcomes Critically evaluate the need for big data management
White Paper: Hadoop for Intelligence Analysis
CTOlabs.com White Paper: Hadoop for Intelligence Analysis July 2011 A White Paper providing context, tips and use cases on the topic of analysis over large quantities of data. Inside: Apache Hadoop and
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
So What s the Big Deal?
So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data
Integrating a Big Data Platform into Government:
Integrating a Big Data Platform into Government: Drive Better Decisions for Policy and Program Outcomes John Haddad, Senior Director Product Marketing, Informatica Digital Government Institute s Government
Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP
Big-Data and Hadoop Developer Training with Oracle WDP What is this course about? Big Data is a collection of large and complex data sets that cannot be processed using regular database management tools
Chapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA
Big Data: What You Should Know Mark Child Research Manager - Software IDC CEMA Agenda Market Dynamics Defining Big Data Technology Trends Information and Intelligence Market Realities Future Applications
An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
BIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
Auto-Classification for Document Archiving and Records Declaration
Auto-Classification for Document Archiving and Records Declaration Josemina Magdalen, Architect, IBM November 15, 2013 Agenda IBM / ECM/ Content Classification for Document Archiving and Records Management
Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics
In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning
End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst
White Paper EMC s Enterprise Hadoop Solution Isilon Scale-out NAS and Greenplum HD By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst February 2012 This ESG White Paper was commissioned
Big Data Integration: A Buyer's Guide
SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology
International Journal of Innovative Research in Computer and Communication Engineering
FP Tree Algorithm and Approaches in Big Data T.Rathika 1, J.Senthil Murugan 2 Assistant Professor, Department of CSE, SRM University, Ramapuram Campus, Chennai, Tamil Nadu,India 1 Assistant Professor,
Integrated Social and Enterprise Data = Enhanced Analytics
ORACLE WHITE PAPER, DECEMBER 2013 THE VALUE OF SOCIAL DATA Integrated Social and Enterprise Data = Enhanced Analytics #SocData CONTENTS Executive Summary 3 The Value of Enterprise-Specific Social Data
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK BIG DATA HOLDS BIG PROMISE FOR SECURITY NEHA S. PAWAR, PROF. S. P. AKARTE Computer
Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: [email protected];
Microsoft Big Data Solutions Anar Taghiyev P-TSP E-mail: [email protected]; Why/What is Big Data and Why Microsoft? Options of storage and big data processing in Microsoft Azure. Real Impact of Big
Trustworthiness of Big Data
Trustworthiness of Big Data International Journal of Computer Applications (0975 8887) Akhil Mittal Technical Test Lead Infosys Limited ABSTRACT Big data refers to large datasets that are challenging to
Big Data, Big Traffic. And the WAN
Big Data, Big Traffic And the WAN Internet Research Group January, 2012 About The Internet Research Group www.irg-intl.com The Internet Research Group (IRG) provides market research and market strategy
Unlock the business value of enterprise data with in-database analytics
Unlock the business value of enterprise data with in-database analytics Achieve better business results through faster, more accurate decisions White Paper Table of Contents Executive summary...1 How can
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.
Predicting & Preventing Banking Customer Churn by Unlocking Big Data
Predicting & Preventing Banking Customer Churn by Unlocking Big Data Making Sense of Big Data http://www.ngdata.com Predicting & Preventing Banking Customer Churn by Unlocking Big Data 1 Predicting & Preventing
Big Data & Tourism. Rajendra Akerkar
Big Data & Tourism Rajendra Akerkar Technomathematics Research Foundation TMRF Report 11 2012 Big Data & Tourism To promote innovation and increase efficiency in the Tourism sector TMRF-report-11-2012
Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料
Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置
Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
How To Use Hadoop For Gis
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Big Data: Using ArcGIS with Apache Hadoop David Kaiser Erik Hoel Offering 1330 Esri UC2013. Technical Workshop.
Coverity Scan. Big Data Spotlight
Coverity Scan Big Data Spotlight Coverity Scan Service The Coverity Scan service began as the largest public-private sector research project in the world focused on open source software quality and security.
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
Indian Journal of Science The International Journal for Science ISSN 2319 7730 EISSN 2319 7749 2016 Discovery Publication. All Rights Reserved
Indian Journal of Science The International Journal for Science ISSN 2319 7730 EISSN 2319 7749 2016 Discovery Publication. All Rights Reserved Perspective Big Data Framework for Healthcare using Hadoop
Annex: Concept Note. Big Data for Policy, Development and Official Statistics New York, 22 February 2013
Annex: Concept Note Friday Seminar on Emerging Issues Big Data for Policy, Development and Official Statistics New York, 22 February 2013 How is Big Data different from just very large databases? 1 Traditionally,
