From Big Data to Actionable Insight
|
|
|
- Lawrence Stone
- 10 years ago
- Views:
Transcription
1 From Big Data to Actionable Insight Bob Palmer Senior Director, SAP National Security Services (SAP NS2 ) Dan Dorchinsky Client Director, SAP National Security Services (SAP NS2 ) August About SAP National Security Services (SAP NS2 ) SAP National Security Services (SAP NS2) offers a full suite of enterprise applications, analytics, database, cloud, and mobility software solutions from SAP and Sybase with specialized levels of security and support to meet the unique mission requirements of US national security and critical infrastructure customers. SAP National Security Services and SAP NS2 are trademarks owned by SAP Government Support and Services (SAP GSS). For more information, visit Big Data has become a hot topic as information technology (IT) leaders in business and government struggle with how best to leverage the rising flood of data coming from myriad sources. The explosion of information is a double-edged sword. On one hand, the data can reveal new insights that would have previously remained hidden. On the other hand, the quantity of data brings challenges in capturing, storing, sharing, searching, and analyzing that data. The term Big Data has come to describe data sets that are so large and complex that they are too cumbersome to manage using traditional tools or processes. In this paper, we propose that national security IT customers tackle Big Data with a hybrid solution of open-source and commercial technologies. The hybrid solution reduces the operating costs of managing and storing Big Data while providing agility and real-time speed for high-performance analytics all of which helps mission leaders and war fighters be more effective in carrying out their missions.
2 What is Big Data? In today s world, we have access to unimaginably large volumes of information from a growing number of data sources. While the exponentially expanding stream of information has made it possible to accomplish more and to address problems differently for example, spotting new business trends or drawing conclusions on theories that could never before be tested the sheer volume quickly becomes overwhelming. (See Figure 1.) Data sources are increasing daily: There are 5.9 billion mobile phone subscriptions worldwide, equivalent to 87 percent of the world s population. 1 Wal-Mart handles more than 1 million customer transactions every hour. 2 More than 2 billion people are accessing the Internet on a regular basis, creating data with every click. 3 The world's effective capacity to exchange information through telecommunication networks was 281 petabytes in 1986, 471 petabytes in 1993, 2.2 exabytes in 2000, and 65 exabytes in It is predicted that the amount of traffic flowing over the Internet will reach 667 exabytes annually by 2013, and the world s stores of digital information will increase tenfold every five years. 4 1 International Telecommunication Union, ICT Facts and Figures. ITU Telecomm World (2011), 2, 2, Data, data everywhere. The Economist, 25 February 2010, 3 International Telecommunication Union, ICT Facts and Figures. ITU Telecomm World (2011), 1, 4 Martin Hilbert and Priscila Lopez, The World s Technological Capacity to Store, Communicate, and Compute Information, Science 332, no. 6025, (1 April 2011): SAP National Security Services SAP NS2
3 The explosion of information is a double-edged sword. On one hand, the data can reveal new insights that would have previously remained hidden. On the other hand, the quantity of data brings challenges in capturing, storing, sharing, searching, and analyzing the data. Thus, the term Big Data has become an industry catchphrase to describe data sets that are so large and complex that they are too cumbersome to manage using traditional tools or processes. Before discussing specific technologies for harnessing Big Data, it is important to understand there is no single solution or universal approach to Big Data. The platforms, specific functionalities and tools chosen need to be based on the requirements of the mission. The platform will likely include a combination of open-source and commercial-grade enterprise software to create an end-to-end solution with data on one end and analytics on the other. This paper provides a notional architecture of this end-to-end platform and highlight how the Department of Defense (DoD) can use a combination of solutions to maximize the value of its data. DoD Problem Statement Addressing large volumes of data is not new, per se. However, there are key differences in addressing today s Big Data conundrum than in the past: (1) The recent confluence of cloud, social media, and mobile computing trends has reshaped the approach to Big Data. The ability to collect and analyze the staggering volume of data these media generate has driven organizations in many different industries to seek cost-effective solutions SAP National Security Services SAP NS2 3
4 (2) Data consumers (e.g., constituents, service members, and stakeholders) expect that data is served up instantaneously. In other words, they assume there is little to no data latency. (3) Data consumers expect that applications running against the data have the ability to proactively identify trends. And they expect the data to allow for self-service discovery in a user-friendly, visually appealing manner. Like the business and science communities, the Pentagon faces a similarly huge Big Data challenge, especially when it comes to unstructured data that does not fit well into a relational table and may include text, sensor data, images, or other elements. According to recent news reports, the DoD has invested billions of dollars in new electronic systems that gather and store vast quantities of imagery and other data from the battlefield. However, the digital deluge is so vast that sifting through it manually to generate actionable information is not practical, sustainable or cost-effective. According to Zach Lemnios, the Assistant Secretary of Defense for Research and Engineering, the DoD has progressed the quality of imagers and the quality of sensors to the point where our limitation is no longer the front end collection tools, it's the back-end decision tools. How do I take a large data set and integrate it in a time-critical way?" 5 Clearly, deriving new insights, recognizing relationships, and making increasingly accurate predictions are critically important to all of the defense and intelligence services. In addition to ingesting and digesting the sheer volume of data, the DoD faces an additional challenge: sharing information across different mission areas and partners. There is no standardized way of dealing with Big Data across such boundaries. 5 Jared Serbu, DoD R&D prioritizes 'Big Data.' Federal News Radio, 11 April 2012, SAP National Security Services SAP NS2
5 Building the Solution Platform Step 1: Capturing and Storing Huge Volumes of Unstructured Data As stated previously, many of the functions behind Big Data are not new data warehousing, mining, analytics but have been out of reach of many organizations due to prohibitive storage and processing costs. Obviously, a lower-cost approach will always prove most favorable for adoption. Therefore, it is not a surprise that open-source software frameworks, such as Hadoop, have gained popularity for managing a major aspect of the Big Data platform storing unstructured data in large part because of the cost. 6 For example, the data stored on a Hadoop cluster is less than one-tenth the cost of an equivalent relational database. Hadoop clusters were also designed to better process unstructured data, which represents 95% of the data currently being created. Video, picture images, and voice files are not easily stored in relational databases. Lastly, Hadoop offers significant advantages with linear scalability. Although this paper is not intended to focus on Hadoop, it is important to define it for uninitiated executives. Keep in mind that SAP NS2 s position is not for or against any specific technology. Rather, we embrace and extend other technologies. Combining the best features of an open-source solution (such as Hadoop) with innovative, enterprise-ready technology will deliver the most value at the best cost for many organizations with Big Data requirements. Further, a Big Data solution should let an organization store data in its native object format and then enable users to pick and choose what items to bring forward for analysis. Quite simply, Hadoop is a distributed file system, not a database. The Hadoop Distributed File System (HDFS) manages the splitting up and storage of large files of data across many inexpensive commodity servers, which are known as worker nodes and cost hundreds, not thousands, of dollars per terabyte. When Hadoop splits up the files, it puts redundant copies of the chunks of the file on more than one disc drive, providing 6 Hadoop was chosen to represent the open-source file system in our notional platform due to its increasing popularity and market position as a distributed file store SAP National Security Services SAP NS2 5
6 self-healing redundancy in case a low-cost commodity server fails. Hadoop also manages the distribution of scripts that perform business logic on the data files that are split up on those many server nodes. This splitting up of the business logic to each of the CPUs and RAM on many inexpensive worker nodes is what makes Hadoop work well on very large Big Data files. Analysis logic is performed in parallel on all of the server nodes at once, on each of the 64MB or 128MB chunks of the file. Hadoop software is written in Java and is licensed for free; it was developed as an open-source initiative of the Apache Foundation. Assuming that the technological challenge of capturing and storing massive unstructured data has been addressed with a solution like Hadoop, two vitally important questions remain: 1) How can the organization make the data relevant to act upon in real time, and 2) How can the organization accomplish this cost-effectively? Step 2: Adding an Analytical Data Warehouse for Real-Time Logic Our approach combines the Hadoop distributed file system for storing large amounts of unstructured data with (1) an analytical data warehouse for processing real-time analysis logic, and (2) a self-service, web-based user interface for visualizing the data. This approach may irritate purists who advocate solely for the Free and Open-Source Software (FOSS) movement or for the more traditional model of commercial-off-the-shelf (COTS) software. But in looking at this problem, we have concluded that a combination of solutions brings the right tools to the job. The Hadoop system is a scalable approach to inexpensively take in and store very large data files of unstructured and semi-structured data. Then the content of those files can be sorted and processed in parallel as instructed by code written by data scientists using the MapReduce methodology. 7 However, Hadoop and MapReduce have limitations and cannot effectively address Big Data solely on their own. A scalability problem arises in a pure Hadoop environment, and it is not merely a problem of scaling the numbers of cheap server nodes or disc space in the data center. Hadoop with MapReduce is essentially batch-oriented; a developer builds a MapReduce script to operate on the whole file, which may be as large as multiple terabyte-large and may run for 20 minutes, an hour, or even longer. There is no indexing or schema for the file system, nor is there of the capability to create, update, or delete. Given appropriate time and enough skilled data scientists, any type of analysis (predictive, comparative, pattern recognition, text analysis, or time series) can be run against the data in a batch process using MapReduce. While this process is effective, the outcome creates a conundrum due to the iterative 7 Pioneered by Google and further enhanced by internet indexing entities such as Yahoo! and companies that track massive amounts of click-stream data like Amazon.com SAP National Security Services SAP NS2
7 nature of data analytics. For example, the answer to the initial query may be accurate, but often another requirement will emerge to interrogate the data again, this time slightly differently, because analysis is inherently an iterative process. This creates another problem of scalability. In other words, how many hours will it take for highly skilled data scientists to write and rewrite analyses as driven by the shifting needs of mission specialists and war-fighters? Step 3: A Hybrid Solution Combining the Distributed File System with a Columnar-Structured Analytical Repository The ultimate goal of a Big Data solution is to drive actionable insight from as much information as possible. A state-of-the-art approach empowers end-users to query the data, with near-instant response time, in a much more self-service manner. The approach can be delivered by combining Hadoop with a high-performance data store in columnar format. The columnar data format is appropriate for this use-case for three reasons: Because the data originated from unstructured sources, the columnar format avoids the rigorous data modeling needed to construct a traditional Ralph Kimball-type star schema, or row-based relational table structure SAP National Security Services SAP NS2 7
8 The columnar data store obviates the need to build a pre-conceived indexing strategy for the data, because in a sense, in a columnar data store, the data is the index. And finally, the columnar data store allows for extreme data compression through bit-mapping, because all of the attributes of the data are organized together in columns, instead of being distributed in rows. Certainly, data compression is appropriate in what will be a large volume of data being operated upon in this Big Data scenario. SAP has been a thought leader in columnar data base development, and since the acquisition of Sybase in 2010, its expertise has deepened. For years, Sybase IQ was the pioneering columnar database in the market; and now, with more than 13 years of continuous development, it is widely deployed in missioncritical applications in government and the financial industry. Sybase IQ leverages the columnar approach and advanced compression techniques to rapidly produce insight across both structured and unstructured data sets. On the cutting edge of database evolution, SAP now offers a columnar database and computation engine that is totally in-memory. SAP HANA is a flexible, multipurpose, in-memory data warehouse that combines SAP software components optimized on hardware provided and delivered by leading hardware vendors. 8 SAP HANA is not merely a cache of a sub-set of relational SAP proposes that customers tackle Big Data with a hybrid solution of Hadoop plus an Analytical Data Warehouse. database tables. Rather, the data warehouse is completely resident in RAM memory in columnar format, and the computational engine is resident in the same memory. Therefore, there is no need for disc storage with the exception of backups and disaster recovery. The net result is speed. Simply put, SAP HANA is blazingly fast, up to 100,000 times faster than disc-based systems in many cases. To enable powerful real-time analysis, the in-memory data warehouse performs the slicing and dicing of data and performs rigorous predictive analysis without any I/O from data storage, in the same memory space with the data. It can process the in-memory data using numerous algorithms in the HANA Predictive Analysis Library (PAL), including K-Means, KNN, C4.5 Decision Tree, Linear Regression, Apriori, Moving Averages, et cetera, and it also makes available more than 3,500 open-source algorithms available in the R programming language (another nexus between open source and our COTS solutions.) 8 By working collaboratively with Intel Corporation, SAP has enabled HANA to leverage the full capabilities of the Intel Nehalem chip set, on hardware available from leading hardware vendors that include the top manufacturers in the industry, such as Dell, IBM, Hewlett Packard, Cisco, Fujitsu, etc SAP National Security Services SAP NS2
9 SAP proposes that customers tackle Big Data with a hybrid solution including Hadoop and an Analytical Data Warehouse. Rather than asking data scientists to write detailed analyses using MapReduce to perform analytical processes on entire data files, an innovative approach would be to use the MapReduce code to deliver an ETL (Extract, Transform and Load) of a large data domain to the inmemory, columnar data warehouse. The MapReduce process would sort and process the unstructured data, and then the data would be loaded into the in-memory data warehouse, using SAP Data Services, a click-and-drag graphical user-interface tool for mapping data. At this point, the large data domain needed for fast, real-time analysis will be on-hand in a structured data warehouse, in columnar format, with meta-data context. Users can create powerful analytical views of the data in a self-service web client environment (discussed in the next section). This will reduce the number of iterative cycles imposed on the data scientists to tune and re-tune the analyses as required by end users. Ultimately, it will serve to reduce the time lag between the warfighters analytical needs and their fulfillment. The in-memory analytical data warehouse is uniquely qualified to be the target of the large results sets of the MapReduce process. It is optimized so that there is no bottleneck in reporting performance for even extremely large data sets coming from a MapReduce job. The in-memory solution means that the seek time response is much shorter, even when analyzing the very large data sets that may be the result of the Hadoop MapReduce script. In fact, performance benchmarks routinely show sub-second response times, even for complex queries of millions of records. In sum, the combined solution reduces the operating costs of managing and storing Big Data while providing agility and speed for high-performance analytics SAP National Security Services SAP NS2 9
10 Step 4: Add Business Analytics Big Data is meaningless without the ability to cull information at the right time and in the right format that materially impacts the mission. SAP BusinessObjects business intelligence (BI) solutions can operate directly against the data stored in the columnar or in-memory repository. BI solutions provide business users with both an analytics and reporting framework for the data. They also allow end users to interface with existing applications and operational software such as Microsoft Office. Powerful analytical capabilities can be exposed to end-users using the SAP BusinessObjects web-based user interface. This interface is designed to empower subject matter experts who are not IT personnel and who have no coding or scripting skills. Importantly, these users do not need to understand the structure of the underlying data store to create effective ad hoc queries and visualizations. They will not have to burden the data scientists with running additional MapReduce queries, as the data will already reside in the in-memory data warehouse. In traditional analytics, high data volumes require that assumptions be made during data modeling in order to reduce the data set to a manageable size. In classic Ralph Kimball-type data warehousing strategies, aggregations are often used to reduce the data set. But in national security missions, aggregation could mean that the transactional or atomic data element that is crucial for situational awareness is not present in the analytical repository. Such simplified models don t accurately reflect the multifaceted nature of operational data, often resulting in suboptimal forecasting, planning, trend analysis, or root cause analysis all critical to a rapid response and situational awareness in the mission. Combining the in-memory data warehouse with market-leading business intelligence tools provides a platform for extremely fast response times for highly complex ad hoc queries and analyses on very large data sets that were originally stored in a distributed file system like Hadoop SAP National Security Services SAP NS2
11 Conclusion The hybrid solution combining both open source and commercial technologies can best solve the Big Data challenges faced by the defense and intelligence communities. End users, who are increasingly influenced by consumer applications, expect data to be provided to their fingertips with zero latency, and with the ability to proactively identify trends and conduct self-service discovery in a visually appealing environment. National security organizations will benefit from a hybrid solution that provides the ability to collect and analyze a staggering volume of data, and analyze it with agility in real time, with lower costs and increased speed to insight. But this is just the beginning. The in-memory computing platform for analyzing unstructured Big Data coming from Hadoop could also be the nexus for many other data sources in the DoD enterprise. Logistics, Order of Battle, Readiness, Force Generation, Human Resources, and Financial data all represent organizational functions that could be synthesized and powerfully analyzed using a similar inmemory approach, with real-time visibility. Authors: Bob Palmer Senior Director SAP National Security Services (SAP NS2 ) [email protected] Dan Dorchinsky Client Director SAP National Security Services (SAP NS2 ) [email protected] Copyright by SAP Government Support and Services. All rights reserved. May not be copied or redistributed without permission. SAP Government Support and Services, Inc. (SAP GSS), a Delaware corporation, is a wholly owned, independent US subsidiary of SAP and does business as SAP National Security Services (SAP NS2 ). SAP National Security Services and SAP NS2 are trademarks owned by SAP GSS. SAP NS2 offers the combined power of enterprise applications, analytics, database, cloud and mobile software solutions from SAP and Sybase with specialized levels of security and support to meet the unique mission requirements of US national security and critical infrastructure customers. In addition to US national security customers, SAP NS2 also supports private companies such as defense contractors, telecom carriers, and major financial institutions that have specialized information assurance needs. SAP, R/3, xapps, xapp, SAP NetWeaver, Duet, SAP Business ByDesign, ByDesign, PartnerEdge and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and in several other countries all over the world. Business Objects and the Business Objects logo, BusinessObjects, Crystal Reports, Crystal Decisions, Web Intelligence, Xcelsius and other Business Objects products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of Business Objects S.A. in the United States and in several other 2012 SAP National Security Services SAP NS2 11
12 countries. Business Objects is an SAP Company. All other product and service names mentioned and associated logos displayed are the trademarks of their respective companies. Data contained in this document serves informational purposes only. National product specifications may vary. The information in this document is proprietary to SAP. This document is a preliminary version and not subject to your license agreement or any other agreement with SAP. This document contains only intended strategies, developments, and functionalities of the SAP product and is not intended to be binding upon SAP to any particular course of business, product strategy, and/or development. Please note that this document is subject to change and may be changed by SAP at any time without notice. SAP assumes no responsibility for errors or omissions in this document. SAP does not warrant the accuracy or completeness of the information, text, graphics, links, or other items contained within this material. This document is provided without a warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP shall have no liability for damages of any kind including without limitation direct, special, indirect, or consequential damages that may result from the use of these materials. This limitation shall not apply in cases of intent or gross negligence SAP National Security Services SAP NS2
Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER
Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER There is a revolution happening in information technology, and it s not just
What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER
What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER A NEW PARADIGM IN INFORMATION TECHNOLOGY There is a revolution happening in information technology, and it s not
Hadoop: Strengths and Limitations in National Security Missions
Hadoop: Strengths and Limitations in National Security Missions How SAP Solutions Complement Hadoop to Enable Mission Success Bob Palmer Senior Director, SAP National Security Services (SAP NS2 ) June
SAP HANA. SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence
SAP HANA SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence SAP HANA Performance Table of Contents 3 Introduction 4 The Test Environment Database Schema Test Data System
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Tap into Big Data at the Speed of Business
SAP Brief SAP Technology SAP Sybase IQ Objectives Tap into Big Data at the Speed of Business A simpler, more affordable approach to Big Data analytics A simpler, more affordable approach to Big Data analytics
SQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities
Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling
Extend your analytic capabilities with SAP Predictive Analysis
September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics
SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics
SAP Brief SAP HANA Objectives Transform Your Future with Better Business Insight Using Predictive Analytics Dealing with the new reality Dealing with the new reality Organizations like yours can identify
ENTERPRISE MANAGEMENT AND SUPPORT IN THE AUTOMOTIVE INDUSTRY
ENTERPRISE MANAGEMENT AND SUPPORT IN THE AUTOMOTIVE INDUSTRY The Automotive Industry Businesses in the automotive industry face increasing pressures to improve efficiency, reduce costs, and quickly identify
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse
IBM Analytics Just the facts: Four critical concepts for planning the logical data warehouse 1 2 3 4 5 6 Introduction Complexity Speed is businessfriendly Cost reduction is crucial Analytics: The key to
Innovative technology for big data analytics
Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of
An Overview of SAP BW Powered by HANA. Al Weedman
An Overview of SAP BW Powered by HANA Al Weedman About BICP SAP HANA, BOBJ, and BW Implementations The BICP is a focused SAP Business Intelligence consulting services organization focused specifically
An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
Intelligent Business Operations Chapter 1: Overview & Strategy
Intelligent Business Operations Chapter 1: Overview & Strategy Legal Disclaimer The information in this presentation is confidential and proprietary to SAP and may not be disclosed without the permission
Agil visualisering och dataanalys
Agil visualisering och dataanalys True Business and IT collaboration in Analytics Niklas Packendorff @packendorff SAPSA Impuls 2014 Legal disclaimer The information in this presentation is confidential
INTELLIGENCE AND HOMELAND DEFENSE INSIGHT
I N D U S T R Y INTELLIGENCE AND HOMELAND DEFENSE INSIGHT INTELLIGENCE AND HOMELAND DEFENSE CHALLENGES The Intelligence Community (IC) needs the right information, in real time, to make critical decisions.
SAP SE - Legal Requirements and Requirements
Finding the signals in the noise Niklas Packendorff @packendorff Solution Expert Analytics & Data Platform Legal disclaimer The information in this presentation is confidential and proprietary to SAP and
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Sybase IQ Supercharges Predictive Analytics
SOLUTIONS BROCHURE Sybase IQ Supercharges Predictive Analytics Deliver smarter predictions with Sybase IQ for SAP BusinessObjects users Optional Photos Here (fill space) www.sybase.com SOLUTION FEATURES
Advanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
Parallel Data Warehouse
MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability
The Power of Predictive Analytics
The Power of Predictive Analytics Derive real-time insights with accuracy and ease SOLUTION OVERVIEW www.sybase.com KXEN S INFINITEINSIGHT AND SYBASE IQ FEATURES & BENEFITS AT A GLANCE Ensure greater accuracy
SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013
SAP HANA SAP s In-Memory Database Dr. Martin Kittel, SAP HANA Development January 16, 2013 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase
1 Performance Moves to the Forefront for Data Warehouse Initiatives. 2 Real-Time Data Gets Real
Top 10 Data Warehouse Trends for 2013 What are the most compelling trends in storage and data warehousing that motivate IT leaders to undertake new initiatives? Which ideas, solutions, and technologies
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
IBM System x reference architecture solutions for big data
IBM System x reference architecture solutions for big data Easy-to-implement hardware, software and services for analyzing data at rest and data in motion Highlights Accelerates time-to-value with scalable,
Next Generation Digital Banking with SAP
Next Generation Digital Banking with SAP Thorsten Weinrich, Director Business Development, Banking, SAP EMEA October, 2014 Use this title slide only with an image Legal Disclaimer The information in this
Advanced In-Database Analytics
Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??
SAP BusinessObjects SOLUTIONS FOR ORACLE ENVIRONMENTS
SAP BusinessObjects SOLUTIONS FOR ORACLE ENVIRONMENTS BUSINESS INTELLIGENCE FOR ORACLE APPLICATIONS AND TECHNOLOGY SAP Solution Brief SAP BusinessObjects Business Intelligence Solutions 1 SAP BUSINESSOBJECTS
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
ANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
RevoScaleR Speed and Scalability
EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution
RUN BETTER Become a Best-Run Business with Remote Support Platform for SAP Business One
RUN BETTER Become a Best-Run Business with Remote Support Platform for SAP Business One September 2013 Customer External Become a Best-Run Business with Remote Support Platform for SAP Business One Run
BIG DATA-AS-A-SERVICE
White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
The Power of Instant Customer Insight
The Power of Instant Customer Insight Medtronic dramatically improved reporting performance, increasing the value of its customer information, with the SAP HANA platform and Cisco Unified Computing System
Accelerating Hadoop MapReduce Using an In-Memory Data Grid
Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.
BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics
BIG DATA & ANALYTICS Transforming the business and driving revenue through big data and analytics Collection, storage and extraction of business value from data generated from a variety of sources are
Cost-Effective Data Management and a Simplified Data Warehouse
SAP Information Sheet SAP Technology SAP HANA Dynamic Tiering Quick Facts Cost-Effective Data Management and a Simplified Data Warehouse Quick Facts Summary The SAP HANA dynamic tiering option helps application
Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software
Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse
In-Memory Analytics for Big Data
In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...
SAP Business One OnDemand. SAP Business One OnDemand Solution Overview
SAP Business One OnDemand SAP Business One OnDemand Solution Overview SAP Business One OnDemand Table of Contents 4 Executive Summary Introduction SAP Business One Today 8 A Technical Overview: SAP Business
Business Performance without limits how in memory. computing changes everything
Business Performance without limits how in memory computing changes everything Information Explosion is unprecedented An inflection point for the enterprise VELOCITY Worldwide digital content will double
Understanding the Value of In-Memory in the IT Landscape
February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM
QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment
IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW
IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW A high-performance solution based on IBM DB2 with BLU Acceleration Highlights Help reduce costs by moving infrequently used to cost-effective systems
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis (Version 1.17) For validation Document version 0.1 7/7/2014 Contents What is SAP Predictive Analytics?... 3
Manifest for Big Data Pig, Hive & Jaql
Manifest for Big Data Pig, Hive & Jaql Ajay Chotrani, Priyanka Punjabi, Prachi Ratnani, Rupali Hande Final Year Student, Dept. of Computer Engineering, V.E.S.I.T, Mumbai, India Faculty, Computer Engineering,
Scaling Your Data to the Cloud
ZBDB Scaling Your Data to the Cloud Technical Overview White Paper POWERED BY Overview ZBDB Zettabyte Database is a new, fully managed data warehouse on the cloud, from SQream Technologies. By building
Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:
Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
Bringing Big Data into the Enterprise
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
Protecting Big Data Data Protection Solutions for the Business Data Lake
White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With
ENTERPRISE MANAGEMENT AND SUPPORT IN THE TELECOMMUNICATIONS INDUSTRY
ENTERPRISE MANAGEMENT AND SUPPORT IN THE TELECOMMUNICATIONS INDUSTRY The Telecommunications Industry Companies in the telecommunications industry face a number of challenges as market saturation, slow
Architectures for Big Data Analytics A database perspective
Architectures for Big Data Analytics A database perspective Fernando Velez Director of Product Management Enterprise Information Management, SAP June 2013 Outline Big Data Analytics Requirements Spectrum
Using an In-Memory Data Grid for Near Real-Time Data Analysis
SCALEOUT SOFTWARE Using an In-Memory Data Grid for Near Real-Time Data Analysis by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 IN today s competitive world, businesses
Five Technology Trends for Improved Business Intelligence Performance
TechTarget Enterprise Applications Media E-Book Five Technology Trends for Improved Business Intelligence Performance The demand for business intelligence data only continues to increase, putting BI vendors
Focus on the business, not the business of data warehousing!
Focus on the business, not the business of data warehousing! Adam M. Ronthal Technical Product Marketing and Strategy Big Data, Cloud, and Appliances @ARonthal 1 Disclaimer Copyright IBM Corporation 2014.
Agile Business Intelligence Data Lake Architecture
Agile Business Intelligence Data Lake Architecture TABLE OF CONTENTS Introduction... 2 Data Lake Architecture... 2 Step 1 Extract From Source Data... 5 Step 2 Register And Catalogue Data Sets... 5 Step
How To Help Your Business With Big Data Analytics
AUGUST 2012 Peer Research Big Data Analytics Intel s IT Manager Survey on How Organizations Are Using Big Data Why You Should Read This Document This report describes key findings from a survey of 200
BIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
SAP Service Tools for Performance Analysis
SAP Service Tools for Performance Analysis Kerstin Knebusch Active Global Support Month 05, 2013 Public Performance Analysis Wait event based Focus on statements causing high load and / or high wait time
Three Open Blueprints For Big Data Success
White Paper: Three Open Blueprints For Big Data Success Featuring Pentaho s Open Data Integration Platform Inside: Leverage open framework and open source Kickstart your efforts with repeatable blueprints
Made to Fit Your Needs. SAP Solution Overview SAP Solutions for Small Businesses and Midsize Companies
SAP Solution Overview SAP Solutions for Small Businesses and Midsize Companies SAP Solutions for Small Businesses and Midsize Companies Made to Fit Your Needs. Designed to Help You Grow. Becoming a Best-Run
SAP and Hortonworks Reference Architecture
SAP and Hortonworks Reference Architecture Hortonworks. We Do Hadoop. June Page 1 2014 Hortonworks Inc. 2011 2014. All Rights Reserved A Modern Data Architecture With SAP DATA SYSTEMS APPLICATIO NS Statistical
ENTERPRISE MANAGEMENT AND SUPPORT IN THE INDUSTRIAL MACHINERY AND COMPONENTS INDUSTRY
ENTERPRISE MANAGEMENT AND SUPPORT IN THE INDUSTRIAL MACHINERY AND COMPONENTS INDUSTRY The Industrial Machinery and Components Industry Manufacturers in the industrial machinery and components (IM&C) industry
Interactive data analytics drive insights
Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has
Il mondo dei DB Cambia : Tecnologie e opportunita`
Il mondo dei DB Cambia : Tecnologie e opportunita` Giorgio Raico Pre-Sales Consultant Hewlett-Packard Italiana 2011 Hewlett-Packard Development Company, L.P. The information contained herein is subject
Microsoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
Harness the Power of Analytics Across Lines of Business with Speed and Ease
SAP Brief SAP Crystal s Objectives Harness the Power of Analytics Across Lines of Business with Speed and Ease Enable better insight at critical moments of engagement Enable better insight at critical
Bringing the Power of SAS to Hadoop. White Paper
White Paper Bringing the Power of SAS to Hadoop Combine SAS World-Class Analytic Strength with Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities Contents Introduction... 1 What
BUSINESSOBJECTS DATA INTEGRATOR
PRODUCTS BUSINESSOBJECTS DATA INTEGRATOR IT Benefits Correlate and integrate data from any source Efficiently design a bulletproof data integration process Accelerate time to market Move data in real time
SAP BusinessObjects Edge BI, Preferred Business Intelligence. SAP BusinessObjects Portfolio SAP Solutions for Small Businesses and Midsize Companies
SAP BusinessObjects Edge BI, Standard Package Preferred Business Intelligence Choice for Growing Companies SAP BusinessObjects Portfolio SAP Solutions for Small Businesses and Midsize Companies Executive
Ignite Your Creative Ideas with Fast and Engaging Data Discovery
SAP Brief SAP BusinessObjects BI s SAP Crystal s SAP Lumira Objectives Ignite Your Creative Ideas with Fast and Engaging Data Discovery Tap into your data big and small Tap into your data big and small
Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: [email protected] Website: www.qburst.com
Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...
SAP S/4HANA Embedded Analytics
Frequently Asked Questions November 2015, Version 1 EXTERNAL SAP S/4HANA Embedded Analytics The purpose of this document is to provide an external audience with a selection of frequently asked questions
IBM Netezza High Capacity Appliance
IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data
SQL Server 2005 Features Comparison
Page 1 of 10 Quick Links Home Worldwide Search Microsoft.com for: Go : Home Product Information How to Buy Editions Learning Downloads Support Partners Technologies Solutions Community Previous Versions
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data
Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give
Get More Scalability and Flexibility for Big Data
Solution Overview LexisNexis High-Performance Computing Cluster Systems Platform Get More Scalability and Flexibility for What You Will Learn Modern enterprises are challenged with the need to store and
Cisco Data Preparation
Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and
CitusDB Architecture for Real-Time Big Data
CitusDB Architecture for Real-Time Big Data CitusDB Highlights Empowers real-time Big Data using PostgreSQL Scales out PostgreSQL to support up to hundreds of terabytes of data Fast parallel processing
Fast, Low-Overhead Encryption for Apache Hadoop*
Fast, Low-Overhead Encryption for Apache Hadoop* Solution Brief Intel Xeon Processors Intel Advanced Encryption Standard New Instructions (Intel AES-NI) The Intel Distribution for Apache Hadoop* software
