A total data management approach to big data

Size: px
Start display at page:

Download "A total data management approach to big data"

Transcription

1 RESEARCH PAPER A total data management approach to big data How companies are developing a comprehensive data management strategy to encompass big data along with traditional data management architectures September 2012 Sponsored by

2 Contents Executive summary Introduction Types of data Understanding the data The big data platform Benefits of the big data approach to TDM Survey perspectives on big data benefits and challenges Sought after features in big data tools Implementation and future trends Conclusion About the sponsor, Talend p3 p3 p4 p5 p7 p8 p9 p12 p12 p14 p15 This document is property of Incisive Media. Reproduction and distribution of this publication in any form without prior written permission is forbidden. 2 Computing research paper sponsored by Talend

3 Executive summary In recent years attention has increasingly focused on the corporate data that lies outside conventional structured enterprise databases. As the realisation has grown that this data is much larger in volume than previously thought, and potentially just as valuable as enterprise data, corporate users and vendors have both focused on developing technologies and approaches for capturing and integrating this big data into mainstream enterprise data infrastructures. The development of big data technologies, in particular, presents new ways that such data can be brought onto centralised, high performance platforms to provide Total Data Management (TDM) across the entire enterprise. Benefits of the TDM approach include the ability to achieve new levels of business intelligence (BI), data integration between big data, unstructured data and enterprise relational databases, customer intelligence, and the ability to achieve major performance improvements while reducing the amount of storage and hardware necessary throughout the organisation. This Computing report looks at how big data technology has developed into a means for organisations not only to manage big data itself, but also to achieve TDM though importing corporate unstructured and semi-structured data onto big data platforms. It examines the benefits to organisations of achieving this new level of TDM, and the challenges both technical and organisational that need to be overcome to achieve the benefits of big data. Introduction Very large volumes of variable data are being generated at dizzying speeds by customers interacting with corporate websites, monitoring and logging software and devices, trading platforms and many other pillars of the modern business environment. The term big data has become a potent brand, with many vendors avidly beating the big data drum as a panacea to the problem of managing all non-structured corporate data, or even, in some cases, as an all-encompassing new paradigm for enterprise data, structured and non-structured. Big data technologies that have been developed to handle very large volumes of variable, fast-changing and non-schematic data also provide large organisations with an ideal platform for centralising and storing their unstructured and semi-structured data, yielding major advantages in scope of analytics, knowledge discovery, new opportunity identification, as well as providing high speed, high performance storage and retrieval which can help slash hardware costs. Computing research paper sponsored by Talend 3

4 Types of data The big part of big data may be defined in terms of the Three Vs: volume, velocity and variety. Volume the quantity of data relative to the ability to store and manage it Velocity the speed of calculation needed to query the data relative to the rate of change of the data Variety a measure of the number of different formats the data exists in (eg text, audio, video, logs etc) If any of these Vs is low, then it may be more efficient to analyse the data using traditional BI methods. However, if volume and required velocity are high, then big data techniques and technologies become more efficient and economical. The data part of the big data equation can comprise almost any content: server logs; content captured from the web; mp3s; document scans; case notes; financial market data; banking transactions; audio and video streams; manufacturing process information; geophysical data; lab data; satellite telemetry; and so on. Again, the wider the range of formats, the more favourable big data techniques become when compared with relational database approaches. For many people big data is synonymous with unstructured data, but as the above definition makes clear, this is an over-simplification. For the sake of argument let us separate big data (rapidly changing, high in volume, wide in variety of formats) and corporate unstructured/semi-structured data, which may not be changing rapidly, but which also tends to exist outside of the enterprise database infrastructure. Neither lends itself naturally to the relational database model, but big data technologies, with very fast and efficient storage and retrieval of very variable data types, together with tools that allow sophisticated high performance analysis of unstructured and semi-structured data held in big data databases, provide a platform for consolidating corporate unstructured and semi-structured data as well as big data. With all manner of data integrated into a single platform, these technologies can provide organisations with unparalleled BI, competitive information and customer insights, including new knowledge about the markets that organisations operate in, emerging trends, and potential new business opportunities. They can also yield valuable information about the state and long term health of customer facing business systems, and how vulnerable these may be to failure: an invaluable source of business continuity and risk management data. Finely grained organisational knowledge, as well providing insight into the business also has the potential for exposing and correcting any number of false assumptions, errors and miscalculations that may have found their way into the corporate strategy. However, considerable challenges exist in integrating big data file systems. These obstacles include: a lack of big data integration skills and expertise, as it presents a very different paradigm to conventional enterprise database and programming systems; discovery and selection of appropriate file sets; and how to perform integration into big data file systems. There can even be issues around pilot studies and testing, which can be very different with big data when compared with conventional enterprise computing. 4 Computing research paper sponsored by Talend

5 Understanding the data Commentators agree that the majority of corporate data exists not in structured formats (i.e. within corporate enterprise databases) but rather as non-structured files: either unstructured or semi-structured. As early as 1998, financial business consultancy Merrill Lynch put the proportion of non-structured data within large enterprises and organisations at around 80 percent. Some put the figure even higher: Anant Jhingran of IBM Research said in 2011 that the figure was nearer to 85 percent. And according to IDC, the source of Merrill Lynch s 80 percent figure was also an earlier study by IBM. A 2006 study by the Data Warehousing Institute looked at the composition of data held within corporate and enterprise applications. It found that 47 percent of data within centralised corporate computing was structured, while 31 percent was unstructured and 27 percent was semi-structured. Given that corporate computing applications are largely enterprise database focused, while personal and work-group computing is far more likely to be dependent on unstructured data, such as documents, presentations, s and spreadsheets, these figures in fact accord largely with the Merrill Lynch / IBM consensus. Semi-structured data, such as XML, which the DWI study places at 27 percent, is likely to trail off sharply once one moves outside the corporate computing environment and more into workplace and business mobile computing since XML and similar semi-structured data formats, including JSON and SQL files, are far more likely to be used by data processing professionals, for transferring data between database tables, or into enterprise content management systems, than by management or administrative workers. Examples of unstructured data include word processing documents, PDFs, spreadsheets, s, reports, presentations, images, videos, and plain text files. Unlike big data, these tend to be small in size, but highly dispersed throughout organisations, and amount to very large volumes of data in aggregate. Unstructured data: word processing documents, PDFs, spreadsheets, s, reports, presentations, images, videos, plain text files Semi-structured data: JSON, XML Structured data: SQL database records Unstructured data tends to be heavily represented in the tools by which companies are managed and decisions taken. Spreadsheets and presentations are still the key management tools for corporate decision-making at all levels. In terms of BI, unstructured enterprise data is a goldmine waiting to be tapped. The challenge is capturing that data from its widely dispersed locations, and secondly rendering or processing it in a way that yields up organisational knowledge and BI. Computing research paper sponsored by Talend 5

6 It is worth adding that for many organisations, the unstructured data challenge does not just lie outside the realm of enterprise databases. Many large organisations hold vast amounts of unstructured data within enterprise database systems themselves, as Variable Character or Large Text fields, or as Binary Large Objects (BLOBs). Some enterprise databases can be optimised for full text search, while other enterprise databases lack advanced full text functions. But implementing sophisticated full text analytics on large enterprise databases can be both resource intensive and complex. When asked to identify major users of unstructured data, the Computing survey respondents placed corporate, finance and HR first (Fig. 1). According to this view, it is remarkable is how much core business data is unstructured. While much of the focus of big data often revolves around sales and IT, core business corporate functions HR, Finance and Corporate are major users of unstructured or semi-structured data. Fig. 1 : Which business functions would you identify as major users of unstructured data? Corporate Finance HR Marketing Sales IT Other 47% 42% 40% 31% 27% 22% 10% *Respondents could select more than one answer. Structured non-rdbms data formats such as XML and JSON have become well established in the data centre and IT department, with nearly 60 percent of IT departments reporting their use. Outside IT, their use falls significantly to a third or below of most business functions (Fig. 2). 6 Computing research paper sponsored by Talend

7 Fig. 2 : Which business functions would you identify as major users of structured flat files (eg JSON, XML)? IT Finance Corporate Sales Marketing HR Other 57% 33% 28% 23% 22% 19% 7% *Respondents could select more than one answer. An interesting feature about the Computing survey respondents estimates for unstructured and semi-structured data is how much closer they are to the 2006 Data Warehousing Institute study than to the more established figures publicised by Merrill Lynch, Aberdeen and other consultancy groups (see box page 5). The big data platform If corporate unstructured and semi-structured data is generated largely within organisations, big data is driven by the increasing convergence of organisations with Web technologies. However, although the Web is responsible for generating much of the data that most organisations would consider big data, as well as developing most of the technology to deal with it, not all big data is Web generated. Many organisations, including those in finance, insurance and Government, face significant big data processing challenges that have little or nothing to do with Web use. The combination of massive volumes and large variety of formats that characterises big data means that conventional databases can struggle to process information in a timely fashion. This has the crippling effect of greatly reducing the types of analysis that organisations can effectively run on very large bodies of data. Big data technologies such as Hadoop meet this challenge by supplementing conventional databases which store the data with technologies that distribute, or map, the processing across multiple servers, and then recombine, or reduce, the result set, and return it to the database or application. This allows data mining and analytics to be performed very rapidly, while the number of factors that can typically be analysed can often be multiplied by hundreds. One example of the performance advantage of big data techniques over SQL relational approaches is provided by Equifax, the credit reporting agency. Equifax developed a function to predict who across the United States might go bankrupt over the next 30 days. Unfortunately, written in SQL and running on a relational database, the function took 26 days to run. Transferred to a big data map-and-reduce platform, and rewritten in ECL, the function took just six minutes to complete. Computing research paper sponsored by Talend 7

8 Another approach does away with underlying SQL databases altogether, replacing them with databases that are capable of very rapid record storage and retrieval, and greater flexibility and efficiency in handling very different types of data. Both Google and Amazon have pursued this course in their web infrastructures, while Facebook and Twitter have infrastructures based more on mapping and reducing data derived from more conventional database designs. This addresses both the second and the third of the V s: velocity and variety. Big data often accumulates at such a fast pace that the locking and validation procedures of conventional databases struggle to keep up. Dedicated NoSQL databases do much better at keeping pace with very fast storage and retrieval, at the price of less functionality. Instead, NoSQL databases replace conventional SQL querying and analysis with specialist analytics tools that retrieve data from the NoSQL database and then analyse it separately. Such velocity implications attach not only to web data, but also to financial trading systems, banking transaction systems, scientific observational data, and military systems. Big data systems also set out to preserve the variety and scope of source data. Where possible, according to big data advocates, keep everything. Converting data to be stored in a relational database often strips away vital information. Dedicated NoSQL databases can also be designed for specific data types, such as XML, graphs, or documents. Their storage and retrieval can be orders of magnitude both faster and more efficient, and they can retain all of an object s full information than if it is converted to fit into an SQL schema, and can be far more agile in extracting meta-data from variable or differently sourced semi-organised data such as XML or JSON than relationally structured databases, which require a far more static and set data model for data extraction. Benefits of the big data approach to TDM Capturing and integrating corporate unstructured and semi-structured data into big data systems presents a number of significant organisational benefits. These include the ability to perform far more complex analytics; far faster data analysis, which in turn allows a considerable degree of dynamic BI to be extended to online and near real-time processes; faster data storage and retrieval; the ability to capture whole data; and efficiencies in storage caching all of which can translate into considerable cost savings. It becomes possible to integrate a large volume of highly diverse and widely distributed unstructured data files such as spreadsheets, documents, and presentations, together with corporate data into single enterprise file stores. The benefits are not limited to better and more comprehensive corporate data analysis by taking in the totality of enterprise data, not just a fraction of it. Timeliness of querying could also be radically improved. Complex queries that in the past took hours to schedule and execute could in future be set up and run in seconds. There is also a veritude pay-off that can extend right across the enterprise. Just as big data can enhance decision making at the corporate level by integrating crucial unstructured data into BI and other enterprise data systems, it also promises to improve decision making, planning and execution at the departmental and workgroup levels. At the local level, most project management, planning and delivery is based on unstructured data files. And these files contain errors, questionable assumptions and misinformation. Big data systems can highlight and spot-check such errors and misinformation on the fly, greatly improving the timeliness and effectiveness of local programmes. 8 Computing research paper sponsored by Talend

9 There are potentially significant benefits for corporate IT departments too. Storage and licensing costs can be sharply reduced, integrity of corporate data can be improved, and disaster recovery and data reliability can be simplified and made more cost-effective. There is significant evidence that big data systems can even reduce the level of processing and memory requirements of server systems by a significant amount, which could have the result of lowering long-term organisational IT CAPEX costs. Survey perspectives on big data benefits and challenges Efficiency savings are seen as the main benefit of integrating dispersed data into big data systems. However, BI, customer intelligence and improved corporate decision making are also viewed as significant benefits by most respondents, while improved data quality also a significant corporate business benefit is the other main benefit for large businesses (Fig. 3). Fig. 3 : What would you identify as potentially the main benefits of integrating disparate data via big data file systems? Organisational benefits Efficiency savings Business intelligence Data integrity Customer intelligence Improved corporate decision making Better risk management Greater agility Opportunity identification Compliance Fraud detection Other Supply chain intelligence 47% 37% 30% 28% 26% 23% 19% 12% 10% 9% 7% 2% Computing research paper sponsored by Talend 9

10 Operational or technical benefits Better data quality in general Improved performance Faster queries Reduced storage Reduced hardware requirement None Other 65% 49% 43% 35% 14% 4% 4% *Respondents could select more than one answer. Looking at the main technical benefits of big data file systems, 65 percent identified better data quality as the principal benefit a finding that is largely accordant with the findings of the previous question. A large number of respondents also looked to faster queries and improved performance, which are business user benefits, and are service-focused. Cost and reducing footprint, largely internal IT issues, are seen as significantly less relevant to big data. Above all, respondents are looking at big data as a way to improve service and data functionality to business end-users. Potential hurdles and significant questions remain. Setting up and running big data systems can present significant skills and knowledge challenges as big data presents a very different paradigm to conventional enterprise relational database systems. Programming skills required for big data are also often quite different, centring around language types such as Perl and statistical analytical programming languages such as R and ECL. Expertise in areas such as multi-tiered SQL constructions are replaced by a need to understand efficient regex constructions and Levenshtein algorithms. Some of these new languages can demand in-depth knowledge of data science and advanced maths. ECL, for example, is an extremely terse language that looks more like artificial intelligence code than the procedural programming languages most organisational IT staff are familiar with. On the other hand, it can be extremely efficient. One line of ECL is claimed to be equivalent programmatically to 120 lines of C++ code. There can also be significant issues around selecting which unstructured file sets to be incorporated into enterprise big data systems, and how to prioritise integration of unstructured data. In addition to the ever present problem of budget constraints likely to feature at the top of any list of barriers to IT project implementation survey respondents see the main barriers to implementing big data as the variety of data structures and volume of files to be incorporated. Hence, it is the perceived technical difficulty of implementing big data that is seen as being the main challenge in big data projects (Fig. 4). 10 Computing research paper sponsored by Talend

11 Fig. 4 : What do you see as the challenges to implementing big data in your organisation? Organisational challenges Making the business case Uncertainty about return on investment (ROI) Lack of or cost of staff resources required 48% 40% 32% No compelling need Insufficient buy-in by the board Departmental opposition Cost of buying software Cost of buying hardware Other 25% 20% 18% 16% 11% 8% Technical or operational challenges Budget constraints Variety of data structures Volume of files to be incorporated Skill levels Rate of change of the data you need to process Other 59% 53% 42% 28% 21% 8% *Respondents could select more than one answer. Computing research paper sponsored by Talend 11

12 The main organisational challenge to implementing big data appears to be uncertainty over identifying and quantifying its benefits. Forty-eight percent identified difficulties in making the business case, while 40 percent found difficulty in quantifying the return on investment (ROI). In principal, these are closely related issues. They are almost certainly related to the way that respondents perceive big data as principally a business benefit to the organisation in a much wider sense, and to end-user departments. Since they are seeing it as a top-line business benefit, and not a bottom line business benefit, and because the technology is relatively new, they find it much more difficult to quantify the likely measurable difference big data could make to their organisations. Sought after features in big data tools When survey respondents were asked what benefits they would most wish for from a big data implementation, top of the list were: Better data quality A single version of the truth Eliminating confusion caused by multiple spreadsheet versions Improved opportunity identification and decision support Ease of use and the ability to integrate multiple data formats rapidly are the most looked for features in big data systems, underlying how they are seen as a business and organisational benefit, and that there is some urgency to their implementation. Second to this are high performance and scalability (Fig. 5). Technical issues and concerns about programming languages, APIs and other technical facets, are a much lesser concern. Implementation and future trends Big data is currently at an early stage of development as an enterprise technology. The technology originated within a number of the largest Web content providers, such as Amazon, Google and Yahoo, as a way of handling very large volumes of data. Cross-over to the enterprise computing world has been gradual to date. Some notable figures within the big data world have bemoaned the slow uptake of big data within corporates and enterprise organisations. Amazon s Werner Vogels, Chief Technology Officer of Amazon Web Services, told a conference in London in March 2012 that large enterprise organisations were lagging far behind small tech start-ups in their adoption of big data technologies. What we would consider a level playing field is actually tilting towards younger businesses, said Vogels. They are using big data much more rapidly at the core of their businesses to continuously improve their services, and enterprises are the ones who are having to catch up. The Computing survey shows that just 2 percent of large organisations are currently engaged in large scale roll-out of big data projects, and 61 percent of large organisations have not engaged with the issue at all yet (Fig. 6). 12 Computing research paper sponsored by Talend

13 Fig. 5 : What features would you look for in a big data platform? Ability to integrate different data formats rapidly Ease of use High performance Scalability Low learning curve Familiar or easy to learn querying language Wide range of prog. language interfaces Similarity to existing systems Powerful programming functions Other 69% 66% 51% 49% 39% 26% 16% 14% 14% 7% *Respondents could select more than one answer. Fig. 6 : How far has your organisation engaged with big data? 1% 24% 8% 4% 2% 61% No interest shown Discussions about using it Discussed and rejected its use Planning and appraisal stage Pilot projects Large-scale roll-out Computing research paper sponsored by Talend 13

14 However, our survey also shows the likelihood of very rapid growth of live production big data implementations in the near future, with large scale roll-outs likely to double, if not triple. Furthermore, there is a considerable funnel building up behind enterprise big data, with a doubling at almost every stage up the funnel, apart from at the mouth of the funnel, which shows a tripling of interest. Of the 15 percent of large organisations in our survey who have moved beyond discussions to take active decisions about the use of big data in their organisations, only 1 percent of enterprises had rejected the use of big data. In contrast, 38 percent of large organisations are either considering the use of big data, or are actively planning, piloting or implementing big data implementations. Conclusion Big data technology platforms offer enterprises considerable opportunities to integrate large volumes of unstructured and semi-structured data that presently lie outside the ambit of the enterprise IT infrastructure towards to goal of Total Data Management. Integrating these dispersed sets of data onto big data platforms provides organisations with opportunities to gain new levels of BI, analytics, corporate risk management, and customer intelligence. It also provides scope to gain significant savings in dispersed and work-group computing hardware expenditure. Although big data is at an early stage in acceptance within enterprise computing, our survey shows that the number of installations is set to grow very rapidly, with the installed base likely to double, and then double again within the near future. However, potential enterprise users see a number of issues with current big data technology, of which the chief are the difficulty in integrating different data formats rapidly onto a big data platform, and the general ease of use of big data technologies. In our survey, concerns over the ability to integrate different data formats rapidly and easily into big data systems, together with ease of use of these new technologies were the key concerns of enterprise IT staff by a clear margin. 14 Computing research paper sponsored by Talend

15 About the sponsor, Talend Talend is one of the largest pure play vendors of open source software, offering a breadth of middleware solutions that address both data management and application integration needs. Companies are faced with exponential growth in the volume and heterogeneity of the data and applications they need to manage and control. A key challenge that IT departments face today is ensuring the consistency of their data and processes by using modelling tools, workflow management and storage, the foundations of data governance in any company today. This challenge is actually faced by organisations of all sizes not only the largest corporations. In just a few years, Talend has become the recognized market leader in open source data management. Many large organisations around the globe use Talend s products and services to optimise the costs of data integration, data quality, MDM and application integration. With an ever growing number of product downloads and paying customers, Talend offers the most widely used and deployed data management solutions in the world. Contact Talend: Visit: Computing research paper sponsored by Talend 15

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Why Cloud BI? The 10 Substantial Benefits of Software-as-a-Service Business Intelligence

Why Cloud BI? The 10 Substantial Benefits of Software-as-a-Service Business Intelligence The 10 Substantial Benefits of Software-as-a-Service Business Intelligence Executive Summary Smart businesses are pursuing every available opportunity to maximize performance and minimize costs. Business

More information

Big Data Integration: A Buyer's Guide

Big Data Integration: A Buyer's Guide SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology

More information

Understanding the Value of In-Memory in the IT Landscape

Understanding the Value of In-Memory in the IT Landscape February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Why Cloud BI? of Software-as-a-Service Business Intelligence. Executive Summary. This white paper explores the 10 substantial

Why Cloud BI? of Software-as-a-Service Business Intelligence. Executive Summary. This white paper explores the 10 substantial of Software-as-a-Service Business Intelligence Executive Summary Smart businesses are pursuing every available opportunity to maximize performance and minimize costs. Business Intelligence tools used to

More information

COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES

COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY The business world is abuzz with the potential of data. In fact, most businesses have so much data that it is difficult for them to process

More information

Powerful analytics. and enterprise security. in a single platform. microstrategy.com 1

Powerful analytics. and enterprise security. in a single platform. microstrategy.com 1 Powerful analytics and enterprise security in a single platform microstrategy.com 1 Make faster, better business decisions with easy, powerful, and secure tools to explore data and share insights. Enterprise-grade

More information

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM David Chappell SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM A PERSPECTIVE FOR SYSTEMS INTEGRATORS Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Business

More information

Big Data and Healthcare Payers WHITE PAPER

Big Data and Healthcare Payers WHITE PAPER Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other

More information

I D C V E N D O R S P O T L I G H T. S t o r a g e Ar c h i t e c t u r e t o Better Manage B i g D a t a C hallenges

I D C V E N D O R S P O T L I G H T. S t o r a g e Ar c h i t e c t u r e t o Better Manage B i g D a t a C hallenges I D C V E N D O R S P O T L I G H T S t o r a g e Ar c h i t e c t u r e t o Better Manage B i g D a t a C hallenges September 2012 Adapted from Worldwide File-Based Storage 2011 2015 Forecast: Foundation

More information

Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data

Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data Transforming Data into Intelligence Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data Big Data Data Warehousing Data Governance and Quality

More information

RESEARCH PAPER. Big data are we nearly there yet?

RESEARCH PAPER. Big data are we nearly there yet? RESEARCH PAPER Big data are we nearly there yet? A look at the degree to which big data solutions have become a reality and the barriers to wider adoption May 2013 Sponsored by CONTENTS Executive summary

More information

Big Data Defined Introducing DataStack 3.0

Big Data Defined Introducing DataStack 3.0 Big Data Big Data Defined Introducing DataStack 3.0 Inside: Executive Summary... 1 Introduction... 2 Emergence of DataStack 3.0... 3 DataStack 1.0 to 2.0... 4 DataStack 2.0 Refined for Large Data & Analytics...

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

Data Virtualization and ETL. Denodo Technologies Architecture Brief

Data Virtualization and ETL. Denodo Technologies Architecture Brief Data Virtualization and ETL Denodo Technologies Architecture Brief Contents Data Virtualization and ETL... 3 Summary... 3 Data Virtualization... 7 What is Data Virtualization good for?... 8 Applications

More information

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling

More information

The 4 Pillars of Technosoft s Big Data Practice

The 4 Pillars of Technosoft s Big Data Practice beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed

More information

Traditional BI vs. Business Data Lake A comparison

Traditional BI vs. Business Data Lake A comparison Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Extending the Power of Analytics with a Proven Data Warehousing. Solution

Extending the Power of Analytics with a Proven Data Warehousing. Solution SAP Brief SAP s for Small Businesses and Midsize Companies SAP IQ, Edge Edition Objectives Extending the Power of Analytics with a Proven Data Warehousing Uncover deep insights and reach new heights Uncover

More information

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. Impact of Big Data in Oil & Gas Industry Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. New Age Information 2.92 billions Internet Users in 2014 Twitter processes 7 terabytes

More information

The 3 questions to ask yourself about BIG DATA

The 3 questions to ask yourself about BIG DATA The 3 questions to ask yourself about BIG DATA Do you have a big data problem? Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation.

More information

BIG DATA CHALLENGES AND PERSPECTIVES

BIG DATA CHALLENGES AND PERSPECTIVES BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,

More information

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.

More information

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment

More information

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP Pythian White Paper TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP ABSTRACT As companies increasingly rely on big data to steer decisions, they also find themselves looking for ways to simplify

More information

An Enterprise Framework for Business Intelligence

An Enterprise Framework for Business Intelligence An Enterprise Framework for Business Intelligence Colin White BI Research May 2009 Sponsored by Oracle Corporation TABLE OF CONTENTS AN ENTERPRISE FRAMEWORK FOR BUSINESS INTELLIGENCE 1 THE BI PROCESSING

More information

Five Technology Trends for Improved Business Intelligence Performance

Five Technology Trends for Improved Business Intelligence Performance TechTarget Enterprise Applications Media E-Book Five Technology Trends for Improved Business Intelligence Performance The demand for business intelligence data only continues to increase, putting BI vendors

More information

How To Make Data Streaming A Real Time Intelligence

How To Make Data Streaming A Real Time Intelligence REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log

More information

The Business Value of Predictive Analytics

The Business Value of Predictive Analytics The Business Value of Predictive Analytics Alys Woodward Program Manager, European Business Analytics, Collaboration and Social Solutions, IDC London, UK 15 November 2011 Copyright IDC. Reproduction is

More information

Email archives: no longer fit for purpose?

Email archives: no longer fit for purpose? RESEARCH PAPER Email archives: no longer fit for purpose? Most organisations are using email archiving systems designed in the 1990s: inflexible, non-compliant and expensive May 2013 Sponsored by Contents

More information

Market Pulse Research: Big Data Storage & Analytics

Market Pulse Research: Big Data Storage & Analytics Market Pulse Research: Big Data Storage & Analytics MARKETING RESEARCH EMPLOYEE ENGAGEMENT A WORLD OF INSIGHTS January 2015 Presented on behalf of HP & Microsoft METHODOLOGY & RESEARCH OBJECTIVES Sample

More information

Virtual Data Warehouse Appliances

Virtual Data Warehouse Appliances infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data

More information

The Liaison ALLOY Platform

The Liaison ALLOY Platform PRODUCT OVERVIEW The Liaison ALLOY Platform WELCOME TO YOUR DATA-INSPIRED FUTURE Data is a core enterprise asset. Extracting insights from data is a fundamental business need. As the volume, velocity,

More information

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management Big Data and New Paradigms in Information Management Vladimir Videnovic Institute for Information Management 2 "I am certainly not an advocate for frequent and untried changes laws and institutions must

More information

Are You Big Data Ready?

Are You Big Data Ready? ACS 2015 Annual Canberra Conference Are You Big Data Ready? Vladimir Videnovic Business Solutions Director Oracle Big Data and Analytics Introduction Introduction What is Big Data? If you can't explain

More information

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com;

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Microsoft Big Data Solutions Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Why/What is Big Data and Why Microsoft? Options of storage and big data processing in Microsoft Azure. Real Impact of Big

More information

How To Turn Big Data Into An Insight

How To Turn Big Data Into An Insight mwd a d v i s o r s Turning Big Data into Big Insights Helena Schwenk A special report prepared for Actuate May 2013 This report is the fourth in a series and focuses principally on explaining what s needed

More information

Big Data and Cloud Computing - The Global Challenge

Big Data and Cloud Computing - The Global Challenge RESEARCH PAPER Is your business ready for Big Data? A discussion of the cultural, operational and technical elements required to make Big Data work for your business August 2014 Sponsored by Contents Executive

More information

DATAOPT SOLUTIONS. What Is Big Data?

DATAOPT SOLUTIONS. What Is Big Data? DATAOPT SOLUTIONS What Is Big Data? WHAT IS BIG DATA? It s more than just large amounts of data, though that s definitely one component. The more interesting dimension is about the types of data. So Big

More information

In-Database Analytics

In-Database Analytics Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing

More information

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure White Paper Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure Providing Agile and Efficient Service Delivery for Sustainable Business Advantage What You Will Learn Enterprises

More information

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6 Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Software October 2010 TABLE OF CONTENTS INTRODUCTION... 3 BUSINESS AND IT DRIVERS... 4 NOSQL DATA STORES LANDSCAPE...

More information

BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS

BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS BIG DATA: BIG CHALLENGE FOR SOFTWARE TESTERS Megha Joshi Assistant Professor, ASM s Institute of Computer Studies, Pune, India Abstract: Industry is struggling to handle voluminous, complex, unstructured

More information

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.

More information

What happens when Big Data and Master Data come together?

What happens when Big Data and Master Data come together? What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information

More information

Are you storing problems for the future?

Are you storing problems for the future? Are you storing problems for the future? How business success tomorrow will be driven by smarter storage today June 2014 Phone: 01304 814800 Fax: 01304 814899 info@ 1 Are you storing problems for the future?

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

Big Data & the Cloud: The Sum Is Greater Than the Parts

Big Data & the Cloud: The Sum Is Greater Than the Parts E-PAPER March 2014 Big Data & the Cloud: The Sum Is Greater Than the Parts Learn how to accelerate your move to the cloud and use big data to discover new hidden value for your business and your users.

More information

Changing the Equation on Big Data Spending

Changing the Equation on Big Data Spending White Paper Changing the Equation on Big Data Spending Big Data analytics can deliver new customer insights, provide competitive advantage, and drive business innovation. But complexity is holding back

More information

RESEARCH paper. April 2013. Sponsored by

RESEARCH paper. April 2013. Sponsored by RESEARCH paper Big data: what s holding you back? Uncertainty about return on investment and skills shortages needs to be overcome if the promise of big data technologies is to be fulfilled April 2013

More information

BUILDING THE CASE FOR CLOUD: HOW BUSINESS FUNCTIONS IN UK MANUFACTURERS ARE DRIVING PUBLIC CLOUD ADOPTION

BUILDING THE CASE FOR CLOUD: HOW BUSINESS FUNCTIONS IN UK MANUFACTURERS ARE DRIVING PUBLIC CLOUD ADOPTION BUILDING THE CASE FOR CLOUD: HOW BUSINESS FUNCTIONS IN UK MANUFACTURERS ARE DRIVING PUBLIC CLOUD ADOPTION Industry Report Contents 2 4 6 Executive Summary Context for the Sector Key Findings 3 5 9 About

More information

White Paper: Evaluating Big Data Analytical Capabilities For Government Use

White Paper: Evaluating Big Data Analytical Capabilities For Government Use CTOlabs.com White Paper: Evaluating Big Data Analytical Capabilities For Government Use March 2012 A White Paper providing context and guidance you can use Inside: The Big Data Tool Landscape Big Data

More information

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS! The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

ANALYTICS BUILT FOR INTERNET OF THINGS

ANALYTICS BUILT FOR INTERNET OF THINGS ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that

More information

Global Data Integration with Autonomous Mobile Agents. White Paper

Global Data Integration with Autonomous Mobile Agents. White Paper Global Data Integration with Autonomous Mobile Agents White Paper June 2002 Contents Executive Summary... 1 The Business Problem... 2 The Global IDs Solution... 5 Global IDs Technology... 8 Company Overview...

More information

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA

More information

Integrating SAP and non-sap data for comprehensive Business Intelligence

Integrating SAP and non-sap data for comprehensive Business Intelligence WHITE PAPER Integrating SAP and non-sap data for comprehensive Business Intelligence www.barc.de/en Business Application Research Center 2 Integrating SAP and non-sap data Authors Timm Grosser Senior Analyst

More information

Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges

Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Prerita Gupta Research Scholar, DAV College, Chandigarh Dr. Harmunish Taneja Department of Computer Science and

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

A Study on Big-Data Approach to Data Analytics

A Study on Big-Data Approach to Data Analytics A Study on Big-Data Approach to Data Analytics Ishwinder Kaur Sandhu #1, Richa Chabbra 2 1 M.Tech Student, Department of Computer Science and Technology, NCU University, Gurgaon, Haryana, India 2 Assistant

More information

BIG DATA AND MICROSOFT. Susie Adams CTO Microsoft Federal

BIG DATA AND MICROSOFT. Susie Adams CTO Microsoft Federal BIG DATA AND MICROSOFT Susie Adams CTO Microsoft Federal THE WORLD OF DATA IS CHANGING Cloud What s making this possible? Electrical efficiency of computers doubles every year and ½. Laptops and mobile

More information

!!!!! White Paper. Understanding The Role of Data Governance To Support A Self-Service Environment. Sponsored by

!!!!! White Paper. Understanding The Role of Data Governance To Support A Self-Service Environment. Sponsored by White Paper Understanding The Role of Data Governance To Support A Self-Service Environment Sponsored by Sponsored by MicroStrategy Incorporated Founded in 1989, MicroStrategy (Nasdaq: MSTR) is a leading

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

Big Data Comes of Age: Shifting to a Real-time Data Platform

Big Data Comes of Age: Shifting to a Real-time Data Platform An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for SAP April 2013 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents Introduction... 1 Drivers of Change...

More information

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation 2014 MarkLogic. All rights reserved. Reproduction of this white paper by any means is strictly prohibited. TABLE

More information

Data Virtualization A Potential Antidote for Big Data Growing Pains

Data Virtualization A Potential Antidote for Big Data Growing Pains perspective Data Virtualization A Potential Antidote for Big Data Growing Pains Atul Shrivastava Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and

More information

How To Learn To Use Big Data

How To Learn To Use Big Data Information Technologies Programs Big Data Specialized Studies Accelerate Your Career extension.uci.edu/bigdata Offered in partnership with University of California, Irvine Extension s professional certificate

More information

There s no way around it: learning about Big Data means

There s no way around it: learning about Big Data means In This Chapter Chapter 1 Introducing Big Data Beginning with Big Data Meeting MapReduce Saying hello to Hadoop Making connections between Big Data, MapReduce, and Hadoop There s no way around it: learning

More information

What to Look for When Selecting a Master Data Management Solution

What to Look for When Selecting a Master Data Management Solution What to Look for When Selecting a Master Data Management Solution What to Look for When Selecting a Master Data Management Solution Table of Contents Business Drivers of MDM... 3 Next-Generation MDM...

More information

Big Data Explained. An introduction to Big Data Science.

Big Data Explained. An introduction to Big Data Science. Big Data Explained An introduction to Big Data Science. 1 Presentation Agenda What is Big Data Why learn Big Data Who is it for How to start learning Big Data When to learn it Objective and Benefits of

More information

Why Most Big Data Projects Fail

Why Most Big Data Projects Fail Learning from Common Mistakes to Transform Big Data into Insights What is Big Data?...2 Three Reasons Why Big Data Projects Fail...3 How Can Big Data Be Used?...5 The Lavastorm Approach to Big Data...5

More information

Anuradha Bhatia, Faculty, Computer Technology Department, Mumbai, India

Anuradha Bhatia, Faculty, Computer Technology Department, Mumbai, India Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Real Time

More information

Software as a Service Offers Broadening Appeal for Small and Medium-Sized Discrete Manufacturers

Software as a Service Offers Broadening Appeal for Small and Medium-Sized Discrete Manufacturers Software as a Service Offers Broadening Appeal for Small and Medium-Sized Discrete Manufacturers WHITE PAPER Sponsored by: SAP Simon Ellis November 2010 IDC MANUFACTURING INSIGHTS OPINION Software as a

More information

Automated Business Intelligence

Automated Business Intelligence Automated Business Intelligence Delivering real business value,quickly, easily, and affordably 2 Executive Summary For years now, the greatest weakness of the Business Intelligence (BI) industry has been

More information

Big Data Executive Survey

Big Data Executive Survey Big Data Executive Full Questionnaire Big Date Executive Full Questionnaire Appendix B Questionnaire Welcome The survey has been designed to provide a benchmark for enterprises seeking to understand the

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

Evolution to Revolution: Big Data 2.0

Evolution to Revolution: Big Data 2.0 Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents

More information

WHITEPAPER. Why Dependency Mapping is Critical for the Modern Data Center

WHITEPAPER. Why Dependency Mapping is Critical for the Modern Data Center WHITEPAPER Why Dependency Mapping is Critical for the Modern Data Center OVERVIEW The last decade has seen a profound shift in the way IT is delivered and consumed by organizations, triggered by new technologies

More information

Wrangling Actionable Insights from Organizational Data

Wrangling Actionable Insights from Organizational Data Wrangling Actionable Insights from Organizational Data Koverse Eases Big Data Analytics for Those with Strong Security Requirements The amount of data created and stored by organizations around the world

More information

QUICK FACTS. Delivering a Unified Data Architecture for Sony Computer Entertainment America TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES

QUICK FACTS. Delivering a Unified Data Architecture for Sony Computer Entertainment America TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES [ Consumer goods, Data Services ] TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES QUICK FACTS Objectives Develop a unified data architecture for capturing Sony Computer Entertainment America s (SCEA)

More information

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Big Data Analytics DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Tom Haughey InfoModel, LLC 868 Woodfield Road Franklin Lakes, NJ 07417 201 755 3350 tom.haughey@infomodelusa.com

More information

Ten Mistakes to Avoid

Ten Mistakes to Avoid EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects

More information

Making Leaders Successful Every Day

Making Leaders Successful Every Day Making Leaders Successful Every Day Demystifying Big Data and Hadoop for BI Pros Boris Evelson Vice President, Principal Analyst Information is the next competitive differentiator Information derived from

More information

Scale-out NAS Unifies the Technical Enterprise

Scale-out NAS Unifies the Technical Enterprise Scale-out NAS Unifies the Technical Enterprise Panasas Inc. White Paper July 2010 Executive Summary Tremendous effort has been made by IT organizations, and their providers, to make enterprise storage

More information

ANALYTICS STRATEGY: creating a roadmap for success

ANALYTICS STRATEGY: creating a roadmap for success ANALYTICS STRATEGY: creating a roadmap for success Companies in the capital and commodity markets are looking at analytics for opportunities to improve revenue and cost savings. Yet, many firms are struggling

More information