MarkLogic for Government. July 2014

Size: px
Start display at page:

Download "MarkLogic for Government. July 2014"

Transcription

1 MarkLogic for Government July 2014

2 Table of Contents Executive Summary... 3 Big Trends, Big Data... 3 Public Sector Challenges... 4 The Case For NoSQL... 6 Three Use Cases for MarkLogic s NoSQL Platform... 7 Use case 1: Rescuing Healthcare.gov... 7 Use Case #2: Improving Army IT... 8 Use Case #3: Integrating Intelligence... 8 Not All NoSQL Databases Are Created Equal... 8

3 Executive Summary A decade and a half into the 21st century, it s time government agencies moved on past their 20th century database technology. Durable as it s been, the relational DBMS-plus-Structured Query Language (SQL) applications simply aren t up to today s requirements. You need something radically different. Why? Because computing, applications, and data itself have all changed radically. More precisely, the notions of what constitutes data, and to what applications data is available, have expanded. Data now includes a host of information sources Word documents, PDFs, images, social media posts, and location information, to name a few that simply don t fit into the rows and columns of traditional relational databases. Or, for that matter, the structures in flat file databases. These changes are why a growing number of federal agencies are evaluating a whole new class of databases that don t rely solely on structured query language, yet can act relationally if required. The product they choose most often for mission critical operations is MarkLogic. NoSQL doesn t mean non-sql. To the contrary, developers can use many of the SQL tools with which they are already familiar plus many new ones. In other words, not only SQL, but rather SQL and much more. MarkLogic is indeed radically different, yet it s a robust, tested, and proven alternative. Now in its 7th version since hitting the market 12 years ago, MarkLogic lets government agencies use not only traditional, cell-type data but also the government s vast stores of unstructured data to create value for their internal operations and in the services they are able to deploy to the public. This data takes many forms that, until the advent of NoSQL databases, could not be employed in an integrated way with mainframe and clientserver applications. Another radical benefit: With MarkLogic, the tech staff can dispense with data schema and modeling that, in some cases, can take scores or hundreds of man-hours. Whereas information stored in one relational DBMS may not be immediately compatible with another, when using MarkLogic you can ingest data from virtually any source and begin using it. Big Trends, Big Data Let s take a closer look at some of the data and computing trends affecting government. No discussion of this topic can get around the three important enterprise computing trends: Mobility, cloud computing, and big data. On the mobility front, agencies face the dual challenges of enabling greater mobility in the federal workforce and deploying apps to an increasingly mobile public. MarkLogic Corporation: MarkLogic for Government 3

4 On the cloud front, the cloud computing phenomenon is coalescing into four basic use cases. Clouds are hosts for productivity applications such as , software development projects, virtual machines, and big data stores. The last use case is where you find so many types of data. And big data by definition encompasses not only large but also dissimilar data sets harnessed together for analytical needs. These phenomena both encourage and enable the treatment of data as a separate and distinct commodity. Data is no longer tied to a specific application. Mobile apps have many characteristics that distinguish them from client-server or web applications, but high on the list is the way they call on multiple, often unlike data sources. Conversely, they create data that doesn t fit neatly into rows and columns. For example, agencies have thousands of people in the field conducting inspections, gathering evidence, taking censuses of people, animals or things, or testing food and consumer products. They are increasingly using mobile devices to record and upload this information, which can consist of entries in standard databases, images, videos, voice files, and text. Whereas information stored in one relational DBMS may not be immediately compatible with another, when using MarkLogic you can ingest data from virtually any source and begin using it. Whether developing traditional or mobile apps, agencies need the flexibility to incorporate the many types of structured and unstructured data sources. With the MarkLogic Enterprise NoSQL platform, agencies can design applications that use video, audio, XML, digital documents, geospatial information, and text all stored in a single repository. This means contracts, manuals, books, and are no longer islands of information, unusable by applications. Applications can also incorporate linked data from the web, such as RDF triples, plus social media like tweets, Facebook comments, and blogs. Public Sector Challenges When you look at the public sector through the right prism, you see a very large enterprise encompassing a wide set of vertical industries. Nearly everything done in the private sector has a counterpart in government. For example, federal missions involve transportation, health care, communications, and cybersecurity. Finance, human resources, logistics, procurement, even sales and marketing all occur in the common lines of government business. Government also has large exclusive functions in defense, intelligence, public safety, and law enforcement. MarkLogic Corporation: MarkLogic for Government 4

5 Many of the government s challenges in these three areas have a data component specifically the ability, or inability, to deal with data from different sources and of different types. Agencies can take on these challenges more readily if they have applications powered by databases which themselves can incorporate both structured and unstructured data. Some challenges relate to mission or operations. Health care provides a good example. Data challenges there range from relatively small to really big. The Defense Department and VA have tried mightily to fashion a single medical record for service members as they migrate to veteran status. It s not lack of desire or even money that s stalled this effort, but rather the sheer technical difficulty of matching data sets. Other challenges are bigger. Collectively the Food and Drug Administration, the National Institutes of Health, TRICARE Management Activity, and others conduct vast amounts of research, collect field information from all over the world, and treat millions of patients. All of this creates structured databases, notes, digital documents, images, and videos. Each data set might be developed for a specific purpose, but as a big data store it can power research and applications limited only by the imagination. Agency-generated datasets become even more valuable when mixed with other publicly available data sources. In one example, the FDA recently issued a request for information on how it could mine social media wikis, blogs, Facebook entries for early warnings of medical device problems or food borne illnesses. Other data challenges stem from policy. Agencies across the board are challenged to use evidence to justify or improve program effectiveness. At one time managerial competence was largely a function of whether allocated funds were spent within the time allotted and within the rules of the program. No longer. Office of Management and Budget guidance, for instance, and the analytical perspectives accompanying the last several federal budget requests specifically call for evidence-based decision-making and budgeting. This goes for contracted projects, grants, and public assistance and subsidy programs. Judging the effectiveness of a national program whether assistance to veterans homelessness or grants to engineering colleges to test unmanned aircraft in controlled airspace often requires synthesizing data from several sources, including those from outside of the agency. Homeland security and law enforcement together make up another prime use case for use of the NoSQL database. In addition to integrating data from many sources (including sensors) and of many types, there is a powerful and durable need to share data and information across siloed systems and among the thousands of agencies at the federal, state, and local levels. MarkLogic Corporation: MarkLogic for Government 5

6 Even the relatively tiny Consumer Products Safety Commission conducts epidemiologic work with reports coming in from more than 100 emergency rooms across the country, and 600 product complaints from citizens every week. The Case For NoSQL When you boil it all down, governments are vast information generators. Regardless of mission, agencies share the challenge of getting value out of data in such a way that it leads to new or improved services, better decision-making, and more-informed policy development. In the era of multi-source big data, the model of relational database bolted to client-server application won t help agencies meet that challenge. Again, the reason is simply that most data simply doesn t fit the columns and rows of the RDBMS. A different type of database is required to create new value by combining data in new ways using fast, agile development. The emerging model provides the platform for realizing the data value proposition. It looks like this: Agencies are gathering and creating data from sensors, network logs, surveillance, social media, geospatial information systems (GIS), and documents of all types including texts, spreadsheets, , contracts, and PDFs the list is endless. They are storing that data in the cloud to improve accessibility and scalability. The choice of public, private, or hybrid cloud depends on a host of factors including sensitivity, cost, and network performance considerations. This is where the NoSQL database comes in. Selecting a NoSQL container helps government agencies solve both data and system management challenges while enabling the deployment of new and different applications. The foundation technology supporting cloud storage and big, diverse data is the NoSQL database. Many of the government s challenges have a data component specifically the ability, or inability, to deal with data from different sources and of different types. From a policy standpoint, agencies need a way to sustain data governance and discovery regulations. At the federal level, and at many state and municipal levels, policy states that government data sets be made available in machine-readable formats. Agencies can greatly ease the management of these requirements MarkLogic Corporation: MarkLogic for Government 6

7 when multiple data sources, both structured and unstructured, can be stored in a unified, scalable database, a capability only available with a NoSQL solution like MarkLogic. Data management is also simplified when datasets are unified under one roof. That avoids the costs of duplication, lowers the possibility for unsynchronized data, and simplifies storage subsystems and database administration. You can t separate data and storage considerations. MarkLogic brings high availability, elasticity, and tiered storage features to let administrators minimize storage costs while maximizing availability all on commodity hardware. MarkLogic opens up new possibilities for application development simply by freeing IT man-hours otherwise devoted to administration, data modeling, and the extract-transform-load (ETL) functions. Thanks to MarkLogic s built-in search function and automatic indexing, application developers can easily and quickly find data and information relevant to the application they are building. Many federal agencies are exploring the Hadoop model for cost-effective data storage, as well as for analytics and other compute-intensive chores on large data sets, particularly those composed of multiple data types that don t fit into relational tables. MarkLogic runs natively atop the Hadoop Distributed File System, and also has a connector for the Hadoop MapReduce engine. Three Use Cases for MarkLogic s NoSQL Platform Use case 1: Rescuing Healthcare.gov When the new Health and Human Services secretary appointed a technology specialist to focus on Healthcare.gov, it was not coincidental she chose someone who d worked on the Data Services Hub (DSH) component of Healthcare.gov. Despite the site s troubled rollout, the DSH was the one component that worked. That s no accident either. The hub relies on MarkLogic to integrate data on applicants citizenship, Social Security Number, vital statistics, and tax information. It adds health care provider data by zip code. It does all this by connecting with web services of those sources, each of which looks different. In fact, MarkLogic even ingests data from existing relational databases, in effect unchaining them from their applications and unlocking value. Often the logic for calculating insurance eligibility and rates occurs right within MarkLogic. And it all occurred with no predefined standard schema and no data reference model because the NoSQL approach is schema-agnostic. In fact, early calculations show that using a relational approach to the DSH would have required data modeling work totaling 100 years! MarkLogic Corporation: MarkLogic for Government 7

8 Use Case #2: Improving Army IT The Army Network Command in Ft. Huachuca, Ariz. needed to quantify its IT assets on both classified and unclassified-but-sensitive networks. This required information from 58 data sources with a variety of formats, and an attempt to feed all of the sources into an RDBMS simply could not go live. Yet MarkLogic was able to create an operational census of IT using 22 of the data sets within 30 days; full operating capabilities launched within 90 days. Use Case #3: Integrating Intelligence Some agencies have found that MarkLogic s NoSQL platform in effect rescues an RDBMS in a critical application. A case in point is a program in the intelligence community for analyzing electronic intelligence data. Disparate types of information from 18 data sources later raised to 26 are transformed in MarkLogic before they load into the popular RDBMS that planners had first chosen. Queries that took 12 minutes of parsing when using just the RDBMS now take only 6 seconds with MarkLogic, and four seconds of that is the loading of transformed data into the RDBMS. Plus, the system that could barely support a few users now scales to hundreds. Not All NoSQL Databases Are Created Equal Pioneered by MarkLogic over a decade ago, the NoSQL database market is diverse and growing. But MarkLogic uniquely offers an enterprise-grade NoSQL database platform bundled with the tools necessary for mission critical implementations. MarkLogic brings to public sector enterprises: High speed search and automatic data indexing capabilities embedded in the core product, to enable structure-aware searches across all text and data elements, in multiple languages Full atomicity, consistency, isolation and durability (ACID) compliance for assured transactions and no data loss High availability and rapid disaster recovery Real-time alerts to changes in database objects Elasticity and scalability to meet data volume and access demands World class, government-grade security controls Join the agencies like Health and Human Services, the Army, the Intelligence Community, the National Archives and Records Administration, and the Patent and Trademark Office that have moved to a 21st century data platform. Unify your data, speed application development, and master big data. MarkLogic Corporation: MarkLogic for Government 8

9 About MarkLogic For more than a decade, MarkLogic has delivered a powerful, agile, and trusted Enterprise NoSQL database platform that enables organizations to turn all data into valuable and actionable information. Organizations around the world rely on MarkLogic s enterprise-grade technology to power the new generation of information applications. MarkLogic is headquartered in Silicon Valley with offices in Washington D.C., New York, London, Frankfurt, Utrecht, and Tokyo. For more information, please visit MarkLogic Corporation. All rights reserved. This technology is protected by U.S. Patent No. 7,127,469B2, U.S. Patent No. 7,171,404B2, U.S. Patent No. 7,756,858 B2, and U.S. Patent No 7,962,474 B2. MarkLogic is a trademark or registered trademark of MarkLogic Corporation in the United States and/or other countries. All other trademarks mentioned are the property of their respective owners. [SS-MLIH-13-06] 999 Skyway Road, Suite 200, San Carlos, CA US: INT'L.: sales@marklogic.com MarkLogic Corporation: MarkLogic for Government 9

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014 Increase Agility and Reduce Costs with a Logical Data Warehouse February 2014 Table of Contents Summary... 3 Data Virtualization & the Logical Data Warehouse... 4 What is a Logical Data Warehouse?... 4

More information

You Have Your Data, Now What?

You Have Your Data, Now What? You Have Your Data, Now What? Kevin Shelly, GVP, Global Public Sector Data is a Resource SLIDE: 2 Time to Value SLIDE: 3 Big Data: Volume, VARIETY, and Velocity Simple Structured Complex Structured Textual/Unstructured

More information

MarkLogic Enterprise Data Layer

MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer September 2011 September 2011 September 2011 Table of Contents Executive Summary... 3 An Enterprise Data

More information

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data MarkLogic Enterprise NoSQL Database and Cisco Unified Computing System provide a single, integrated hardware and software infrastructure

More information

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP Pythian White Paper TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP ABSTRACT As companies increasingly rely on big data to steer decisions, they also find themselves looking for ways to simplify

More information

Wrangling Actionable Insights from Organizational Data

Wrangling Actionable Insights from Organizational Data Wrangling Actionable Insights from Organizational Data Koverse Eases Big Data Analytics for Those with Strong Security Requirements The amount of data created and stored by organizations around the world

More information

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic Semantics in Healthcare and Life Sciences for LIDER The Only Enterprise NoSQL Database Search & Query ACID Transactions High Availability / Disaster Recovery Replication Government-grade Security

More information

A 360 Degree View of Anything

A 360 Degree View of Anything A 360 Degree View of Anything Sara Mazer, Principal Solutions Architect MarkLogic Corporation Data is Growing at a Staggering Rate 44 ZB 8 ZB 2015 2020 Source: IDC SLIDE: 2 Enterprise IT Faces Unprecedented

More information

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation 2014 MarkLogic. All rights reserved. Reproduction of this white paper by any means is strictly prohibited. TABLE

More information

Big Data: Overview and Roadmap. 2015 eglobaltech. All rights reserved.

Big Data: Overview and Roadmap. 2015 eglobaltech. All rights reserved. Big Data: Overview and Roadmap 2015 eglobaltech. All rights reserved. What is Big Data? Large volumes of complex and variable data that require advanced techniques and technologies to enable capture, storage,

More information

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

and NoSQL Data Governance for Regulated Industries Using Hadoop Justin Makeig, Director Product Management, MarkLogic October 2013

and NoSQL Data Governance for Regulated Industries Using Hadoop Justin Makeig, Director Product Management, MarkLogic October 2013 Data Governance for Regulated Industries Using Hadoop and NoSQL Justin Makeig, Director Product Management, MarkLogic October 2013 Who am I? Product Manager for 6 years at MarkLogic Background in FinServ

More information

Beyond Relational: Reimagine Your Data With Enterprise NoSQL. May 2014

Beyond Relational: Reimagine Your Data With Enterprise NoSQL. May 2014 Beyond Relational: Reimagine Your Data With Enterprise NoSQL May 2014 Table of Contents Executive Summary 3 Introduction 4 The NoSQL Paradigm Shift 6 Reimagining Big Data with MarkLogic 7 An Illustration:

More information

Create and Drive Big Data Success Don t Get Left Behind

Create and Drive Big Data Success Don t Get Left Behind Create and Drive Big Data Success Don t Get Left Behind The performance boost from MapR not only means we have lower hardware requirements, but also enables us to deliver faster analytics for our users.

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

Mission-Critical Database with Real-Time Search for Big Data

Mission-Critical Database with Real-Time Search for Big Data Mission-Critical Database with Real-Time Search for Big Data February 17, 2012 Slide 1 Overview About MarkLogic Why MarkLogic Case Studies Technology and Features Slide 2 About MarkLogic 10 years in business

More information

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning

More information

www.sryas.com Analance Data Integration Technical Whitepaper

www.sryas.com Analance Data Integration Technical Whitepaper Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring

More information

NoSQL for SQL Professionals William McKnight

NoSQL for SQL Professionals William McKnight NoSQL for SQL Professionals William McKnight Session Code BD03 About your Speaker, William McKnight President, McKnight Consulting Group Frequent keynote speaker and trainer internationally Consulted to

More information

EXECUTIVE OFFICE OF THE PRESIDENT OFFICE OF MANAGEMENT AND BUDGET WASHINGTON, D.C. 20503

EXECUTIVE OFFICE OF THE PRESIDENT OFFICE OF MANAGEMENT AND BUDGET WASHINGTON, D.C. 20503 EXECUTIVE OFFICE OF THE PRESIDENT OFFICE OF MANAGEMENT AND BUDGET WASHINGTON, D.C. 20503 Keynote by Vivek Kundra Federal Chief Information Officer The Economic Gains of Cloud Computing Good morning and

More information

GigaSpaces Real-Time Analytics for Big Data

GigaSpaces Real-Time Analytics for Big Data GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and

More information

www.ducenit.com Analance Data Integration Technical Whitepaper

www.ducenit.com Analance Data Integration Technical Whitepaper Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring

More information

Government Technology Trends to Watch in 2014: Big Data

Government Technology Trends to Watch in 2014: Big Data Government Technology Trends to Watch in 2014: Big Data OVERVIEW The federal government manages a wide variety of civilian, defense and intelligence programs and services, which both produce and require

More information

Making Sense of Big Data in Insurance

Making Sense of Big Data in Insurance Making Sense of Big Data in Insurance Amir Halfon, CTO, Financial Services, MarkLogic Corporation BIG DATA?.. SLIDE: 2 The Evolution of Data Management For your application data! Application- and hardware-specific

More information

The Principles of the Business Data Lake

The Principles of the Business Data Lake The Principles of the Business Data Lake The Business Data Lake Culture eats Strategy for Breakfast, so said Peter Drucker, elegantly making the point that the hardest thing to change in any organization

More information

Protecting Data with a Unified Platform

Protecting Data with a Unified Platform Protecting Data with a Unified Platform The Essentials Series sponsored by Introduction to Realtime Publishers by Don Jones, Series Editor For several years now, Realtime has produced dozens and dozens

More information

Data Modeling for Big Data

Data Modeling for Big Data Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes

More information

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)

More information

BIG DATA ANALYTICS For REAL TIME SYSTEM

BIG DATA ANALYTICS For REAL TIME SYSTEM BIG DATA ANALYTICS For REAL TIME SYSTEM Where does big data come from? Big Data is often boiled down to three main varieties: Transactional data these include data from invoices, payment orders, storage

More information

Big Data Integration: A Buyer's Guide

Big Data Integration: A Buyer's Guide SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology

More information

Ten Mistakes to Avoid

Ten Mistakes to Avoid EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects

More information

CORPORATE OVERVIEW. Big Data. Shared. Simply. Securely.

CORPORATE OVERVIEW. Big Data. Shared. Simply. Securely. CORPORATE OVERVIEW Big Data. Shared. Simply. Securely. INTRODUCING PHEMI SYSTEMS PHEMI unlocks the power of your data with out-of-the-box privacy, sharing, and governance PHEMI Systems brings advanced

More information

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

Software-Defined Networks Powered by VellOS

Software-Defined Networks Powered by VellOS WHITE PAPER Software-Defined Networks Powered by VellOS Agile, Flexible Networking for Distributed Applications Vello s SDN enables a low-latency, programmable solution resulting in a faster and more flexible

More information

The Importance of Data Quality for Intelligent Data Analytics:

The Importance of Data Quality for Intelligent Data Analytics: The Importance of Data Quality for Intelligent Data Analytics: Optimizing the Financial and Operational Performance of IT White Paper IT decisions are only as good as the data they re based on. And that

More information

Big Data and Healthcare Payers WHITE PAPER

Big Data and Healthcare Payers WHITE PAPER Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other

More information

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08

More information

Apache Hadoop: The Big Data Refinery

Apache Hadoop: The Big Data Refinery Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data

More information

Data Science and Big Data: Below the Surface and Implications for Governance

Data Science and Big Data: Below the Surface and Implications for Governance Data Science and Big Data: Below the Surface and Implications for Governance Randy Soper The views expressed are those of the author and do not reflect the official position or policy of the Defense Intelligence

More information

WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS

WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS Managing and analyzing data in the cloud is just as important as it is anywhere else. To let you do this, Windows Azure provides a range of technologies

More information

Integrating a Big Data Platform into Government:

Integrating a Big Data Platform into Government: Integrating a Big Data Platform into Government: Drive Better Decisions for Policy and Program Outcomes John Haddad, Senior Director Product Marketing, Informatica Digital Government Institute s Government

More information

Big Data Analytics Best Practices

Big Data Analytics Best Practices 1 Big Data Analytics Best Practices Marshall Presser Federal Field CTO Greenplum 2 Big Data Makes the Mainstream 3 WHAT DOES IT TAKE? 4 1. New Applications MADlib 5 2. New Skill Sets -- Data Science 6

More information

Data Integration Checklist

Data Integration Checklist The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media

More information

Protecting Big Data Data Protection Solutions for the Business Data Lake

Protecting Big Data Data Protection Solutions for the Business Data Lake White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With

More information

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

Introduction to Big Data! with Apache Spark UC#BERKELEY# Introduction to Big Data! with Apache Spark" UC#BERKELEY# So What is Data Science?" Doing Data Science" Data Preparation" Roles" This Lecture" What is Data Science?" Data Science aims to derive knowledge!

More information

www.objectivity.com Choosing The Right Big Data Tools For The Job A Polyglot Approach

www.objectivity.com Choosing The Right Big Data Tools For The Job A Polyglot Approach www.objectivity.com Choosing The Right Big Data Tools For The Job A Polyglot Approach Nic Caine NoSQL Matters, April 2013 Overview The Problem Current Big Data Analytics Relationship Analytics Leveraging

More information

Top 3 Ways Big Data Impacts Financial Services

Top 3 Ways Big Data Impacts Financial Services Top 3 Ways Big Data Impacts Financial Services The Big Data Dilemma for Financial Services Today s firms are looking for new ways to solve Big Data challenges. From front-office risk management to back-office

More information

Engage your customers

Engage your customers Business white paper Engage your customers HP Autonomy s Customer Experience Management market offering Table of contents 3 Introduction 3 The customer experience includes every interaction 3 Leveraging

More information

Complex, true real-time analytics on massive, changing datasets.

Complex, true real-time analytics on massive, changing datasets. Complex, true real-time analytics on massive, changing datasets. A NoSQL, all in-memory enabling platform technology from: Better Questions Come Before Better Answers FinchDB is a NoSQL, all in-memory

More information

Why Big Data in the Cloud?

Why Big Data in the Cloud? Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data

More information

Decision Ready Data: Power Your Analytics with Great Data. Murthy Mathiprakasam

Decision Ready Data: Power Your Analytics with Great Data. Murthy Mathiprakasam Decision Ready Data: Power Your Analytics with Great Data Murthy Mathiprakasam 2 Your Mission Repeatably deliver trusted and timely data for great analytics and great social impact 3 Great Data Powers

More information

Cloud Computing and Advanced Relationship Analytics

Cloud Computing and Advanced Relationship Analytics Cloud Computing and Advanced Relationship Analytics Using Objectivity/DB to Discover the Relationships in your Data By Brian Clark Vice President, Product Management Objectivity, Inc. 408 992 7136 brian.clark@objectivity.com

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

The NoSQL Generation: Embracing the Document Model. May 2014

The NoSQL Generation: Embracing the Document Model. May 2014 The NoSQL Generation: Embracing the Document Model May 2014 Table of Contents Introduction 3 The History of NoSQL 3 Types of NoSQL Databases 4 Embracing the Document Model 7 Defining Enterprise NoSQL 10

More information

White. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014

White. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014 White Paper EMC Isilon: A Scalable Storage Platform for Big Data By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst April 2014 This ESG White Paper was commissioned by EMC Isilon and is distributed

More information

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.

More information

Information Architecture

Information Architecture The Bloor Group Actian and The Big Data Information Architecture WHITE PAPER The Actian Big Data Information Architecture Actian and The Big Data Information Architecture Originally founded in 2005 to

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Unleash your intuition

Unleash your intuition Introducing Qlik Sense Unleash your intuition Qlik Sense is a next-generation self-service data visualization application that empowers everyone to easily create a range of flexible, interactive visualizations

More information

Data Governance for Regulated Industries

Data Governance for Regulated Industries Data Governance for Regulated Industries Amir Halfon CTO, Worldwide Financial Service Agenda Components of Data Governance Challenges Solutions and Case Studies Q&A SLIDE: 2 Data Governance Considerations

More information

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Exploiting Data at Rest and Data in Motion with a Big Data Platform Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags

More information

INTRODUCTION TO CASSANDRA

INTRODUCTION TO CASSANDRA INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open

More information

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Cloud Integration and the Big Data Journey - Common Use-Case Patterns Cloud Integration and the Big Data Journey - Common Use-Case Patterns A White Paper August, 2014 Corporate Technologies Business Intelligence Group OVERVIEW The advent of cloud and hybrid architectures

More information

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com;

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Microsoft Big Data Solutions Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Why/What is Big Data and Why Microsoft? Options of storage and big data processing in Microsoft Azure. Real Impact of Big

More information

Solution White Paper Connect Hadoop to the Enterprise

Solution White Paper Connect Hadoop to the Enterprise Solution White Paper Connect Hadoop to the Enterprise Streamline workflow automation with BMC Control-M Application Integrator Table of Contents 1 EXECUTIVE SUMMARY 2 INTRODUCTION THE UNDERLYING CONCEPT

More information

Solving the Security Puzzle

Solving the Security Puzzle Solving the Security Puzzle How Government Agencies Can Mitigate Today s Threats Abstract The federal government is in the midst of a massive IT revolution. The rapid adoption of mobile, cloud and Big

More information

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi Judith Hurwitz President and CEO Sponsored by Hitachi Introduction Only a few years ago, the greatest concern for businesses was being able to link traditional IT with the requirements of business units.

More information

DATA EXPERTS MINE ANALYZE VISUALIZE. We accelerate research and transform data to help you create actionable insights

DATA EXPERTS MINE ANALYZE VISUALIZE. We accelerate research and transform data to help you create actionable insights DATA EXPERTS We accelerate research and transform data to help you create actionable insights WE MINE WE ANALYZE WE VISUALIZE Domains Data Mining Mining longitudinal and linked datasets from web and other

More information

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3

More information

ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE

ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE JEFF LUNDBERG MAY 23, 2012 WEBTECH EDUCATIONAL SERIES ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE IT organizations are under

More information

CitusDB Architecture for Real-Time Big Data

CitusDB Architecture for Real-Time Big Data CitusDB Architecture for Real-Time Big Data CitusDB Highlights Empowers real-time Big Data using PostgreSQL Scales out PostgreSQL to support up to hundreds of terabytes of data Fast parallel processing

More information

The Next Wave of Data Management. Is Big Data The New Normal?

The Next Wave of Data Management. Is Big Data The New Normal? The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management

More information

HOW THE DATA LAKE WORKS

HOW THE DATA LAKE WORKS HOW THE DATA LAKE WORKS by Mark Jacobsohn Senior Vice President Booz Allen Hamilton Michael Delurey, EngD Principal Booz Allen Hamilton As organizations rush to take advantage of large and diverse data

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases

DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases DATAMEER WHITE PAPER Beyond BI Big Data Analytic Use Cases This white paper discusses the types and characteristics of big data analytics use cases, how they differ from traditional business intelligence

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

The Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success

The Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success The Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success 1 Table of Contents Abstract... 3 Introduction... 3 Requirement #1 Smarter Customer Interactions... 4 Requirement

More information

There s no way around it: learning about Big Data means

There s no way around it: learning about Big Data means In This Chapter Chapter 1 Introducing Big Data Beginning with Big Data Meeting MapReduce Saying hello to Hadoop Making connections between Big Data, MapReduce, and Hadoop There s no way around it: learning

More information

Investor Presentation. Second Quarter 2015

Investor Presentation. Second Quarter 2015 Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

Discover A New Path For Your Healthcare Data and Storage

Discover A New Path For Your Healthcare Data and Storage Discover A New Path For Your Healthcare Data and Storage Enable Your IT With Healthcare Storage Virtualization Using Your Data, Your Storage, Your Way In healthcare IT, your mission is the smooth running

More information

Integrating Big Data into Business Processes and Enterprise Systems

Integrating Big Data into Business Processes and Enterprise Systems Integrating Big Data into Business Processes and Enterprise Systems THOUGHT LEADERSHIP FROM BMC TO HELP YOU: Understand what Big Data means Effectively implement your company s Big Data strategy Get business

More information

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining BUSINESS INTELLIGENCE Bogdan Mohor Dumitrita 1 Abstract A Business Intelligence (BI)-driven approach can be very effective in implementing business transformation programs within an enterprise framework.

More information

White Paper: Big Data and the hype around IoT

White Paper: Big Data and the hype around IoT 1 White Paper: Big Data and the hype around IoT Author: Alton Harewood 21 Aug 2014 (first published on LinkedIn) If I knew today what I will know tomorrow, how would my life change? For some time the idea

More information

Integrate Big Data into Business Processes and Enterprise Systems. solution white paper

Integrate Big Data into Business Processes and Enterprise Systems. solution white paper Integrate Big Data into Business Processes and Enterprise Systems solution white paper THOUGHT LEADERSHIP FROM BMC TO HELP YOU: Understand what Big Data means Effectively implement your company s Big Data

More information

Business white paper The disruptive power of big data

Business white paper The disruptive power of big data Business white paper The disruptive power of big data How big data analytics is transforming business Business white paper Table of contents 3 Executive overview: The big data revolution 4 The big data

More information

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with

More information

Big Data Comes of Age: Shifting to a Real-time Data Platform

Big Data Comes of Age: Shifting to a Real-time Data Platform An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for SAP April 2013 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents Introduction... 1 Drivers of Change...

More information

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms

More information

Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data

Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data Transforming Data into Intelligence Scalable Enterprise Data Integration Your business agility depends on how fast you can access your complex data Big Data Data Warehousing Data Governance and Quality

More information

WHITE PAPER. Realizing the Value of Unified Communications

WHITE PAPER. Realizing the Value of Unified Communications Realizing the Value of Unified Communications TABLE OF CONTENTS Executive Summary...3 Maximizing the Benefit of Unified Messaging...3 Why Should You Consider Unified Messaging?...3 Overview...3 The Challenges

More information

Integration Maturity Model Capability #5: Infrastructure and Operations

Integration Maturity Model Capability #5: Infrastructure and Operations Integration Maturity Model Capability #5: Infrastructure and Operations How improving integration supplies greater agility, cost savings, and revenue opportunity TAKE THE INTEGRATION MATURITY SELFASSESSMENT

More information

How To Use Hp Vertica Ondemand

How To Use Hp Vertica Ondemand Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater

More information