Analytics of Textual Big Data
Text Exploration of the Big Untapped Data Source

A Whitepaper

Rick F. van der Lans
Independent Business Intelligence Analyst
R20/Consultancy
December 2013

Sponsored by
Copyright 2013 R20/Consultancy. All rights reserved. InterSystems Caché, InterSystems Ensemble, InterSystems HealthShare, InterSystems DeepSee, and TrakCare are registered trademarks of InterSystems Corporation. InterSystems iKnow is a trademark of InterSystems Corporation. Trademarks of companies referenced in this document are the sole property of their respective owners.
1 Introduction: Analyzing Textual Big Data

Big Data for Enriching Analytical Capabilities
Big data is revolutionizing the world of business intelligence and analytics. Gartner [1] predicts that big data will drive $232 billion in spending through 2016, Wikibon [2] claims that by 2017 big data revenue will have grown to $47.8 billion, and McKinsey Global Institute [3] indicates that big data has the potential to increase the value of the US health care industry by $300 billion and the value of Europe's public sector administration by €250 billion.

The big data breakthrough comes from innovative big data analytics. For some companies the primary challenge is analyzing massive amounts of structured, primarily numerical data. Think of credit card companies with millions of cardholders and billions of transactions that are looking for fraud patterns. Analyzing massive amounts of structured data may require new software strategies and technologies, but it is generally straightforward and readily achievable.

Not all big data is structured, however. Big data comes in all shapes and sizes, and the greatest challenge is that a large portion of it is unstructured, often in the form of text. Think of all the data used or created in a typical business: emails, documents, voice transcripts from customer calls, conference notes, and more. Most of this data is unstructured text.

Even in an industry dominated by numerical data, text abounds. In commercial banking, for example, financial statements and loan activity are well-structured data, but to understand a loan you have to read the file, which is full of correspondence, written assessments and notes from every phone call and meeting. To really understand the risk in a lending portfolio you need to read and understand every loan file.

In a medical environment, many structured data sources exist, such as test results over time and coded fields. However, some of the most valuable data is found within a clinician's textual notes: his impressions, what he learned from conversing with the patient, why he reached his diagnosis or ordered a test, what he concluded from various test results, and much more. In most large clinical settings these invaluable notes form very large data sets that, although increasingly digitized, are rarely analyzed.

Analyzing Textual Data
Advanced analytical capabilities have always been available for analyzing non-textual data. Almost every organization knows how to turn the structured data that its business processes have collected over the years into valuable business insights. Countless reporting and analytical tools are available to assist. Admittedly, these tools and algorithms may have to be adapted somewhat to run fast on big data (for example, they may have to use in-memory techniques and dedicated hardware), but the algorithms stay the same and are well known.

[1] Gartner, October 2012; see 2016/
[2] Wikibon, Big Data Vendor Revenue and Market Forecast, August 26, 2013; see
[3] McKinsey Global Institute, Big Data: The Next Frontier for Innovation, Competition, and Productivity, June 2011; see
But what about all the textual data that has been gathered in emails, document management systems, call center log files, instant messaging transcripts and voice transcripts from customer calls? And what about all the external textual data, such as blogs, tweets, Facebook messages and informational websites? A wealth of information is hidden in the vast amounts of textual data being created every day. The challenge for every organization is to extract valuable business insights from this mountain of data, insights that allow it to, for example, optimize its business processes, improve the level of customer care it offers, personalize products, and improve product development.

This paper outlines the benefits and challenges of analyzing textual big data. It also discusses InterSystems iKnow technology, which offers an easier, less time-consuming way to unlock the information contained in textual data.

2 Textual Big Data: The Big Untapped Data Source

Business Reasons for Analyzing Textual Big Data
Almost every industry can benefit from analyzing textual data, especially those industries in which storing text is crucial for business operations, such as advertising, healthcare, legal, pharmaceuticals, publishing and real estate. For example, a hospital may want to analyze the descriptions that specialists write in patient files to discover patterns in allergic reactions to medications. An electronics company may want to analyze messages on Twitter to find out whether its products are mentioned and whether the tweets are positive or not (a practice commonly referred to as sentiment analysis). Call center transcripts and log files can be analyzed to identify frequently asked questions, or to determine whether specific products have been mentioned more often, or in a different context, than usual over the past couple of weeks.

What Exactly is Analyzing Text?
We hardly need to analyze text if we only want to know how many words appear in a document, or how often a word appears. This can be determined with a simple, purely mathematical algorithm. But what if we want to answer more complex questions, such as:

- How often do particular symptoms and medications appear together in patient files?
- Does a text carry a positive or negative sentiment, and to which concepts does this sentiment apply?
- How many texts deal with the bankruptcy of Bank X?
- For each month, how many texts dealt with brain surgery?
- Which concept appears most often in texts together with the concept credit card fraud?
- Which book is, in terms of the concepts it uses, most similar to Jeff Shaara's Gods and Generals, and which one is most different?
- How do we identify the characteristics of customer calls that resulted in escalation?

These questions are much harder to handle. For example, how do we determine whether a text has a positive sentiment? How do we measure the difference between the contents of two books? These are the types of questions for which text analysis is used.

Analyzing text can also be defined as deriving structured data from unstructured text. For example, when a text is analyzed to determine whether it is positive or not, the result is a structured data value: yes or no. The answers to the first and fourth questions above also lead to structured data. The advantage of deriving structured data is that the newly created data can be combined easily with other structured data sources and processed by well-known algorithms.

The Index and the Thesaurus
Historically, the first attempts at understanding texts were based on indexing. Indexing a text means selecting terms from a document that give a sufficient indication of its subject matter, so that the document can be retrieved with a specific query. Indexing, however, has its limitations. First, developing an index is time-consuming: which words should be indexed? Second, if the right terms are not indexed, important and relevant texts may not be found, or irrelevant texts may be returned.

To overcome the problems with indexing, the concept of a thesaurus was introduced quite some time ago. A thesaurus defines the relations between terms. In a way, it can be seen as an intelligent index. Deploying a thesaurus results in a more accurate set of texts being found. But building and managing a thesaurus is also time-consuming. In addition, a thesaurus must be kept up to date as new words and new domains are introduced.

The Work Upfront
Most text analysis tools require work in advance, such as setting up a thesaurus. Such tools are only useful if there is enough time to do all this work. What if a new and urgent question arises that the thesaurus doesn't cater for? Or what if new texts become available for analysis and questions have to be asked right away?

Also, with most text analysis tools the goal of the analysis must be clear in advance. In other words, the tool is guided by the analyst. For example, search technology requires that one or more words are entered first. Another example is analyzing patient files to discover new insights into the effect a particular medication has on patients with diabetes. As can be imagined, a different thesaurus may be needed when the goal is to look for historical patterns in side effects after surgery, even when the same patient files are analyzed. A thesaurus limits the analytical freedom and thus limits the potential outcomes.

3 Exploring Textual Big Data Without the Hassle

Use of Text Analysis Today
Organizations can benefit from analyzing textual data. Unfortunately, most organizations have barely scratched the surface. This is a missed opportunity. One of the dominant reasons why organizations have not tapped into their textual big data is that most text analysis tools and technologies require time-consuming upfront work. Indexes, thesauri and ontologies have to be developed before any real analytical work can start.

The Need for Text Exploration
Analysis has to be able to follow the speed of business. For text analysis, this means that technology is needed that allows text to be analyzed without all that work in advance. This form of text analysis is called text exploration.
A hospital environment is a good example of where text exploration can be used. Imagine a patient is brought to the emergency room. If doctors must act quickly, they probably don't have time to read the full patient file. What they want is a summary that shows all the important aspects related to the patient. Is he diabetic? Does he usually have high blood pressure? What kind of medication is he taking? Has he been here before? This requires text analysis on the spot. The analysis should also be unguided: the doctors may know nothing about this patient, so the technology should guide the clinician rather than require the clinician to guide the tool.

Another example is analyzing tweets. Every day new words (acronyms in many cases) and hashtags are invented. It would be impossible to constantly update a thesaurus with them. Furthermore, is there time to develop one?

Many situations exist in which there is no time for all this preparation work. Here, text exploration is needed to deliver the desired business insights.

The Three Requirements for Text Exploration
To summarize, text exploration is a form of text analysis that meets the following three requirements:

- No advance preparations: There should be no need to develop thesauri or ontologies before the analysis work can be started. It should be possible to start text analysis right away without any preparations, even if the text covers a new domain.
- Unguided analysis: Analysts should be able to invoke the text analysis technology without having to specify a goal in advance. The text analysis technology must be able to analyze the text in an unguided style.
- Self-service: Analysts should be able to invoke the text analysis without the help of IT experts, although connecting the tool to particular data sources may require some assistance.
4 InterSystems iKnow Technology for Analyzing Textual Big Data

The Classic Approach of Text Analysis
Tools for analyzing text usually try to identify the important concepts in sentences. For example, in the sentence "The enterprise search market is being reshaped by new consumer experiences", the key concepts are "enterprise search market" and "new consumer experiences". Most text analysis tools try to locate these concepts by looking at individual words, which results in the words consumer, enterprise, experience, market and search. These are then considered the key concepts in this text.

Some tools search for two-word and even three-word phrases. The result of this approach, however, can be that words are connected that should not be connected. Take the following sentence as an example: "Michael Phelps breaks a world record." If two-word phrases are identified, the result contains the concepts "Michael Phelps" and "Phelps breaks". The first one is probably useful, but the second isn't. And if we search for all two-word phrases in the first sentence, we get "enterprise search" and "search market", but not "enterprise search market". This more classic approach doesn't guarantee that the words that are linked together form the right concept.
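The limitation of fixed-length phrases can be reproduced with a few lines of Python. This is a minimal sketch, not taken from any particular tool; the function name word_ngrams is an assumption made for illustration. It lists every run of n consecutive words in the two example sentences.

```python
def word_ngrams(sentence, n):
    """Return every run of n consecutive words in the sentence."""
    words = sentence.rstrip(".").split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


phelps = "Michael Phelps breaks a world record."
market = ("The enterprise search market is being reshaped "
          "by new consumer experiences.")

# Two-word phrases mix useful concepts with accidental word pairs.
print(word_ngrams(phelps, 2))
print(word_ngrams(market, 2))

# Three-word phrases do find "enterprise search market", but only
# alongside fragments such as "search market is".
print(word_ngrams(market, 3))
```

Whatever fixed length is chosen, the output contains both real concepts and fragments that cut across concept boundaries, and nothing in the phrase length itself tells the two apart.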
In addition, to make sense of the sentences, developers have to build up thesauri and ontologies. This can be quite an effort and requires domain knowledge. For every domain a new thesaurus and ontology must be created and maintained. In most situations this process never ends, because the use of words changes over time. New terms are introduced and the meaning of words can change. Take tweets as an example: every day, important new hashtags are introduced. New terms are also introduced in the BI domain. Who had heard of the term big data a few years ago?

InterSystems Approach to Text Analysis
The approach taken by InterSystems to analyze text differs from many other approaches. InterSystems has introduced a technology called iKnow that breaks texts into sentences, and then breaks sentences into concepts and relations.

Sentences are decomposed by first identifying the relations in a sentence. Verbs can represent relations between concepts, but other language constructs can signify relations as well. By identifying the relations within a sentence, iKnow has a better chance of discovering the desired concepts. For example, in the sentence "The programmer found bugs", iKnow considers the verb "found" to be a relation separating the concepts "programmer" and "bugs". In iKnow this is called a concept-relation-concept sequence (CRC). Note that iKnow automatically discards irrelevant stop words, such as "the" and "an", from sentences.

As indicated, other language constructs can indicate a relation. For example, in the sentence snippet "Mammals, such as elephants and lions", a relation exists between mammals and elephants and between mammals and lions. Another example is the sentence "I like the car in the showroom." Here, the word "in" represents a relation between the concepts "car" and "showroom". iKnow has been designed to recognize many different language constructs that can identify relations.

If the concepts and relations consist of multiple words, iKnow still recognizes them. For example, in the sentence "The enterprise search market is being reshaped by new consumer experiences", iKnow discovers that the verb clause "is being reshaped by" represents the relation between the two concepts "enterprise search market" and "new consumer experiences".

This fast and domain-independent entity identification process effectively decomposes sentences into graph structures in which concepts are linked to one another through relationships. These graph structures, together with the contextual metadata and metrics iKnow collects along the way, can then be used for advanced analysis within a text or across a corpus of texts.

iKnow is not restricted to analyzing simple sentences consisting of concept-concept pairs (CCs) and CRCs. It can handle more complex sentence structures consisting of multiple CRCs. These are called CRC sequences.

Note: InterSystems iKnow technology supports several languages, including Dutch, English, French, German, Portuguese and Spanish. Japanese and Russian are in development.
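To make the idea of a graph of concepts and relations more concrete, the following Python sketch stores the example CRCs from this section as triples and links them into a simple graph. It is an illustration only and says nothing about how iKnow is implemented or what its API looks like: the names CRC and build_concept_graph are hypothetical, the triples are written by hand rather than derived from text, and the relation label "such as" for the mammals snippet is an assumption.

```python
from collections import defaultdict
from typing import NamedTuple


class CRC(NamedTuple):
    """One concept-relation-concept sequence."""
    left: str
    relation: str
    right: str


# Hand-written decompositions of the example sentences in the text.
# Assumption: "such as" is used as the relation label for the mammals
# snippet; the text only states that a relation exists there.
CRCS = [
    CRC("programmer", "found", "bugs"),
    CRC("mammals", "such as", "elephants"),
    CRC("mammals", "such as", "lions"),
    CRC("car", "in", "showroom"),
    CRC("enterprise search market", "is being reshaped by",
        "new consumer experiences"),
]


def build_concept_graph(crcs):
    """Map each concept to the (relation, concept) pairs it links to."""
    graph = defaultdict(list)
    for crc in crcs:
        graph[crc.left].append((crc.relation, crc.right))
        graph.setdefault(crc.right, [])  # keep concepts without outgoing links
    return dict(graph)


graph = build_concept_graph(CRCS)
for relation, concept in graph["mammals"]:
    print(f"mammals --[{relation}]--> {concept}")
```

Once sentences are held in a structure like this, questions such as "which concepts appear together with a given concept" become lookups in structured data rather than searches through raw text.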
How InterSystems iKnow Technology Supports the Three Requirements for Text Exploration
iKnow supports all three key requirements for text exploration described in Section 3:

- No advance preparations: iKnow doesn't require the development of thesauri and ontologies. It can analyze text from a domain or industry it has never analyzed before, and is still able to discover the important concepts.
- Unguided analysis: iKnow does not need a goal. Unlike search technology, for example, it does not need a search term before it can analyze the text. iKnow can analyze text in an unguided or bottom-up fashion. Analysts can study the results, which can then trigger them to search in a certain direction.
- Self-service: Analysts can use InterSystems DeepSee to invoke all the text analytical features of iKnow. DeepSee can be categorized as a self-service analytical technology that allows users to develop their own reports and do their own analysis without the help of IT experts.

Using iKnow With Big Data
InterSystems iKnow technology is embedded in InterSystems Caché, a high-performance database server. Caché's unique multidimensional data engine makes it ideal for storing, managing and querying all forms of data, including textual data. Its performance and scalability have been proven in many big data environments. Any Caché-based application can invoke iKnow and can therefore analyze both text and structured data.

5 Summary

Everyone agrees that big data can enrich the analytical capabilities of organizations. For many organizations today this means crunching huge amounts of highly structured and mostly numerical data. In other words, the focus has been on analyzing non-textual and highly structured data.

However, a wealth of information is hidden in vast textual data sources, such as emails, document management systems, call center log files, instant messaging transcripts and voice transcripts from customer calls, not to mention external textual data, such as blogs, tweets, Facebook messages and informational websites. For most organizations these textual data sources are still an untapped source of information.

The challenge for many organizations will be to extract valuable business insights from this mountain of textual data in order to, for example, optimize business processes, improve the level of customer care, personalize products and improve product development.

Text exploration is a form of text analysis that allows organizations to analyze textual data at the speed of business. No or minimal upfront work is required. Text can be analyzed when the business needs it.

InterSystems iKnow is a breakthrough technology aimed at text exploration. It allows organizations to analyze their big textual data for business insights as needed.
About the Author Rick F. van der Lans
Rick F. van der Lans is an independent analyst, consultant, author and lecturer specializing in data warehousing, business intelligence, data virtualization and database technology. He works for R20/Consultancy (www.r20.nl), a consultancy company he founded.

Rick is chairman of the annual European Business Intelligence and Enterprise Data Conference (organized in London). He writes for the eminent B-eye-Network [4] and other websites. In 2009 he introduced the business intelligence architecture called the Data Delivery Platform in a number of articles [5], all published at BeyeNetwork.com.

He has written several books on SQL. Published in 1987, his popular Introduction to SQL [6] was the first English book on the market devoted entirely to SQL. After more than twenty years, this book is still being sold, and it has been translated into several languages, including Chinese, German and Italian. His latest book [7], Data Virtualization for Business Intelligence Systems, was published in 2012.

You can also get in touch with him via LinkedIn.

About InterSystems Corporation
Founded in 1978, InterSystems Corporation is a US$446,000,000 privately held software company with offices in 25 countries and corporate headquarters in Cambridge, Massachusetts. It provides the premier platform for connected healthcare, and its innovative products are widely used in other industries that demand the highest software performance and reliability. Clients include TD Ameritrade, European Space Agency, U.S. Department of Veterans Affairs, Johns Hopkins Hospital, Belgium Police, Mediterranean Shipping Company, and thousands of other successful organizations. Leading application providers also leverage the high performance and reliability of InterSystems' advanced technology in their own products. These organizations include Epic Systems, Fiserv, GE Healthcare, and hundreds of others.

[4] See
[5] See
[6] R.F. van der Lans, Introduction to SQL; Mastering the Relational Database Language, fourth edition, Addison-Wesley.
[7] R.F. van der Lans, Data Virtualization for Business Intelligence Systems, Morgan Kaufmann Publishers, 2012.
Extending Business Intelligence with Text Exploration Technology A Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy June 2013 Sponsored by Copyright 2013 R20/Consultancy.
Data Warehouse Optimization Embedding Hadoop in Data Warehouse Environments A Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy September 2013 Sponsored by Copyright
What is Data Virtualization? by Rick F. van der Lans, R20/Consultancy August 2011 Introduction Data virtualization is receiving more and more attention in the IT industry, especially from those interested
Active AnAlytics: Driving informed Decisions leading to Better clinical AnD financial outcomes An InterSystems White Paper for Healthcare IT Executives Active AnAlytics: Driving informed Decisions leading
Empowering Operational Business Intelligence with Data Replication A Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy April 2013 Sponsored by Copyright 2013 R20/Consultancy.
Discovering Business Insights in Big Data Using SQL-MapReduce A Technical Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy July 2013 Sponsored by Copyright 2013
The New Rules for Integration A Unified Integration Approach for Big Data, the Cloud, and the Enterprise A Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy September
CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING Mary-Elizabeth ( M-E ) Eddlestone Principal Systems Engineer, Analytics SAS Customer Loyalty, SAS Institute, Inc. Is there valuable
Advanced Analytics The Way Forward for Businesses Dr. Sujatha R Upadhyaya Nov 2009 Advanced Analytics Adding Value to Every Business In this tough and competitive market, businesses are fighting to gain
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
INTERSYSTEMS CACHÉ AS AN ALTERNATIVE TO IN-MEMORY DATABASES David Kaaret InterSystems Corporation INTERSYSTEMS CACHÉ AS AN ALTERNATIVE TO IN-MEMORY DATABASES Introduction To overcome the performance limitations
I N T E R S Y S T E M S W H I T E P A P E R F O R F I N A N C I A L SERVICES EXECUTIVES Deploying an elastic Data Fabric with caché Deploying an elastic Data Fabric with caché Executive Summary For twenty
HOW INTERSYSTEMS TECHNOLOGY ENABLES BUSINESS INTELLIGENCE SOLUTIONS A white paper by: Dr. Mark Massias Senior Sales Engineer InterSystems Corporation HOW INTERSYSTEMS TECHNOLOGY ENABLES BUSINESS INTELLIGENCE
Key Data Replication Criteria for Enabling Operational Reporting and Analytics A Technical Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy May 2013 Sponsored by
Enriching Big Data with Mainframe Data Using Data Virtualization A Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy July 2014 Sponsored by Copyright 2014 R20/Consultancy.
So Many Tools, So Much Data, and So Much Meta Data Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands. All rights reserved. No part of this material may be reproduced, stored in a retrieval
The Key Challenges FaCing MediCal device ManuFaCTurers (and how to meet them with strategic interoperability) The Key Challenges FaCing MediCal device ManuFaCTurers (and how to meet them with strategic
SAP Product Brief SAP s for Small Businesses and Midsize Companies SAP BusinessObjects Business Intelligence, Edge Edition Objectives Winning with an Intuitive Business Intelligence for Midsize Companies
Unlocking The Value of the Deep Web Harvesting Big Data that Google Doesn t Reach Introduction Every day, untold millions search the web with Google, Bing and other search engines. The volumes truly are
SAP BusinessObjects BI and EIM 4.0 Safe Harbor Statement This document is intended to outline future product direction, and is not a commitment by SAP to deliver any given code or functionality. Any statements
What is Data Virtualization? Rick F. van der Lans Data virtualization is receiving more and more attention in the IT industry, especially from those interested in data management and business intelligence.
More than a Database InterSystems Data Platform Robert Nagle Why a Data Platform? To Develop the best Operational Applications for Enterprises Requires Incredibly Scalable, Robust, Reliable Storage Engine
Infor10 Corporate Performance Management (PM10) Deliver better information on demand. The speed, complexity, and global nature of today s business environment present challenges for even the best-managed
EC Wise Report: Unlocking the Value of Deeply Unstructured Data Feedback from the Market: Forest Rim enables significant improvements in the quality of semantic information derived from text data. This
Embedded Real-time Business Intelligence. Discover the Treasures. Make Applications More Valuable with Embedded Real-time Business Intelligence You can enhance your transactional applications with features
SAP Solutions for Small Businesses and Midsize Companies SAP BusinessObjects Edge BI, Standard Package Preferred Business Intelligence Choice for Growing Companies SAP BusinessObjects Edge BI, Standard
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It s of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed
SAP Brief SAP Technology SAP Sybase IQ Objectives Tap into Big Data at the Speed of Business A simpler, more affordable approach to Big Data analytics A simpler, more affordable approach to Big Data analytics
InterSyStemS HealtHSHare: a StrategIc Platform for BreaktHrougH HealtHcare SolutIonS A Unique Business Opportunity for InterSystems Application Partners InterSyStemS HealtHSHare: a StrategIc Platform for
E SB DE CIS IO N GUID E Business Transformation for Application Providers 10 Questions to Ask Before Selecting an Enterprise Service Bus 10 Questions to Ask Before Selecting an Enterprise Service Bus InterSystems
IBM Software Business Analytics IBM SPSS Modeler Solve Your Toughest Challenges with Data Mining Use predictive intelligence to make good decisions faster Solve Your Toughest Challenges with Data Mining
More than a Database InterSystems Data Platform Robert Nagle Why a Data Platform? To Develop the best Operational Applications for Enterprises Requires Incredibly Scalable, Robust, Reliable Storage Engine
Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing
IBM Content Analytics with Enterprise Search, Version 3.0 Highlights Enables greater accuracy and control over information with sophisticated natural language processing capabilities to deliver the right
NATURAL LANGUAGE BI IS NATURAL LANGUAGE THE FUTURE OF BUSINESS INTELLIGENCE? 4 BI challenges Natural Language is solving Page 03 Introduction Page 04 The solution to Unstructured Data Page 05 Democratizing
Big Data in the Nordics 2012 A survey about increasing data volumes and Big Data analysis among private and governmental organizations in Sweden, Norway, Denmark and Finland. Unexplored Big Data Potential
decision ready. How Master Data Management powers big data decision making. Building an enterprise architecture that s decision ready. Bringing discipline to big data. The trouble with insight is it doesn
Big data: Unlocking strategic dimensions By Teresa de Onis and Lisa Waddell Dell Inc. New technologies help decision makers gain insights from all types of data from traditional databases to high-visibility
IBM Software IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster Solve your toughest challenges with data mining Imagine if you could
Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For
Making critical connections: predictive analytics in government Improve strategic and tactical decision-making Highlights: Support data-driven decisions using IBM SPSS Modeler Reduce fraud, waste and abuse
Harnessing the Power of Big Data for Real-Time IT: Sumo Logic Log Management and Analytics Service A Sumo Logic White Paper Introduction Managing and analyzing today s huge volume of machine data has never
Data Services: The Marriage of Data Integration and Application Integration A Whitepaper Author: Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy July, 2012 Sponsored by Copyright
TRANSACTION DATA ENRICHMENT AS THE FIRST STEP ON THE BIG DATA JOURNEY A key part of its industry-leading platform for digital financial services, the new Yodlee TransactionDataEnrichment solution enables
Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
USING INTERSYSTEMS CACHÉ FOR SECURELY STORING CREDIT CARD DATA Andreas Dieckow Principal Product Manager InterSystems Corporation USING INTERSYSTEMS CACHÉ FOR SECURELY STORING CREDIT CARD DATA Introduction
White Paper Analyzing Big Data: The Path to Competitive Advantage by Marcia Kaplan Contents: Introduction; How Big is Big Data?
WHITEPAPER OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT A top-tier global bank's end-of-day risk analysis jobs didn't complete in time for the next start of trading day. To solve
Microsoft Big Data Solution Brief Contents: Introduction; The Microsoft Big Data Solution; Key Benefits; Immersive Insight, Wherever You Are; Connecting with the World's Data; Any Data,
IBM Cognos Performance Management Solutions for Oracle Gain more value from your Oracle technology investments Highlights Deliver the power of predictive analytics across the organization Address diverse
customer care solutions from Nuance white paper :: A Guide to Successful Intelligent Virtual Assistants Why Best-in-Class Technology Alone Is Not Enough NUANCE :: customer care solutions More than ever
Interactive product brochure :: Nina™ Mobile: The Virtual Assistant for Mobile Customer Service Apps This PDF contains embedded interactive features. Make sure to download and save the file to your
High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances Highlights IBM Netezza and SAS together provide appliances and analytic software solutions that help organizations improve
Text Analytics Beginner's Guide: Extracting Meaning from Unstructured Data Contents: Text Analytics; Use Cases; Terms; Trends; Scenario; Resources. 2013 Angoss Software Corporation. All rights
SAP BusinessObjects Edge BI, Standard Package Preferred Business Intelligence Choice for Growing Companies SAP Solutions for Small Business and Midsize Companies Executive Summary Business Intelligence
ETPL: Extract, Transform, Predict and Load An Oracle White Paper March 2006 Contents: Executive summary; Why Extract, transform, predict and load?; Basic requirements
5 PLACES IN YOUR HOSPITAL WHERE ENTERPRISE CONTENT MANAGEMENT CAN HELP WHAT IS ECM AND WHY MIGHT YOU NEED IT? Although technology continues to improve how healthcare organizations share information both
Data Discovery, Analytics, and the Enterprise Data Hub Version: 101 Table of Contents: Summary; Used Data and Limitations of Legacy Analytic Architecture; The Meaning of Data Discovery & Analytics; Machine
by Michael Garzone, Solutions Director, Technology Solutions. Big Data seems to have become the latest marketing buzzword. While there is a lot of talk about it, do
Data Virtualization for Business Intelligence Agility A Whitepaper Author: Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy February 9, 2012 Sponsored by Copyright 2012 R20/Consultancy.
WHITE PAPER Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Anatomy of Cyber Threats, Vulnerabilities, and Attacks ACTIONABLE THREAT INTELLIGENCE FROM ONTOLOGY-BASED ANALYTICS Copyright 2015 Recorded Future,
Making Critical Connections: Predictive Analytics in Improve strategic and tactical decision-making Highlights: Support data-driven decisions. Reduce fraud, waste and abuse. Allocate resources more effectively.
SPAN White Paper: Sentiment Analysis on Big Data, Machine Learning Approach. Several sources on the web provide deep insight about people's opinions on the products and services of various companies. Social
Judith Hurwitz President and CEO Sponsored by Hitachi Introduction Only a few years ago, the greatest concern for businesses was being able to link traditional IT with the requirements of business units.
IBM Global Business Services Microsoft Dynamics CRM solutions from IBM Power your productivity Highlights Win more deals by spending more time on selling and
Streamlining the Process of Business Intelligence with JReport An ENTERPRISE MANAGEMENT ASSOCIATES (EMA) Product Summary from 2014 EMA Radar for Business Intelligence Platforms for Mid-Sized Organizations
ADVANTAGES OF IMPLEMENTING A DATA WAREHOUSE DURING AN ERP UPGRADE Upgrading an ERP system presents a number of challenges to many organizations.
IBM Software Business Analytics IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster
SINEQUA FOR LIFE SCIENCES DRIVE INNOVATION. ACCELERATE RESEARCH. SHORTEN TIME-TO-MARKET. 6 Ways to Leverage Big Data Search & Content Analytics for a Pharmaceutical Company Improve Cooperation in R&D Catalyze
INSIGHT SDL BeGlobal: Machine Translation for Multilingual Search and Text Analytics Applications José Curto David Schubmehl IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.872.8200
Community Driven Apache Hadoop Apache Hadoop Patterns of Use April 2013 2013 Hortonworks Inc. http://www.hortonworks.com Big Data: Apache Hadoop Use Distilled There certainly is no shortage of hype when
Reveal More Value in Your Data with Location Analytics Brought to you compliments of: In nearly every industry, executives, managers and employees are increasingly using maps in conjunction with enterprise
WHY IT ORGANIZATIONS CAN'T LIVE WITHOUT QLIKVIEW A QlikView White Paper November 2012 qlikview.com Table of Contents: Unlocking The Value Within Your Data Warehouse; Champions to the Business Again: Controlled
Business Intelligence 3.0: Revolutionizing Organizational Data WHITEPAPER Introduction Organizations today are all faced with the same challenge: how to make better decisions faster. While this challenge
Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Information has gone from scarce to super-abundant. That brings huge new benefits. The Economist
An Oracle White Paper October 2013 Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics Introduction: The value of analytics is so widely recognized today that all mid
White Paper Ten Things You Need to Know About Data Virtualization What is Data Virtualization? Data virtualization is an agile data integration method that simplifies information access. Data virtualization
CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE Michael Diederich, Microsoft CMG Research & Insights Introduction The rise of social media platforms like Facebook and Twitter has created new
Cloud Computing: The impact for IT departments and the IT professional by Maurice van der Woude Contents: Preface; Organizational changes; Moving
APPLICATIONS A WHITE PAPER SERIES BUILDING A DATA WAREHOUSE IS NO EASY TASK... THE RIGHT PEOPLE, METHODOLOGY, AND EXPERIENCE ARE EXTREMELY CRITICAL Eleven Steps to Success in Data Warehousing
IBM G-Cloud - IBM Social Media Analytics Software as a Service Service Definition 1 1. Summary 1.1 Service Description IBM Social Media Analytics Software as a Service is a powerful Cloud-based tool for
CASE STUDY KPMG Unlocks Hidden Value in Client Information with Smartlogic Semaphore Sponsored by: IDC David Schubmehl July 2014 IDC OPINION Dan Vesset Big data in all its forms and associated technologies,
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something
1 technavio insights About TechNavio Technavio is the research platform of Infiniti Research. Infiniti Research provides actionable market intelligence to leading companies worldwide. A team of 120 analysts
IBM Software Business Analytics IBM Cognos Business Intelligence A business intelligence agenda for midsize organizations: Six strategies for success
Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It's the process of analyzing and exploring
NOVEMBER 2014 TDWI E-Book Predictive Analytics: Revolutionizing Business Decision Making Contents: Q&A: Predictive Analytics 101; Who Should Be Building Predictive Models?; Exploratory Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on: Feb 7, 2012 What is Data mining? Data mining,
Social Media Marketing for Local Businesses The average number of hours a U.S. consumer spends on social media per week. - PQ Media, 2013 Social is the Norm A lot has changed in the 10 years since Facebook