Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research



Similar documents
SDL BeGlobal: Machine Translation for Multilingual Search and Text Analytics Applications

"Why Didn't We Do It Sooner?" Deployment of a New BI Solution at The Pain Center of Arizona

Worldwide Advanced and Predictive Analytics Software Market Shares, 2014: The Rise of the Long Tail

Worldwide Workload Management Software 2013 Vendor Shares

IDC MarketScape Excerpt: Worldwide HR BPO 2014 Vendor Assessment

SAS Enterprise Decision Management at a Global Financial Services Firm: Enabling More Rapid Implementation of Decision Models into Production

KPMG Unlocks Hidden Value in Client Information with Smartlogic Semaphore

How To Understand And Understand Cyber Group

Converged and Integrated Datacenter Systems: Creating Operational Efficiencies

Worldwide Cloud Systems Management Software 2013 Vendor Shares

Worldwide Datacenter Automation Software 2013 Vendor Shares

Worldwide Application Performance Management Software 2013 Vendor Shares

ScienceLogic Offers Unified Infrastructure Monitoring and Analytics for Hybrid IT

IT as a Service Emerges as a New Management Paradigm in the Software-Defined Datacenter Era

DevOps and the Cost of Downtime: Fortune 1000 Best Practice Metrics Quantified

Worldwide Problem Management Software Market Shares, 2014: 3rd Platform Technologies and Delivery Models Drive Growth

Incorporating Mobility into a Customer Experience Strategy

IDC MarketScape: Worldwide Datacenter Infrastructure Management 2015 Vendor Assessment

IDC MarketScape: Worldwide Service Desk Management Software 2014 Vendor Analysis

Microsoft Office 365: How the Hosted Exchange Server Is Redefining SMB Cloud IT Adoption

IDC MarketScape: Worldwide Oracle Implementation Services Ecosystem 2014 Vendor Assessment

IDC MarketScape: Worldwide Digital Enterprise Strategy Consulting Services 2015 Vendor Assessment

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Ricoh 1to1 Create Review

IDC MarketScape: Worldwide Service Desk Management Software 2014 Vendor Analysis

Business Analytics Software- as- a - Service Case Study: Market Research Provider Delivers Data Through the Cloud

Perspective: Cloud Solutions and Deployment for Healthcare Payers in 2014

IDC MarketScape: Worldwide Cloud Professional Services 2016 Vendor Assessment

The State of Mobility in the Enterprise in 2014: An IDC Survey of Devices, Platforms, Decisions, and Deployments

Ultimate Software: Successfully Navigating the Transition from On-Premise to Cloud ISV as a Public Company

2014 Human Capital Management Survey: HCM Buyer Actions and Plans

IDC MarketScape: U.S. Government Private Cloud IaaS 2014 Vendor Assessment

Worldwide Datacenter Automation Software Market Shares, 2014: Year of Cloud and DevOps

Long Term Care Group Deploys Zerto for Data Protection and Recovery for Virtual Environments

FirstRain Private Vendor Watchlist Profile: Next-Generation Personal Analytics Through Value-Added Content

IDC MarketScape: Worldwide Life Science Social Media Analytics 2014 Vendor Assessment

Business Networks: The Next Wave of Innovation

IDC MarketScape: Worldwide Telecom Service Provider 2013 Vendor Assessment

IDC MarketScape: Worldwide Business Consulting Strategy for Digital Operations 2015 Vendor Assessment

INSIGHT. IDC's Social Business Taxonomy, 2011 IDC OPINION IN THIS INSIGHT. Scott Guinn

Western European Organizations Turn to the Cloud for UCaaS

I D C E X E C U T I V E B R I E F

Worldwide Cloud Systems Management Software Market Shares, 2015: Year of Continued Expansion

IDC MarketScape: U.S. Government Private Cloud IaaS 2014 Vendor Assessment

How Collaboration Can Help Achieve Your Business Goals: A European Perspective

Allstate Getting Much More from Its IT Services with ServiceNow Cloud-Based IT Service Management Solution

Worldwide DDI Market Update

IDC MarketScape: Worldwide Managed Print and Document Services 2014 Hardcopy Vendor Assessment Focus on Managed Workflow Services

How To Understand Cloud Economics

Worldwide Security and Vulnerability Management Forecast and 2013 Vendor Shares

Worldwide Business Analytics Software Forecast and 2013 Vendor Shares

IDC MarketScape: Worldwide Life Science CRM Software 2015 Vendor Assessment

Cirba Targets Software-Defined Infrastructure Control with Workload-Aware Predictive Analytics

Worldwide Application Performance Management Software 2012 Vendor Shares

Vendor Assessment: 2014 Top 10 Life Science Software Vendors

IDC MarketScape: Worldwide Supply Chain Management Business Consulting Services 2014 Vendor Assessment

Mining for Insight: Rediscovering the Data Archive

Using Converged Infrastructure to Enable Rapid, Cost-Effective Private Cloud Deployments

U.S. IT Buyer Survey Shows Outsourcers Bring Strength to Cloud

Worldwide Cloud Systems Management Software 2012 Vendor Shares

W o r l d w i d e B u s i n e s s A n a l y t i c s S o f t w a r e F o r e c a s t a n d V e n d o r S h a r e s

Impact of Juniper Training and Certification on Network Management Activities

IDC MarketScape: Worldwide Big Data Consulting and Systems Integration Services 2016 Vendor Assessment

Data Management: Foundational Technologies for Health Insurance Exchange Success

Equinix Increases IT and Employee Productivity with ServiceNow Cloud-Based IT Service Automation Solution

IDC MarketScape: Worldwide Microsoft Enterprise Applications Implementation Services Ecosystem 2015 Vendor Assessment

IDC MarketScape: Worldwide Public Deployment-Centric Cloud Application Platform 2015 Vendor Assessment

AT&T Leverages HP Vertica Analytics Platform to Change the Economics of Providing Actionable Insights to Decision Makers

IDC MarketScape: Worldwide Life Science Sales and Marketing BPO 2015 Vendor Assessment

IDC MarketScape: Worldwide Strategy Consulting Services 2014 Vendor Assessment

IDC MarketScape: Worldwide Life Science Sales and Marketing ITO 2015 Vendor Assessment

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

IBM Enhances Portfolio by Acquiring Object-Based Storage Supplier Cleversafe

Workload Automation Emerges as Business Innovation Engine in the Era of Cloud, Big Data, and DevOps

IDC MarketScape: Worldwide Federated Identity Management and Single Sign-On 2014 Vendor Assessment

Cloud Contact Center Services Profile: LiveOps

Worldwide Cloud Systems Management Software Market Shares, 2014: Year of Hybrid Cloud

Global Headquarters: 5 Speen Street Framingham, MA USA P F

VersaPay Automates the Accounts Receivable Process

I N D U S T R Y D E V E L O P M E N T S A N D M O D E L S. I D C M a t u r i t y M o d e l : P r i n t a n d D o c u m e n t M a n a g e m e n t

Schneider Electric's SmartBunker Provides Smarter, More Secure Datacenters at the Edge

Journey to 3rd Platform Digital Customer Experience

W o r l d w i d e a n d U. S. M a n a g e d M o b i l i t y F o r e c a s t : U n i t e d S t a t e s L e a d s i n A d o p t i o n

COMPETITIVE ANALYSIS. Worldwide RDBMS 2006 Vendor Shares: Preliminary Results for the Top 5 Vendors IDC OPINION. Carl W. Olofson

Nimble Storage Leverages Operational Data to Drive Its Business with Analytics Delivered by HP Vertica

Transcription:

INSIGHT Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research David Schubmehl IDC OPINION Organizations are looking for better ways to perform research, given the explosion of information that is available via the Internet today. IDC estimates that the digital universe is growing 40% per year into the next decade, and by 2020, the digital universe the data we create and copy annually will reach 44ZB or 44 trillion gigabytes. A significant percentage of the digital universe is human-generated data in the form of patent filings, research papers, blog posts, and news articles as well as social media data such as Twitter, Facebook, reddit, and other social commentary forums. While aggregators such as LexisNexis, Thomson Scientific, PubMed, and Elsevier exist to help pull all of this information together, it is difficult to bridge the gap between traditional information retrieval tools and the synthesis of knowledge across information sources without the help of advanced analytics tools. Content Analyst Company LLC, a search technologies company specializing in latent semantic indexing (LSI) and with roots from Bell Research, SAIC, and the U.S. intelligence agencies, has been developing unstructured information analytics tools for over a decade. The company has been selling its tools to other software companies for inclusion in their products in the ediscovery, research and information access industries. In addition: With the growing popularity of cloud-based computing and software-as-a-service (SaaS) technologies, Content Analyst has decided to launch a new product aimed directly at the research community. Its Cerebrant product offering combines advanced search, categorization, and filtering technologies combined with third-party content such as medical research, news, and industry information to bring cost-effective research to a whole new level. Cerebrant also offers the ability to ingest proprietary content and combine that with third-party information. The combination of advanced technology, SaaS-based services, and value-added content is where a growing number of companies are beginning to offer solutions aimed directly at communities of researchers and knowledge workers. Organizations in the market looking out for a new or enhanced solution for their researchers may want to consider Cerebrant among other options available in the market. IN THIS INSIGHT This IDC Insight discusses the announcement and release of Content Analyst's new Cerebrant SaaS Discovery platform, combining cloud-based discovery and machine learning with third-party content sources as well as an organization's internal content to facilitate research and exploration. March 2015, IDC #255065

SITUATION OVERVIEW Content Analyst Company, developer of the CAAT machine learning text analytics engine, recently announced the general availability of Cerebrant a SaaS-based discovery platform designed to enable subject matter experts in any industry to gain insight into the growing collections of unstructured content. Cerebrant enables users to find the most relevant internal and external content and discover important, nonobvious relationships buried within massive collections of unstructured information. Cerebrant was designed to enable subject matter experts to create an online workbench with large collections of content from disparate sources and navigate and explore their content sets in a matter of hours. Users can identify and select disparate collections of public and premium unstructured content such as scientific research papers, industry reports, syndicated research, news, Wikipedia, and other internal and external repositories. Unlike alternative solutions, Cerebrant is not dependent upon Boolean search strings, exhaustive taxonomies, or word libraries since it leverages the power of the Content Analyst's proprietary LSIbased learning engine. Users simply take a selection of text ranging from a short phrase, sentence, paragraph to an entire document, and Cerebrant identifies and ranks the most conceptually related documents, articles, and terms across the selected content sets, ranging from tens of thousands to millions of text items. About Latent Semantic Indexing According to Wikipedia, latent semantic indexing is an indexing and retrieval method that uses a statistical technique to identify patterns in the relationships between terms and concepts contained in an unstructured collection of text. LSI's statistical approach is based on the common sense principle that words that are used in the same contexts tend to have similar meanings. A key capability of LSI is its ability to extract the conceptual content of a body of text by establishing associations between those terms that occur in similar contexts without the necessity of a taxonomy or other process that relates words and terms of similar meaning. LSI is able to correlate semantically related terms that are latent in a collection of text using statistical methods, and it was first applied to text at Bellcore Labs in the late 1980s. The method uncovers the underlying latent semantic structure in the usage of words in a body of text without really understanding it, and it can be used to extract the meaning of the text in response to user queries, commonly referred to as concept searches. Search queries on an LSI-based index of documents will return results that are conceptually similar in meaning to the search criteria even if the results don't share a specific word or words with the search criteria. The result of applying LSI is that it becomes easier to locate information that is related to the query, even though the exact terms that are being searched for are not found in the document result set. Figure 1 shows the example of documents automatically clustered into groups based on similarity of topic using LSI. 2015 IDC #255065 2

FIGURE 1 Content Analyst's Cerebrant Clustering Source: Content Analyst, 2015 Content Analyst's Cerebrant Cerebrant brings the power of the Content Analyst's proven CAAT text analytics engine that's used to process billions of text items in the U.S. Intelligence Community and dozens of ediscovery software products. Cerebrant delivers the power, security, and time to benefits of cloud computing through a simple browser-based interface, requiring little or no IT support to implement and use across targeted large-scale content repositories. Cerebrant is being offered by Content Analyst as a software-as-a-service offering to end-user customers. This is Content Analyst's first foray into the end-user market; all of the previous products have been developer and OEM technologies that other software companies integrate into their products. The initial market focus for Cerebrant is on the pharma market, and the introductory pricing for pharma is $7,500 for up to 10 users. The pharma version includes access to certain pharma content repositories such as PubMed Central and a collection of FDA guidance and drafts as well as pharma industry news. The starting cost for a version of standard Cerebrant without pharma-specific 2015 IDC #255065 3

content is $5,000 per month for 10 users as introductory pricing with the ability to ingest up to 5GB of user content with the content offered by Content Analyst. Both the pharma and the standard versions will come with general new content and the entire contents of Wikipedia as information sources for analysis. There will be a falling price curve for customers with more users, and the standard product will be multitenant with named user log-ins. However, organizations can contract with Content Analyst to provide standalone versions if necessary. The product uses Amazon's AWS as the compute service, providing Content Analyst with excellent uptime and capacity capabilities. FUTURE OUTLOOK Many organizations are beginning to offer content-based research and analytics services, and Content Analyst's Cerebrant research service is an excellent example of this trend. Competitors to Cerebrant are IHS' Goldfire research product as well as Northern Light's SinglePoint research service. The combination of Cerebrant's capabilities using LSI and the inclusion of third-party content with an initial price point of approximately $500 per user per month makes Content Analyst a company to consider when looking at the variety of research products and services now available to organizations. While Content Analyst is a new contender in the market for end-user services, it has a long and distinguished history as a technology vendor dealing with large amounts of unstructured information and text-based data sets. Organizations that are seeking to expand their capabilities in research should consider Cerebrant as one of several options as they assess which product or products will meet the growing requirements for better research tools. 2015 IDC #255065 4

About IDC International Data Corporation (IDC) is the premier global provider of market intelligence, advisory services, and events for the information technology, telecommunications and consumer technology markets. IDC helps IT professionals, business executives, and the investment community make factbased decisions on technology purchases and business strategy. More than 1,100 IDC analysts provide global, regional, and local expertise on technology and industry opportunities and trends in over 110 countries worldwide. For 50 years, IDC has provided strategic insights to help our clients achieve their key business objectives. IDC is a subsidiary of IDG, the world's leading technology media, research, and events company. Global Headquarters 5 Speen Street Framingham, MA 01701 USA 508.872.8200 Twitter: @IDC idc-insights-community.com www.idc.com Copyright Notice This IDC research document was published as part of an IDC continuous intelligence service, providing written research, analyst interactions, telebriefings, and conferences. Visit www.idc.com to learn more about IDC subscription and consulting services. To view a list of IDC offices worldwide, visit www.idc.com/offices. Please contact the IDC Hotline at 800.343.4952, ext. 7988 (or +1.508.988.7988) or sales@idc.com for information on applying the price of this document toward the purchase of an IDC service or for information on additional copies or Web rights. Copyright 2015 IDC. Reproduction is forbidden unless authorized. All rights reserved.