1 ediscovery and Search of Enterprise Data in the Cloud From Hype to Reality By John Patzakis & Eric Klotzko
2 ediscovery and Search of Enterprise Data in the Cloud: From Hype to Reality Despite the enormous cost benefits of the cloud, past concerns about the feasibility of ediscovery and general search across an organization s critical cloud-resident data prevented broader adoption. The absence of effective enterprise search and the inability to perform ediscovery on such data in-place without first exporting it en masse significantly negated many of the cloud s overall benefits. Central to the problem is the failure of ediscovery and enterprise search vendors to support the cloud with solutions that easily and remotely deploy and seamlessly operate in virtualized environments where the organization s data resides until now. This white paper examines the requirements of truly cloud-capable ediscovery software and benefits of breakthrough solutions that perform ediscovery as well as enterprise search of terabytes of cloud-based data where it resides and its implications for broader cloud adoption. ediscovery and Search Are Essential for Successful Enterprise Cloud Deployments This discussion focuses on Infrastructure as a Service (IaaS) cloud deployments (such as the Amazon cloud or Rackspace). IaaS provides the means for organizations to aggressively store and virtualize their enterprise data and software, thus spawning the same large data volumes and requiring the same critical search and ediscovery requirements as traditional enterprise environments. IaaS enables organizations to deploy whatever data and supporting software they need into the cloud, including software to search, collect and perform early case assessment of all data in the IaaS cloud so long as that software is cloud-capable. However, the absence of ediscovery and enterprise search software that can be deployed, operated and leveraged to capture the extensive and elastic processing benefits of IaaS cloud deployments presents a barrier to broader cloud adoption. All of the potential advantages of the cloud should apply to ediscovery as well. The aggregation of electronically stored information into common online repositories provides the potential ability to cheaply search, collect and perform early case assessment in place without having to first export the data from the cloud and in a much faster and cheaper manner than land-based ediscovery efforts. However, despite much marketing hype, the architecture and design of nearly all of the leading ediscovery and enterprise search solutions are highly ill-suited for such cloud deployments. About X1 & LTech X1 offers a game-changing enterprise search and ediscovery solution allowing organizations to quickly search, identify, and act upon distributed data, whether it resides within the enterprise or within virtualized environments. X1 unifies search across all critical business data such as local s & files, webmail, network file shares, SharePoint sites, MS Exchange, PST files and Google docs. X1 s very unique and compelling ability to install and fully operate within virtualized environments, including the AWS cloud, enables organizations to address their globally distributed data in an efficient, expedient, defensible and highly cost-effective manner. For more information, please visit LTech was founded in 2001 to provide enterprises with best-ofbreed technology solutions. Since then, we have evolved into a leading cloud technology integrator. As an early Google Enterprise Partner, we have helped hundreds of customers benefit from Google Apps and the Google Search Appliance. Our experience working with a diverse set of Customers - from startups to Fortune 500 companies - has made us a trusted name in enterprise class technology services. For more information, please visit
3 Defining Cloud-Capable ediscovery and Enterprise Search Software Many enterprise software vendors claim to support the cloud, but are simply re-branding their existing offerings. Data hosting, especially where the vendor s manual labor is routinely required to upload and process data, does not meet defined cloud standards. Neither does a process that exports data through APIs or other means out of its resident cloud environment to slowly migrate the cloud data to the vendor tools, instead of deploying the tools (and their processing power) to the data where it resides in the cloud. In order to truly support IaaS cloud deployments, ediscovery and enterprise search software must meet the following three core requirements: 1 Automated Installation and Virtualization The ediscovery and search solution must immediately and rapidly install, execute and efficiently operate in a virtualized environment without rigid hardware requirements or on-site physical access. This is impossible if the solution is fused to hardware appliances or otherwise requires a complex on-site installation process. As hardware appliance solutions by definition are not cloud deployable and with enterprise search installations often requiring many months of on-premise man hours to install and configure, whether many of these vendors will be able to support robust IaaS cloud deployments in the reasonably foreseeable future is a significant question. Conversely, a truly virtualized solution within the Amazon Web Services (AWS) public cloud can be installed remotely and on-demand by launching an Amazon Machine Image (AMI) than contains the object code and installer of the application. 2 On-Demand Self-Service In its definition of cloud computing, The National Institute of Standards and Technology (NIST) identifies on-demand self-service as an essential characteristic of the cloud where a consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with each service provider. Many hosted ediscovery services require shipping of data to the provider or extensive behind the scenes manual labor to load and configure the systems for data ingestion. Conversely, solutions that truly support cloud IaaS will spin up, ingest data and fully operate in an automated fashion without the need for manual on-premise labor for configuration or data import. 3 Rapid Elasticity NIST describes this characteristic as capabilities that scale rapidly outward and inward commensurate with demand. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be appropriated in any quantity at any time. This important benefit of cloud computing is accomplished by a parallelized software architecture designed to dynamically scale out over potentially several dozen virtualized servers to enable rapid ingestion, processing and analysis of data sets in that cloud environment. This capability would allow several terabytes of data to be indexed and processed within 2 to 4 hours on a highly automated basis at far less cost than non-cloud ediscovery efforts. However, many characteristics of leading ediscovery solutions fundamentality prevent their ability to support this core cloud requirement. Most ediscovery early case assessment solutions are developed and configured toward a monolithic processing schema designed to operate on a single expensive hardware apparatus. While recently spawning some bold marketing claims of high speeds and feeds, such architecture is very ill-suited to the cloud, which is powered by highly distributed processing across multitudes of servers. Additionally, many of the leading ediscovery and enterprise search solutions are tightly integrated with third party databases and other OEM technology that cannot be easily decoupled (and also present possible licensing constraints) making such elasticity physically and even legally impossible.
4 Breakthrough ediscovery and Enterprise Search Software for the Cloud X1 Rapid Discovery is a proven and now truly cloud-deployable ediscovery and enterprise search solution that enables corporations, law firms and consultants to quickly identify, search, and preserve distributed data in-place wherever it resides in the cloud or within the enterprise. With X1 Rapid Discovery, organizations can now quickly access, search, triage and preserve their data in their existing cloud environments, without having to first export that data, thereby transforming how organizations address the challenges of search, collection and analysis of cloud-based data. While other ediscovery products require migrating or even shipping your data to the vendor tools, X1 Rapid Discovery is a hardware independent software solution that uniquely installs and operates on-demand where your data currently resides. X1 Rapid Discovery provides these revolutionary capabilities due to its distinctive ability to install remotely in a matter of minutes without costly and rigid hardware requirements. Other ediscovery tools that are hardware appliance based or require days or even weeks of on-premise manual installation and configuration do not, by definition, truly support the cloud. X1 Rapid Discovery s easy to use web interface, federated search capabilities, matter centric work-flow, and high speed culling of up to terabytes of data transforms how legal and IT professionals conduct ediscovery and enterprise search in the cloud and throughout the enterprise. Case Study: ediscovery Case Study: Enterprise Search A Fortune 500 company maintains approximately 10 terabytes of enterprise data to file shares and SharePoint instances located in the Amazon Web Services (EC2) public cloud. In response to an urgent ediscovery preservation and collection requirement, the company launches X1 Rapid Discovery on demand within its instance of the AWS cloud where the data is located within minutes of the directive from legal. Using the scalable elasticity of the cloud, the 10 TB of data is then indexed within 24 hours and then in-house counsel and their staff are able, through a user friendly web interface, to perform iterative keyword searches and faceted filtering of the data to identify a small subset (2 GB of files) that are potentially relevant. The 2 gigabytes are then, with a few clicks, preserved in-place and made available for further attorney review. This entire exercise takes place in the company s instance of the cloud without exporting the data saving weeks of time and tens of thousands of dollars in ediscovery costs. A mid-sized pharmaceutical firm with a geographically disparate workforce locates 40 Terabytes of user files and archived pst s on file shares and SharePoint instances located in the Amazon Web Services (EC2) public cloud. The company also has a 15 TB cluster of similar data on premises on physical hardware and other 20GB cluster stored in their virtual data center. The company requires an effective and federated search capability of all three data clusters for over 1000 business professionals who need to search through and retrieve the data on a routine basis. The firm is able to easily install and operate X1 Rapid Discovery within their instance of the AWS cloud and the data is indexed in place providing excellent search and retrieval speeds. Additionally, separate instances of X1 Rapid Discovery are installed in the virtual data center as well as the on-premise cluster and the three index clusters are searched on a federated basis with excellent search and retrieval speeds with a rich user interface that includes a full fidelity preview pane.
5 Next steps for implementation: LTech as your premier cloud partner Why use LTech as your partner to implement X1 into your infrastructure? Simple: LTech provides a one stop shop cloud brokerage Our Valued Customers model that can help you implement any of your needs from X1 to AWS. LTech has the expertise needed to help you with any system you are currently working with to get you up and running the right way without hassle. LTech is focused on software engineering and cloud enablement for mid-market and enterprise customers. An AWS consulting partner since 2009, LTech strives to make sure our projects are completed on-time, on-budget, and with success for every party involved. As an AWS consulting partner, LTech provides infrastructure assessment and strategy formulation. LTech, in combination with AWS and X1, will work with your organization's regulatory compliance requirements and create a roadmap for development for migration of development, QA, production and other environments such as Disaster Recovery related to AWS. LTech and X1 together are problem solvers for your organization. With a simple and quick UI for your end users, rapid install on EC2 machines adjacent to your data and the ability to scale across multiple EC2 instances, X1 is a great solution for your organization. Let LTech's expertise in the business help you implement these products. Please contact LTech at for our exclusive Search Assessment Pilot Package. Let us assist you in understanding the most cost eﬀective strategy while testing X1 s capabilities. For more information visit x1.com, ltech.com or follow our social media sites: twitter.com/ltech and twitter.com/x1discovery. Our Accreditations
Convergence of Social, Mobile and Cloud: 7 Steps to Ensure Success June, 2013 Contents Executive Overview...4 Business Innovation & Transformation...5 Roadmap for Social, Mobile and Cloud Solutions...7
Thought Leadership Paper Cloud Computing in the Hedge Fund Industry About Eze Castle Integration Eze Castle Integration is the leading provider of IT solutions and private cloud services to more than 600
UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C. 20549 FORM 10-K (Mark One) È Annual Report pursuant to Section 13 or 15(d) of the Securities Exchange Act of 1934 For the fiscal year ended
White Paper May 2006 Applying Electronic Records Management in the Document Management Environment: An Integrated Approach Written by: Bud Porter-Roth Porter-Roth Associates Table of Contents Introduction
How AWS Pricing Works May 2015 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 15 Table of Contents Table of Contents... 2 Abstract... 3 Introduction...
A Requirement for Virtualization and Cloud Computing An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for FrontRange Solutions October 2012 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS
Migration Planning Kit Microsoft Windows Server 2003 This educational kit is intended for IT administrators, architects, and IT managers. The kit covers the reasons and process you should consider when
SAP Statement of Direction Business Intelligence Solutions Business Intelligence Solutions from SAP: Statement of Direction Table of Contents 3 Quick Facts 4 Driving Business Innovation Through Radical
Managing Information for E-discovery Readiness A Docula bs Wh i te Pa pe r 2009 Doculabs, 200 West Monroe Street, Suite 2050, Chicago, IL 60606 (312) 433-7793 email@example.com. Reproduction in whole or
Federal Server Core Configuration (FSCC) A high-level overview of the value and benefits of deploying a single, standard, enterprise-wide managed server environment A Microsoft U.S. Public Sector White
White Paper Information Security, Virtualization, and the Journey to the Cloud By Jon Oltsik August, 2010 This ESG White Paper was commissioned by Trend Micro and is distributed under license from ESG.
WHITEPAPER Microsoft SQL Server Databases Thrive in the Cloud Virtualizing Data-Intensive Applications for Page 2 Overview As more and more organizations embrace cloud computing to save money, increase
Special Publication 800-146 DRAFT Cloud Computing Synopsis and Recommendations Recommendations of the National Institute of Standards and Technology Lee Badger Tim Grance Robert Patt-Corner Jeff Voas NIST
www.pwc.com PwC Advisory Oracle practice 2012 How to drive innovation and business growth Leveraging emerging technology for sustainable growth 1 Heart of the matter Top growth driver today is innovation
HOW SAAS CHANGES AN ISV S BUSINESS A GUIDE FOR ISV LEADERS Sponsored by Microsoft Corporation Copyright 2012 Chappell & Associates Contents Understanding the Move to SaaS... 3 Assessing SaaS...3 Benefits
Plug Into The Cloud with Oracle Database 12c ORACLE WHITE PAPER DECEMBER 2014 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only,
Identity and access management as a driver for business growth February 2013 Identity and access management (IAM) systems are today used by the majority of European enterprises. Many of these are still
Securing Traditional and Cloud-Based Datacenters With Next-generation Firewalls February 2015 Table of Contents Executive Summary 3 Changing datacenter characteristics 4 Cloud computing depends on virtualization
TABLE OF CONTENTS Introduction... 3 The Importance of Triplestores... 4 Why Triplestores... 5 The Top 8 Things You Should Know When Considering a Triplestore... 9 Inferencing... 9 Integration with Text
Why Service Providers Need an NFV Platform Strategic White Paper Network Functions Virtualization (NFV) brings proven cloud computing and IT technologies into the networking domain to help service providers
SAP BusinessObjects Business Intelligence SAP BusinessObjects Business Intelligence 4.0 Solutions Empowering the Real-Time, Mobile, Social, and Global Enterprise SAP BusinessObjects Business Intelligence
IBM Software Thought Leadership White Paper February 2012 Automated, centralized management for enterprise servers Servers present unique management challenges but IBM Endpoint Manager is up to the job
Microsoft System Center 2012 R2 Why Microsoft? For Virtualizing & Managing SharePoint July 2014 v1.0 2014 Microsoft Corporation. All rights reserved. This document is provided as-is. Information and views
white paper Public or Private Cloud: The Choice is Yours Current Cloudy Situation Facing Businesses There is no debate that most businesses are adopting cloud services at a rapid pace. In fact, a recent
Best practice in the cloud: an introduction Using ITIL to seize the opportunities of the cloud and rise to its challenges Michael Nieves AXELOS.com White Paper April 2014 Contents 1 Introduction 3 2 The
CLOUD COMPUTING: IS YOUR COMPANY WEIGHING BOTH BENEFITS & RISKS? Toby Merrill CLOUD COMPUTING: IS YOUR COMPANY WEIGHING BOTH BENEFITS & RISKS? Toby Merrill Toby Merrill, Thomas Kang April 2014 Cloud computing
A REPORT BY HARVARD BUSINESS REVIEW ANALYTIC SERVICES How the Cloud Looks from the Top: Achieving Competitive Advantage In the Age of Cloud Computing Sponsored by Length of Time Using Cloud Computing Figure