Best Practices: eDiscovery Search




Best Practices: eDiscovery Search
Improve Speed and Accuracy of Reviews & Productions with the Latest Tools
February 27, 2014
Karsten Weber, Principal, Lexbe LC

eDiscovery Webinar Series: Info & Future
- Takes place monthly
- Covers a variety of relevant eDiscovery topics
- Next month: Legal Timelines and Early Case Assessment
- Presentations available for download by registrants

eDiscovery Webinar Series: Questions & Technical Issues
If you have any questions or technical issues, please e-mail them to webinars@lexbe.com. Questions will be forwarded to Karsten and answered during the webinar, or via e-mail if we run out of time.

eDiscovery Webinar Series: Karsten Weber Bio
Current
- Principal of Lexbe LC
- Principal Architect of Lexbe eDiscovery Suites and Lexbe eDiscovery Services
Prior Experience
- Consulting Expert, Lumin Expert Group
- Director of Software, nline Corporation
- Software Engineering Manager, KLA-Tencor
Education
- MBA, University of Texas
- M.S. Engineering, Danish Technical University
Contact
- Karsten Weber, 512-686-3469, karsten@lexbe.com

Use of Keyword Search in Discovery
- Early Stage Culling - Reduce the amount of ESI to be reviewed by using keywords to cull document collections.
- Keyword-Based Responsive & Privilege Review - Construct search queries to return documents that are likely to be responsive, privileged, or confidential. Search by name and email address of counsel, and by "privilege", "work product", "confidential", and related keywords.
- ID Documents for Depo Prep - Find and assign key documents related to specific case participants to prepare for depositions. Search by the email addresses, names, and nicknames the deponent used, and by important issues associated with the deponent.
- ID of Key Docs for Trial - Find and mark key case documents. Code documents that will be needed for trial.

Pros of Keyword Searching
- Fast - Keyword search is very fast compared with other document search methodologies.
- Inexpensive - Good results can be obtained at little cost compared with manual review or other computer-assisted methodologies.
- Quality - Search can deliver high-quality results, particularly if keyword terms are carefully developed and tested.
- Avoids Manual Review Errors/Inconsistencies - Search results are computer generated, and so avoid known human review errors that can result from fatigue, inadequate training, lack of focus, etc.

Cons of Keyword Searching
- Search Can Be Over- or Under-Inclusive - Search terms can bring back too many junk results (false positives) or miss good results (false negatives).
- Difficulty of Creating Good Search Terms - Constructing good search terms takes design time, testing, iterations, and analysis.
- Non-Searchable Text - Search results can only be as good as the underlying searchable text. For a variety of reasons, ESI collections and review tools can miss text that a human reviewer might catch.
- Some File Types Can't Be Indexed - There is little consistency in which files can be indexed across litigation databases.
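The over- and under-inclusiveness described above is commonly measured as precision and recall against a set of documents a reviewer has already judged. The following is a minimal sketch under that assumption; the function name and document IDs are hypothetical and not taken from the webinar.

```python
def precision_recall(returned, relevant):
    """Compare search hits against reviewer judgments.

    returned: set of doc IDs the keyword search brought back
    relevant: set of doc IDs a reviewer judged to be good results
    """
    true_positives = returned & relevant
    false_positives = returned - relevant   # junk results (over-inclusive)
    false_negatives = relevant - returned   # missed results (under-inclusive)
    precision = len(true_positives) / len(returned) if returned else 0.0
    recall = len(true_positives) / len(relevant) if relevant else 0.0
    return precision, recall, false_positives, false_negatives


# Hypothetical IDs, for illustration only.
returned = {"DOC-001", "DOC-002", "DOC-003", "DOC-004"}
relevant = {"DOC-002", "DOC-003", "DOC-005"}
precision, recall, false_pos, false_neg = precision_recall(returned, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")
print("false positives:", sorted(false_pos))
print("false negatives:", sorted(false_neg))
```

Low precision points to a query that needs limiters; low recall points to one that needs expanders.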

Construct Quality Searches
- Start with Request for Production - Translate the demands of the RFP into a keyword search strategy.
- Interview Custodians - Ask key case participants / data custodians about their ESI. Use their insights and their terminology to find obscure key documents.
- Include Jargon - Seek out terms specific to the industry, company, or company sub-culture that you may not be familiar with.
- Include Misspellings - Include misspelled versions of keywords in your search string (or use fuzzy search settings or Boolean wildcards) to account for emails and other documents with typos.

Use Search Expanders
Search expanders enable easy expansion of a query to reduce false negatives.
- Concept - Thesaurus lookup and synonym search. Conceptually expands the search query.
- Stemming - Expands the query to include derivative terms associated with the search keywords.
- Fuzzy - Allows the insertion, deletion, or substitution of a character in the search query to account for search errors, spelling errors within the document, and potential OCR errors.
- Phonetic - Returns results that sound similar to the search query.
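Of the four expanders, phonetic search is the one without a worked example in the slides. One well-known way to compare how words sound is the American Soundex code; the sketch below uses it purely as an illustration and makes no claim about which phonetic algorithm any particular review platform applies.

```python
# Soundex digit groups: consonants that sound alike share a digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word):
    """Return the 4-character American Soundex code for a word."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    previous = CODES.get(letters[0], "")
    for letter in letters[1:]:
        if letter in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(letter, "")  # vowels map to "" and reset `previous`
        if code and code != previous:
            result += code
        previous = code
    return (result + "000")[:4]


def sounds_like(query, word):
    """Phonetic match: both words reduce to the same Soundex code."""
    return soundex(query) == soundex(word)


print(soundex("Fastow"), sounds_like("Fastow", "Fasto"))
```

Because many unrelated words share a code, a phonetic expander tends to trade fewer false negatives for more false positives.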

Use Search Expanders: Concept Search Example
Trade = Swap = quid pro quo
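A concept expander can be pictured as a thesaurus lookup applied to each query term before searching. This is a minimal sketch; the one-entry thesaurus simply mirrors the slide's example, and the function names are hypothetical.

```python
# Hypothetical thesaurus; a real one would be far larger and tool-specific.
THESAURUS = {"trade": ["swap", "quid pro quo"]}


def expand_concepts(terms, thesaurus=THESAURUS):
    """Return the query terms plus their synonyms, without duplicates."""
    expanded = []
    for term in terms:
        key = term.lower()
        for candidate in [key] + thesaurus.get(key, []):
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


def matches_any(text, phrases):
    """True if any phrase occurs in the text (case-insensitive substring)."""
    lowered = text.lower()
    return any(phrase in lowered for phrase in phrases)


query = expand_concepts(["Trade"])
print(query)
print(matches_any("This was a quid pro quo arrangement.", query))
```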

Use Search Expanders: Stemming Search Example
Trade = Trading = Trades
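Stemming can be illustrated with a deliberately crude suffix stripper. Production search engines use far more careful stemmers, so the suffix list and length threshold below are assumptions chosen only to reproduce the slide's example.

```python
import re

# Toy suffix list, checked in order; an assumption for illustration only.
SUFFIXES = ("ing", "es", "ed", "s", "e")


def crude_stem(word):
    """Strip one common suffix, keeping at least three leading letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def stem_matches(query, text):
    """Return the words in `text` that share a crude stem with `query`."""
    target = crude_stem(query)
    return [w for w in re.findall(r"[A-Za-z]+", text) if crude_stem(w) == target]


text = "Trading resumed after the trades cleared; no trade was reversed."
print(stem_matches("trade", text))
```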

Use Search Expanders: Fuzzy Search Example (Misspelling)
Fastow = Fastaw = Fasto
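Fuzzy matching of this kind is typically expressed as an edit distance: the number of single-character insertions, deletions, or substitutions needed to turn one word into another. The sketch below uses the standard Levenshtein distance; the one-edit threshold is an assumption, since tools usually expose the tolerance as a setting.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum single-character edits from a to b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def fuzzy_match(query, word, max_edits=1):
    """True if `word` is within `max_edits` edits of `query`."""
    return edit_distance(query.lower(), word.lower()) <= max_edits


for candidate in ("Fastaw", "Fasto", "Faster"):
    print(candidate, fuzzy_match("Fastow", candidate))
```

Raising `max_edits` catches more typos and OCR errors but also pulls in more unrelated words.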

Use Search Expanders: Boolean Search
Basic Boolean operators:
- AND: returns results that include both terms
- OR: returns results that include at least one of a list of terms
- NOT: excludes terms you don't want
- ( ): can be used to separate OR statements from the rest of the Boolean string
- PRE/n: the first search term must precede the second term by no more than n words
- Wildcard characters: * replaces a letter in your search term; ! allows for a stemming search within a Boolean query
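To make the operators concrete, the sketch below evaluates them against a single tokenized document, using Python's own `and`, `or`, and `not` in place of a query parser. The helper names and sample sentence are hypothetical, and counting distance as the difference between word positions is an assumption; individual tools may count differently.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def has(tokens, term):
    """True if the term appears as a whole word."""
    return term.lower() in tokens


def pre_n(tokens, first, second, n):
    """PRE/n: `first` occurs before `second`, no more than n words apart."""
    first, second = first.lower(), second.lower()
    first_positions = [i for i, t in enumerate(tokens) if t == first]
    second_positions = [i for i, t in enumerate(tokens) if t == second]
    return any(0 < j - i <= n for i in first_positions for j in second_positions)


tokens = tokenize("Fastow approved the Chewco transfer, not the Raptor hedge.")

# (fastow OR kopper) AND chewco AND NOT jedi
hit = (has(tokens, "fastow") or has(tokens, "kopper")) \
    and has(tokens, "chewco") and not has(tokens, "jedi")
print(hit)
print(pre_n(tokens, "fastow", "chewco", 5))   # order matters for PRE/n
print(pre_n(tokens, "chewco", "fastow", 5))
```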

Use Search Limiters
Search limiters reduce false positives (noise).
- Filter Out Unneeded File Types - Some file types are unlikely to lead to useful information and can be excluded.
- Use Boolean Modifiers to Limit Overly Expansive Searches - Boolean modifiers can reduce the number of documents returned from a query while increasing the relevance of those files. Exclude certain words or combinations, and specify word order.
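File-type filtering can be as simple as dropping files by extension before they are searched. In this sketch the excluded extensions and file names are assumptions for illustration; which types are safe to exclude depends on the case.

```python
from pathlib import PurePath

# Hypothetical exclusion list; agree on the real one for each matter.
EXCLUDED_EXTENSIONS = {".exe", ".dll", ".tmp"}


def filter_file_types(paths, excluded=EXCLUDED_EXTENSIONS):
    """Keep only files whose extension is not on the exclusion list."""
    return [p for p in paths if PurePath(p).suffix.lower() not in excluded]


collection = ["memo.docx", "setup.EXE", "ledger.xlsx", "cache.tmp"]
print(filter_file_types(collection))
```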

Use Search Limiters: Boolean Search Example
Lay! w/25 Chewco
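A query of this shape is usually read as "a word built on Lay within 25 words of Chewco". The sketch below works under two simplifying assumptions: the `!` expander is approximated by a plain prefix match, and distance is the difference between word positions. Neither is confirmed as the behavior of any specific search engine.

```python
import re


def within(text, root, term, n):
    """Approximate `root! w/n term`: a word starting with `root` occurs
    no more than n word positions from `term`, in either order."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    root_positions = [i for i, t in enumerate(tokens) if t.startswith(root.lower())]
    term_positions = [i for i, t in enumerate(tokens) if t == term.lower()]
    return any(abs(i - j) <= n for i in root_positions for j in term_positions)


near = "Lay's memo mentioned the Chewco partnership."
far = "Lay " + "filler " * 30 + "Chewco"
print(within(near, "lay", "chewco", 25))
print(within(far, "lay", "chewco", 25))
```

A bare prefix match on "lay" would also pick up words such as "layoff" or "layer", which is one reason results need to be sampled and tested.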

Test Keyword Searching Results
- Look at Results Returned - Searching without review and testing may produce low-quality results.
- Sample & Look for Ways to Limit Search - Create new queries that reduce false positives.
- Find New Keywords - Viewing search results may prompt the discovery of additional keywords that could be used to expand or reduce search queries.
- Fuzzy and Concept Search - New keywords can be found by searching and returning synonyms and near-identical words.
Keyword searching becomes an iterative process.
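Sampling the hits can be sketched as drawing a reproducible random subset for manual review and then measuring how much of it was junk. The function names, sample size, seed, and document IDs below are all hypothetical; choosing a statistically meaningful sample size is outside the scope of this sketch.

```python
import random


def draw_sample(hit_ids, size, seed=0):
    """Pick a reproducible random sample of search hits for manual review."""
    rng = random.Random(seed)          # fixed seed so the sample can be re-drawn
    ids = sorted(hit_ids)              # stable order regardless of input type
    return rng.sample(ids, min(size, len(ids)))


def estimated_false_positive_rate(review_labels):
    """review_labels maps doc ID -> True if the reviewer found it responsive."""
    if not review_labels:
        return 0.0
    junk = sum(1 for responsive in review_labels.values() if not responsive)
    return junk / len(review_labels)


hits = [f"DOC-{i:03d}" for i in range(100)]
sample = draw_sample(hits, 10)
print(sample)

# Reviewer decisions on a (hypothetical) reviewed sample.
labels = {"DOC-004": True, "DOC-017": False, "DOC-052": False, "DOC-090": True}
print(estimated_false_positive_rate(labels))
```

If the estimated rate is high, tighten the query with limiters and sample again; that loop is the iterative process described above.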

Common Indexing Methods
There are traditionally two types of search indices:
- Imaged and OCRed - The search text comes from the files after they have been converted to TIFF / PDF.
- Extracted Text - The search text comes from text extracted from the original file.
Both approaches have significant limitations.

Search Index Based on OCR of Imaged Files
- Description - Native files (email, attachments, spreadsheets, etc.) are converted to a paginated image file, and then OCR is applied to make the text searchable (e.g., a TIFF production with no extracted text).
- How? - Conversion software uses a print-driver approach to virtually image what would have been physically printed.
- Data Not Indexed - Includes headers/footers/notes, comments and revisions, highlighted text, hidden sheets or text, print selections, and applied filters.

Search Index Based on OCR of Imaged Files: Example
[Figure: how the document appears natively]
OCR-Based Index Will Include: Chewco 2000 Pro Forma Sheet Body Text

Search Index Based on Native Extraction
- Description - Available text from native files (email, attachments, spreadsheets, etc.) is extracted and indexed by the search engine using text parsing (e.g., pure native review).
- How? - Only available text is used. No OCR is applied.
- Data Not Indexed - Non-text files (e.g., scanned documents) and embedded text, objects, or visuals will not be indexed. Different native extraction methods can also vary in their ability to recognize certain types of text.

Search Index Based on Native Extraction: Example
[Figure: how the document appears natively]
Native Extraction Index Will Include: Page 1/12 Chewco 2000 Pro Forma Balance Statement Sheet [S1: CRITICAL ENRON EVIDENCE] Page 1/12

Dual Index
The Lexbe search engine indexes both the text extracted from native files (email, attachments, spreadsheets, etc.) and a paginated version of each native file that has been converted to PDF or TIFF and OCRed. This is the most comprehensive approach and minimizes the potential for lost and unsearchable data.

Benefits of Dual Index Approach

Index Method        Captures Embedded Text   Captures Text Excluded From Print   Captures Hidden Text
Imaged/OCR          Yes                      No                                  No
Native Extraction   No                       Yes                                 Yes
Lexbe Dual Index    Yes                      Yes                                 Yes
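The general idea of a dual index can be sketched as an inverted index fed from two text sources per document, so that a term found in either source makes the document searchable. This is an illustration of the concept only, not Lexbe's implementation; the data layout, function names, and sample documents are assumptions.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and return its distinct word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_dual_index(documents):
    """documents maps doc ID -> {"extracted": str, "ocr": str}.

    Returns an inverted index: token -> set of doc IDs containing it
    in either the extracted text or the OCR text.
    """
    index = defaultdict(set)
    for doc_id, sources in documents.items():
        for source in ("extracted", "ocr"):
            for token in tokenize(sources.get(source, "")):
                index[token].add(doc_id)
    return index


documents = {
    # Spreadsheet: the hidden comment exists only in the extracted text.
    "sheet.xls": {"extracted": "Chewco 2000 Pro Forma hidden comment critical",
                  "ocr": "Chewco 2000 Pro Forma"},
    # Scanned page: no extractable text, so only OCR contributes.
    "scan.pdf": {"extracted": "", "ocr": "Signed agreement with Chewco"},
}
index = build_dual_index(documents)
print(sorted(index["chewco"]))
print(sorted(index["critical"]))
print(sorted(index["signed"]))
```

With only one source indexed, either the hidden comment or the scanned page would drop out of the search results.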


Thank You for Attending
About Lexbe and Contact Information
- Phone (Toll Free): (800) 401-7809
- Webinar Questions: webinars@lexbe.com
- Next Month's Webinar: Legal Timelines and Early Case Assessment
Lexbe is an eDiscovery software and services provider based in Austin, TX.