Workflow Solutions for Very Large Workspaces



Similar documents
Review Manager Guide

Pre-Installation Guide

Searching Guide Version 8.0 December 11, 2013

Managing Relativity SQL log files

Starter Template. August 14, Version 9.2

Client SSL Integration Guide

Review Easy Guide for Administrators. Version 1.0

Mixed Authentication Setup

The Best Kept Secrets to Using Keyword Search Technologies

System Requirements. Version 8.2 November 23, For the most recent version of this document, visit our documentation website.

tunnelvision End User Manual version 3.8 discover DISCOVER MORE. REVIEW LESS.

System Requirements Version 8.0 July 25, 2013

Symantec ediscovery Platform, powered by Clearwell

Pre-Installation Guide

Service Bus Guide. July 4, Version 9.4

1. Digital Asset Management User Guide Digital Asset Management Concepts Working with digital assets Importing assets in

Viewpoint ediscovery Services

Veritas ediscovery Platform

Assisted Review Guide

Tips and Tricks SAGE ACCPAC INTELLIGENCE

Salesforce Classic Guide for iphone

Backup and Data Management Best Practices

Enhancing Document Review Efficiency with OmniX

Backup and Data Management Best Practices

Setting up the Oracle Warehouse Builder Project. Topics. Overview. Purpose

Are you ready for more efficient and effective ways to manage discovery?

OnBase 13. Unity Client Retrieval, Upload and Batch Processing. User Guide. IT Training (818)

SourceForge Enterprise Edition 4.4 SP1 User Guide

MAS 500 Intelligence Tips and Tricks Booklet Vol. 1

Business Insight Report Authoring Getting Started Guide

OnBase Quick Reference Guide

Jet Data Manager 2012 User Guide

CRM Global Search: Installation & Configuration

Vector HelpDesk - Administrator s Guide

What's New In DITA CMS 4.0

Administrator Guide v2.x 5/1/2015

Whitepaper: Enterprise Vault Discovery Accelerator and Clearwell A Comparison August 2012

P R O V I S I O N I N G O R A C L E H Y P E R I O N F I N A N C I A L M A N A G E M E N T

SOFT FLOW 2012 PRODUCT OVERVIEW

Sample Electronic Discovery Request for Proposal

Frequently Asked Questions Sage Pastel Intelligence Reporting

Easy Manage Helpdesk Guide version 5.4

QAD Enterprise Applications. Training Guide Demand Management 6.1 Technical Training

Version 4.61 or Later. Copyright 2013 Interactive Financial Solutions, Inc. All Rights Reserved. ProviderPro Network Administration Guide.

IBM Unstructured Data Identification and Management

Concordance End User Training Class

Software Tool House Inc.

Sage Intelligence Sage 100 ERP Intelligence Reporting Release Notes

Document Management Getting Started Guide

VEDATRAK CRM 2.1. User's Guide

Protecting Business Information With A SharePoint Data Governance Model. TITUS White Paper

Kofax Transformation Modules Generic Versus Specific Online Learning

Using LDAP Authentication in a PowerCenter Domain

How To Sync Between Quickbooks And Act

GP REPORTS VIEWER USER GUIDE

Cloud Services. Anti-Spam. Admin Guide

ediscovery 5.3 and Release Notes

DocAve 4.1 Backup User Guide

Kofax Export Connector for Microsoft SharePoint

simplify printing TX Guide v. 1. make IT simple Tricerat, Inc Cronridge Drive Suite 100 Owings Mills, MD , All rights Reserved

WatchDox Administrator's Guide. Application Version 3.7.5

Permissions Management for Site Admins

How To Use Nearpoint Ediscovery On A Pc Or Macbook

WHAT S NEW 4.5. FileAudit VERSION.

Admin Report Kit for Active Directory

Talend Open Studio for MDM. Getting Started Guide 6.0.0

Backup and Recovery of SAP Systems on Windows / SQL Server

Microsoft Visual Studio Integration Guide

1. Digital Asset Management User Guide Digital Asset Management Concepts Working with digital assets Importing assets in

Exercise Safe Commands and Audit Trail

isupport 15 Release Notes

There are numerous ways to access monitors:

Evaluator s Guide. PC-Duo Enterprise HelpDesk v5.0. Copyright 2006 Vector Networks Ltd and MetaQuest Software Inc. All rights reserved.

Global Search v 6.1 for Microsoft Dynamics CRM Online (2013 & 2015 versions)

Litigation Support. Learn How to Talk the Talk. solutions. Document management

PORTAL ADMINISTRATION

Workflow Templates Library

Administrating LAW PreDiscovery User Guide

What's New in SAS Data Management

v7.1 SP1 Release Notes

DocAve 6 Service Pack 1 Administrator

Google Earth Connections for ArchiCAD 15. Product Manual

CTERA Agent for Mac OS-X

The Win32 Network Management APIs

IBM Unica Leads Version 8 Release 6 May 25, User Guide

IBM ediscovery Identification and Collection

ORACLE BUSINESS INTELLIGENCE WORKSHOP

Introduction. Connection security

Operating System Installation Guide

Perforce Defect Tracking Gateway Guide

Third-party software is copyrighted and licensed from Kofax s suppliers. This product is protected by U.S. Patent No. 5,159,667.

BlackShield ID Agent for Remote Web Workplace

Change Color for Export from Light Green to Orange when it Completes with Errors (31297)

Enhanced Imaging Options for Client Profiles for Windows

Transcription:

Workflow Solutions for Very Large Workspaces February 3, 2016 - Version 9 & 9.1 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - For the most recent version of this document, visit our documentation website.

Table of Contents 1 Overview 3 2 Importing data 3 3 Fields 3 4 Folders 4 5 Index management 4 6 Layouts 5 7 Mass operations 5 8 Persistent highlight sets 6 9 Production 6 10 Exporting data 6 11 Reviewer statistics script 7 12 Searching 7 13 Search terms reports 8 14 Security permissions 8 15 Snapshot auditing 9 16 Views 9 Relativity Workflow Solutions for Very Large Workspaces - 2

1 Overview Very large Relativity workspaces (VLRWs) require a great deal of time and effort to maintain. As a result, it s important to develop a plan to accommodate the workflow before your workspace reaches VLRW status. Making workflow changes after this point is inefficient and can be very time consuming. Use the following best practices to plan your VLRW workflow and improve overall performance. These guidelines can also be a helpful starting point for educating Relativity users. This guide also outlines the best practices for setting up and working in very large Relativity workspaces. 2 Importing data Importing large amounts of data can be time consuming. Consider the following recommendations: Break the load file into smaller document counts. Use multiple computers and instances to load files. Load the control numbers and folder paths into the workspace to create the records in SQL. Load text as external text files. Break each subgroup load file into reasonable sizes, such as 250,000 records per load file. Be aware of the kinds of data that you import. Importing the extracted text from large binary files, such as WAV, MPEG, and Access database file types, cause excessive load. Don't attempt to load these file types. Note: To more easily verify data and recognize errors, load in smaller batches of data and verify each batch as you go. 3 Fields Fields are not endless commodities. Too many or overly bloated fields can dramatically impact database performance. Use the following guidelines to improve field efficiency: Keep fixed-length text fields to the minimum size needed. Set fixed-length text fields to no more than 400 characters. Store choice data in single-choice or multiple-choice fields if possible. Use separate objects to store repetitive content when possible. Set up fields as Unicode in advance because the system must re-index a field if the change is made following data load. Only include fields in the text index when it s necessary to do so. Relativity Workflow Solutions for Very Large Workspaces - 3

4 Folders You can use folders to organize documents to reflect their original storage. However, in a large workspace, the use of folders can cause performance issues. Folders perform searches in the background to display documents. If searches begin to take longer as more documents are loaded, consider the following options: Carefully place an index on the custom fields may assist in pulling up documents as the folder size grows. Combine folders and then use custom views to achieve similar organization. Create a multiple-choice field and populate the field with the folder path. This will create an entry in the field tree to allow for organizing the documents by the original folder path. This also allows users to folder documents in multiple locations. 5 Index management Optimized indexing requires some knowledge of your data. Scrubbing your data before indexing saves time when creating an index and returning search results. Consider the following when creating an index: Remove file types that have no searchable content, such as system or program files. Use a separate index for searching database files and large Excel files. Note: Even if your database has only a small number of these files, creating an index without them will improve searching speed. Set up multiple dtsearch indexes, including one with a smaller document set based on one or more of the following criteria: o o o date ranges custodians text size (extracted or OCR text) Small (< 2 MB) Medium (> 2 MB and < 10 MB) Large (> 10 MB and < 25 MB) Very large (> 25 MB) Communicate with your cases team to create a search strategy for the case. Some cases have distinct words or terms that might warrant changing the default settings of an index. Note that editing these settings may affect search results. Remove numbers from the dtsearch index alphabet file if you re only searching for words this reduces the size of the index and disables numeric range searching. Set the dtsearch index to recognize and/or ignore words, characters, and digits as necessary. While these settings don't necessarily impact performance, applying them ahead of time avoids having to rebuild the index later. Relativity Workflow Solutions for Very Large Workspaces - 4

For example, if a company name appears many times throughout a document set and you don t intend to search for it, add this name to the noise words list. Configuring these settings before building a large index prevents you from having to rebuild the entire index later to include these types of characters. Note: Be sure to communicate any changes to alphabet files. Searching against multiple dtsearch indexes that use different alphabet files can result in different results, even when running the same search on identical contents. Enable dtsearch indexes to automatically recognize dates, email addresses, and credit card numbers only when necessary. Enabling this setting increases build time. Consider a using pair of dtsearch indexes when adding new data. You can update one index in the background and then swap out the outdated index with the current one. 6 Layouts You can use layouts, along with views, to improve workflow efficiency. Identify the type of information each reviewer group needs to code documents. (For example, group may be working on privilege logs, prepping for depositions, and so on.). You can then use group security permissions to adjust layout visibility as necessary. When planning layouts, think about the overall lifecycle of a document. For example, a review workflow may include the following: First Pass Review Quality Assurance Production Privilege Review Deposition Prep Trial Exhibits Some users have many layouts that they need during the course of a document review. You can use separators (-----------) to help organize layouts and build the workflow. Issue coding layouts can get long and cumbersome over time, requiring users to scroll to see all available choices. To improve the layout s usability, change the layout field display from check box list to popup picker. This unclutters the layout space by hiding the choices and presenting them only when necessary. Users can apply filtering to popup picker views to find choices. 7 Mass operations Mass operations temporarily lock the document table while executing. Depending on the number of record and users in the system, the table may lock for an extended period of time and frustrate users trying to perform standard edits. If necessary, carry out mass operations at night or at an offpeak time. Relativity Workflow Solutions for Very Large Workspaces - 5

8 Persistent highlight sets Persistent highlight sets provide a valuable way to identify terms within the document viewer. Although the size of a workspace doesn't affect how persistent highlighting works, use these guidelines to improve usability in large workspaces: Enter multiple terms on separate lines. Enter terms exactly as they appear in the document. Don't use quotation marks or connectors. If you enter variations of a term or phrase and the variations include multiple words, list the multi-term variations first. The regular expressions used by persistent highlighting look for and find a term and then move to the next term in the set. For example, you should list the terms United, United States, and United States of America, in the following order: o United States of America o United States o United Don't use special characters, quotation marks, or other punctuation. Don't use dtsearch syntax, including operators such as AND and OR. Identify and remove terms with large hit counts. List variations of a term first and the root term last. Use Highlight Fields and Search Terms Reports to generate persistent highlight sets. 9 Production Multiple productions often occur in very large workspaces. Users usually create multiple Bates fields to track various productions, but each field takes up valuable space in the document table. To better manage productions, use the Production Tracker module created by kcura Custom Development. This module uses a separate object to hold each set of Bates numbers for various production groups. 10 Exporting data Exporting large productions takes a great deal of time. Create saved searches to break up the production into roughly equal amounts, such as approximately 250,000 pages each. On multiple machines, export each saved search. Use the production images as the default. For example, the following process exports 250,000 records with approximately 2.5 million pages to a network share folder: 1. Create saved searches of the production so that each has approximately 500,000 pages. 2. Label each volume in sequential numeric order. 3. Modify each image load file to show the top folder as the production volume. 4. Combine each load file into one complete load file. 5. Export each saved search of images using a single machine. 6. Export any native files on a single machine, selecting the Beg Bates field only. 7. After the exports, create a fixed-length text field called Prod Native Path. Relativity Workflow Solutions for Very Large Workspaces - 6

8. Use the RelativityDesktop Client to overlay the exported load file from step 6 onto the Prod Native Path field. 9. Export the text files for each record using a single machine, selecting the Beg Bates field only. 10. After exporting the text files, create a fixed-length text field called Prod Text Path. 11. Use the RelativityDesktop Client to overlay the exported load file from step 9 onto the Prod Text Path field. 12. After loading the information for Prod Native Path and Prod Text Path, export the metadata for all the records. Depending on the speed of your environment, this process may assist in the export process. Relativity Admins have all necessary permissions to perform the following script-related actions: View Run Preview (locked and unlocked scripts) Create/Write Edit Link Import Applications (See the Applications Guide.) Users should not frequently run custom scripts that can have a negative impact on the system, including through SSMS access. Avoid using SSMS and the Admin Script functionality as much as possible until actions are audited and certain Relativity controls, including timeout values, are in place. To prevent scripts from negatively impacting your environment: Limit admin script access for a given workspace to one or two people. Assign an individual to review the impact of custom script executions on the system. Once you've identified the scripts safe for execution, you can make them available to users through the workspace tabs. 11 Reviewer statistics script Reviewer Statistics is a popular script in the Relativity script library that reports on the number of documents reviewed over a certain period of time. It can take a while to complete in workspaces with large audit record tables. Instead of trying to run this report regularly during review, we suggest that you schedule it to run each night after maintenance has completed. You can then have the results emailed to one or more recipients. 12 Searching Executing searches can be very resource intensive. Follow these guidelines to reduce the resources used for searching. Don't use the is like search operator on Fixed Length and Long Text fields. Using is like runs a resource-intensive bit-by-bit search rather than using the index. Relativity Workflow Solutions for Very Large Workspaces - 7

Avoid using multiple layers of nesting applied in a search. Don t use wildcards in the front or in the middle of terms. Instead use the dictionary to find multiple forms of words and paste all of them into the search box. Avoid searching on a number of unnecessary search terms. Instead, use the dtsearch Dictionary s fuzzy and stemming searches identify the best words to search. Avoid searching on the entire dataset. Instead limit the search to subsets of data, such as certain date ranges or custodians. Use filters as an alternative approach to searching. 13 Search terms reports Search terms reports (STR) simplify the process of identifying documents that contain a specific group of keywords. Instead of running complicated queries, you can use the search terms report to enter a list of terms or phrases and then generate a report listing their frequencies within a set of documents. As workspaces grow in size, a search terms report takes longer to run if the individual string is too complicated. In large workspaces, avoid using nested proximity searches or wildcards in search terms reports. Nested proximity searches run slowly in large dtsearch indexes because the search string takes longer to search the index. Using wildcards either before a term, after a term, or especially on both sides of a term, causes the search terms report to take much more time to complete. Instead of using wildcards, use the dtsearch Dictionary to identify variations of a term. Combining wildcards and nested proximity searching may create overly complicated searches. This adds a significant amount of time to running a query and sometimes prevents it from completing. For example, a very complicated search may appear as follows: (((Term1* or Term2*) w/20 Term3*) and Term4*) and (Term5* w/20 Term6*) 14 Security permissions Large workspaces usually require multiple security groups. You should organize documents and define security groups to assist with review workflow. Start with a baseline security group for each main role. For example, you may need to create a baseline group for each of the following roles: System administrator Operations administrator Operations technician Project manager Project specialist Case administrator Case review Case technician Contract reviewer Experts Relativity Workflow Solutions for Very Large Workspaces - 8

Set security permissions from the baseline, giving each user group incremental security rights as necessary. For example, three different user groups may need permissions for the following tasks: Contract review Contract review QA Access to QA layouts, fields, choices Contract review Privilege Access to Privilege Log layouts, fields, choices Contract review Dep Prep Access to Deposition Prep layouts, fields, choices, mass actions 15 Snapshot auditing Enabling Snapshot Auditing On Delete increases the database size. 16 Views When it comes to the reviewer interface, focus on workflow. Create views to filter document collections to necessary lists. Using views, you can manage the types of documents that presented to a group. Use group security permissions to turn views toggle view visibility as necessary. Analyze each group participating in the review, and map out its exact needs. For example, the First Pass group only needs to review batches. The Second Pass group needs to both review and quality check documents, and the Experts group only needs to see Production documents. Implementing a plan that coordinates views for review groups improves workspace management efficiency. In addition, consider the following best practices for views: Views should contain only the fields necessary for the review task. Avoid adding long text fields to views. Using nested saved searches as view conditions slows down loading. Relativity Workflow Solutions for Very Large Workspaces - 9

Proprietary Rights This documentation ( Documentation ) and the software to which it relates ( Software ) belongs to kcura LLC and/or kcura s third party software vendors. kcura grants written license agreements which contain restrictions. All parties accessing the Documentation or Software must: respect proprietary rights of kcura and third parties; comply with your organization s license agreement, including but not limited to license restrictions on use, copying, modifications, reverse engineering, and derivative products; and refrain from any misuse or misappropriation of this Documentation or Software in whole or in part. The Software and Documentation is protected by the Copyright Act of 1976, as amended, and the Software code is protected by the Illinois Trade Secrets Act. Violations can involve substantial civil liabilities, exemplary damages, and criminal penalties, including fines and possible imprisonment. 2016. kcura LLC. All rights reserved. Relativity and kcura are registered trademarks of kcura LLC. Relativity Workflow Solutions for Very Large Workspaces - 10