dtsearch Frequently Asked Questions Version 3



Similar documents
Frequently Asked Questions on character sets and languages in MT and MX free format fields

set in Options). Returns the cursor to its position prior to the Correct command.

DTRADE 2 Release Notes

Symbols in subject lines. An in-depth look at symbols

HTML Codes - Characters and symbols

Xerox Standard Accounting Import/Export User Information Customer Tip

Searching Guide Version 8.0 December 11, 2013

Search Requests (Overview)

The Best Kept Secrets to Using Keyword Search Technologies

TABLE OF CONTENTS. Page 2

Searching your Archive in Outlook (Normal)

dtsearch Regular Expressions

BCCC Library. 2. Spacing-. Click the Home tab and then click the little arrow in the Paragraph group.

X1 Professional Client

Administration Guide. McAfee SaaS Archiving

Address Update Web Application. User Manual. Defense Manpower Data Center Department of Defense

Chart of ASCII Codes for SEVIS Name Fields

Archiving Administrator Guide

Ask your teacher about any which you aren t sure of, especially any differences.

URL encoding uses hex code prefixed by %. Quoted Printable encoding uses hex code prefixed by =.

ASCII CODES WITH GREEK CHARACTERS

Message Archiving User Guide

Sanitary Sewer Overflow/Facility Bypass Application

ASCII control characters (character code 0-31)

T201 - SEARCHING FOR MONEY

shorewest.net Document Management

dtsearch Desktop dtsearch Network

Person Inquiry Screen

novdocx (en) 11 December 2007 VIDistribution Lists, Groups, and Organizational Roles

Lab 9 Access PreLab Copy the prelab folder, Lab09 PreLab9_Access_intro

SJC Password Self-Service System FAQ 2012

UW- Green Bay QuickBooks Accounts Receivable User Manual

Microsoft Access 2010 Part 1: Introduction to Access

1. Manuscript Formatting

Virtual Integrated Design Getting started with RS232 Hex Com Tool v6.0

APPENDIX 1 BRAILLE SYMBOLS AND INDICATORS. Braille Characters Letters Numbers Contractions Indicators Punctuation and Symbols

Vodafone multitxt faqs

Managing Your Network Password Using MyPassword

ANZ TRANSACTIVE FILE FORMATS WEB ONLY Page 1 of 118

Access I Tables, Queries, Forms, Reports. Lourdes Day, Technology Specialist, FDLRS Sunrise

Importing Contacts to Outlook

Quickstart Guide. Connect your microphone. Step 1: Install Dragon

Import Filter Editor User s Guide

USER GUIDE FOR L ANNÉE PHILOLOGIQUE ON THE INTERNET

Opening a Database in Avery DesignPro 4.0 using ODBC

Training Module 4: Document Management

12 SECOND QUARTER CLASS ASSIGNMENTS

Chapter 5 - Ethernet Setup

3 Data Properties and Validation Rules

Primo Online End User Help. Version 4.x

Searching in CURA. mindscope Staffing and Recruiting Software

Approach to E-Discovery Boolean Search

Windows, Menus, and Universal Document Shortcuts

Business Entity. Name Regulations & Additional Statutory Requirements and Restrictions

ScienceDirect. Quick Reference Guide

PCL PC -8. PCL Symbol Se t: 12G Unicode glyph correspondence table s.

Creating Codes with Spreadsheet Upload

Introduction to Microsoft Access 2007

Tips and Tricks. Table of Contents. Run Control ID

HYPERION FINANCIAL MANAGEMENT SYSTEM 9 RELEASE USER S GUIDE

Creating Cost Recovery Layouts

To Ac&vate Keyboard Apple Systems Preferences. Interna&onal Input Menu install them from an OS X CD Localized Files Switch Keyboards

Workflow Solutions for Very Large Workspaces

PaymentNet Federal Card Solutions Cardholder FAQs

Despatch Manager Online

Essential SharePoint Search Hints for 2010 and Beyond

VDF Query User Manual

Adobe Conversion Settings in Word. Section 508: Why comply?

Regular Expressions. Abstract

Access 2010 Intermediate Skills

Using Bluetooth-Enabled PosiTector 6000 with Statistical Process Control Software

Sample- for evaluation purposes only! Advanced Crystal Reports. TeachUcomp, Inc.

How To Write A File System On A Microsoft Office (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) (

Commonly Used Excel Formulas

Step Sheet: Creating a Data Table and Charts

Sabinet Reference Guide

7-Bit coded Character Set

Chapter 7: Configuring ScanMail emanager

GENEVA COLLEGE INFORMATION TECHNOLOGY SERVICES. Password POLICY

Characters & Strings Lesson 1 Outline

Bulk Downloader. Call Recording: Bulk Downloader

How to transfer your Recipient Address Book from FedEx Ship Manager at fedex.ca to FedEx Ship Manager Software

Quick Reference Guide

Home Page. LexisNexis Butterworths LexisNexis Butterworths Academic Quick Start Guide

Your Guide to setting up Sage One Accounting from your Accountant. Setting Up Sage One Accounting

Access Queries (Office 2003)

Regular Expressions. General Concepts About Regular Expressions

AFN-PayrollEFTGuide

SEARCHING THE SUNDIAL MAIL ARCHIVE by Carl Sabanski / edited by Mac Oglesby

How To Import A File Into The Raise S Edge

FrontStream CRM Import Guide Page 2

Transcription:

dtsearch Frequently Asked Questions Version 3

Table of Contents dtsearch Indexing... 3 Q. What is dtsearch?... 3 Q. How can I view and change my indexing options?... 3 Q. What characters are indexed?... 3 Q. What characters are treated as spaces?... 3 Q. How are hyphens treated?... 3 Q. What terms/words are indexed?... 4 Q. What are Noise Words?... 4 Q. How can I index individual letters?... 4 Q. What does Merge Index do?... 4 Using Index Search... 5 Q. What search operators can be used in index searches?... 5 Q. How can I use Regular Expressions in index searches?... 5 Q. How do I search for a phrase?... 6 Q. How are search hits calculated?... 6 Q. What does Accumulate Results do?... 6 Q. Can I import a list of search terms?... 6 Q. How do I search for accented and Unicode characters?... 7 Q. Can I have a noise word in my search term?... 7 Q. Can I use punctuation and symbols in search terms?... 7

dtsearch Indexing Q. What is dtsearch? A. dtsearch is a product that performs index-based searches. dtsearch is the backbone of Index Searching in FTK. Q. How can I view and change my indexing options? A. Indexing options can only be viewed and changed when a case is first created. From the New Case Options window (the first window that opens when you select to create a new case), click the Detailed Options button, then click the Indexing Options button. Q. What characters are indexed? A. By default, the indexable alphabet includes the underscore ( _ ), numbers 0-9, and letters A-Z (upper and lower cases). Other foreign characters including the following accented characters are also indexed by default (both upper and lower cases): Accent Character Grave à è ì ò ù Acute á é í ó ú ý Circumflex â ê î ô û Tilde ã ñ õ Umlaut ä ë ï ö ü ÿ Q. What characters are treated as spaces? A. The following characters are treated as spaces by default: Single Quote Back Slash \ Exclamation Mark! Colon : Double Quotes Semicolon ; Pound Sign (Hash) # Question Mark? Dollar Sign $ At Symbol @ Percent % Left Bracket [ Ampersand & Right Parenthesis ( Lift Parenthesis ) Asterisk * Comma, Forward Slash / Right Bracket ] Carrot ^ Tick Mark ` Left Brace { Right Brace } Bar Tilde ~ Plus + Equals = Less Than < Greater Than > Carriage Return Form Feed Line Feed Space Tab Q. How are hyphens treated? A. By default, the hyphen (dash) is treated as a space, but can be set to be ignored or treated as a hyphen. If you choose to treat hyphens as hyphens rather than as spaces, the hyphen becomes part of the indexable alphabet.

Q. What terms/words are indexed? A. By default, any group of 2-32 characters from the indexable alphabet, surrounded by space characters, will be indexed as long as that term is not considered a Noise Word. Q. What are Noise Words? A. Noise words are extremely common words such as it and the, and are not added to the index. A list of all noise words can be found in the Indexing Options window. If you know when starting a case that you will need to search for terms that may contain noise words, you should remove those terms from the list of Noise Words in the Indexing Options. Q. How can I index individual letters? A. Indexing single letters can be useful if you want to search for acronyms with periods or spaces (eg. A.C.E. or A C E ). Some FTK installations will show an entry that looks like a b c d e f g h in the Noise Words list. If you see this entry, you can simply remove it from the Noise Words list to allow FTK to index individual letters. If you do not see this entry, you can do the following: 1. Open the Indexing Options while creating a new case 2. Click the Add button next to the Noise Words list, and add the string a b c d e f g h I j k l m n o p q r s t u v w x y z (unquoted) 3. Click OK 4. Click Save as My Defaults 5. Go into Indexing Options again 6. Highlight the string a b c d e f g h that you just entered and click Remove 7. Click OK Note: We do not recommend making this change to the Indexing Options if ediscovery is installed on your FTK box. Q. What does Merge Index do? A. Indexing is multi-threaded and each thread creates its own index. Because multiple files will usually contain the same words, the words in each index will overlap each other (eg. one index tracked the word berries in 55 different files, and another index tracked the word berries in another 100 files). Merging indexes will merge all the indexes into one, eliminating this overlap and resulting in faster index searches. However, the process of merging indexes can take as long as the original indexing process, so you will need to determine based on the size of your case if the trade off is worth it.

Using Index Search Q. What search operators can be used in index searches? A. Search operators along with examples are listed below: Name Operator Example Result And AND apple AND pear Matches files containing both apple and pear Or OR apple OR pear Matches files containing either apple or pear Within W/ apple w/4 pear Matches apple within 4 words of pear Not NOT apple AND NOT pear Matches files containing apple but not apple NOT w/4 pear pear Matches files containing apple but not within 4 words of pear Contains CONTAINS name CONTAINS adam Matches files whose metadata name field contains adam Wildcard * apple * Matches apple followed by any word appl* Matches appl followed by any number or characters ( apples, applied, etc.)? appl? Matches appl followed by any character ( apple, apply, etc.) Single Character Wildcard Stemming ~ apply~ Matches different stems/conjugations of apply ( applies, applied, etc.) Fuzzy % ba%nana Matches different spellings of banana ( banana, bannana", etc.) Phonic # #pear Matches words that sound like pear ( pare, pair, etc.) Synonym & fast& Matches synonyms for fast ( quick, speedy, etc.) Numeric Range ~~ 12~~20 Matches any number between 12 and 20 Q. How can I use Regular Expressions in index searches? A. As of FTK 4, we have added the ability to use Regular Expressions in index searches. We use the TR1 implementation of Regular Expressions. The full documentation of TR1 with its capabilities can be accessed at http://msdn.microsoft.com/en-us/library/bb982727.aspx. Rules: A Regular Expression in a search request must be quoted and must begin with ##. Example: "##appl." will match "apple A Regular Expression search term will only match a single word. Example: "##app.*ie" will not match "apple pie" Multiple Regular Expression search terms can be used together by using other search operators. Example: "##appl." AND "##[a-z]ie" will match "apple pie"

Some special characters in Regular Expressions: Regular Expression Effect Example Result. A period matches any single "##appl." Matches "apple" or "apply" character [abc] Brackets indicate a set of characters, one of with must be present "##car[se]" Matches "cars" or "care", but not "carb" [a-z] A dash inside brackets indicates a range of characters, one of with must be present [^a-z] A carrot indicates characters that must not be present * An asterisk must follow another regular expression and means "0 or more" of something. + A plus must follow another regular expression and means "1 or more" of something "##car[a-f]" "##appl[y]" "##app.*" "##app.+" Matches "carb" or "care", but not "cars" Matches "apple" but not "apply" Matches "app", "apps", "apple", or "applied" ("app" followed by any string or characters, or by nothing) Matches "apps", "apple", or "applied", but not "app" ("app" followed by any string or characters) Q. How do I search for a phrase? A. The most reliable way to search for a phrase is to enter it without quotes. For example, if you want to search for the phrase angry brown fox you would enter angry brown fox (unquoted) into the Terms box. Q. How are search hits calculated? A. Search hits are calculated over indexed terms, not search terms. Search terms consist of one or more indexed terms, so any match to a search term can yield 1 or more hits depending on how many indexed terms are in the search term. For example, every match to the search term fox will result in 1 hit, while every match for brown fox will result in two hits (because brown and fox are two different indexed terms). Q. What does Accumulate Results do? A. Accumulate Results shows you how many hits your search will yield. This can give you a rough idea of how long it will take to perform your search (the more hits a search finds, the longer it may take to perform the search and populate the results). Q. Can I import a list of search terms? A. Yes. By clicking the Import button to the right of the Search Criteria field you can import a text file with a list of search terms. Make sure that each search term is on a new line in the text file.

Q. How do I search for accented and Unicode characters? A. Currently, you can t type accented and Unicode characters directly into the Search Terms field. However, you can use these characters by importing a list of search terms or copy-and-pasting search terms into the Search Terms field. Q. Can I have a noise word in my search term? A. No. Any noise words in a search term will be treated as a wildcards when searching. For example, searching for statue of liberty will match any three-word phrase beginning with statue and ending with liberty (because of is a noise word by default). If you know when starting a case that you will need to search for terms that may contain noise words, you should remove those terms from the list of Noise Words in the Indexing Options. In addition, the terms "and", "not", "or," and "contains" will always be treated as operators, so you cannot search for these terms even if you remove them from the list of Noise Words. Q. Can I use punctuation and symbols in search terms? A. No. Any punctuation or symbols will be treated as a string of wildcards when searching. For example, searching for microsoft.com will match any length phrase beginning with the word microsoft and ending with com. Likewise, searching for user@microsoft.com will match any length phrase starting with the word user and ending in com that also contains the word microsoft. If you would like to search for a phrase that contains punctuation or symbols, use spaces instead of symbols in the search term (eg. searching for user microsoft com will match user@microsoft.com ).