DATASHEET www.docscorp.com/contentcrawler. Scanned images saved as TIFF or image PDF. Emails with TIFF or image-based PDF attachments



Similar documents
pdfdocs Solution Suite Content Management for Law Firms

DocsCorp Integration Matrix

Nuance ecopy ShareScan. Brings paper documents into the digital world. Document capture & distribution Nuance ecopy

Features compared: Worldox Productivity Suite modules and the full version of those products from DocsCorp

Scan Process Distribute. Secure Scanning integrated in your Office Printing Infrastructure

Konica Minolta Unity Document Suite. Powerful integrated document processing. Document capture & distribution Unity Document Suite

Nuance ecopy ShareScan 5 and ecopy PDF Pro Office 5 Reviewers' Guide

Quick Reference Guide

Toolbox 4.3. System Requirements

Simplify essential workflows with dynamic scanning capabilities. GlobalScan NX Server 32/Server 750 Capture & Distribution Solution

OCR and PDF Compression

document process automation

Nuance ecopy ShareScan v5. Document Imaging Software. Digitize and streamline paper-based workflows.

GlobalScan NX. Server 32/Server 750. Intelligent scanning for smarter workflow

easy ntelligent convenient GlobalScan NX Server 5/ Server 32/Server 750 Capture & Distribution Solution Energize Critical Workflows

In addition, a decision should be made about the date range of the documents to be scanned. There are a number of options:

CONTINIA DOCUMENT CAPTURE FACTSHEET ENGLISH LANGUAGE

Server 32/Server 750. GlobalScan NX Server 5/ SOLUTION. Intelligent scanning for smarter workflow

e CABINET AND DOCULEX Document Capture and Electronic File Conversion

Océ PRISMA archive software. Archiving made easy. Powerful, high-volume. archiving software

PRESS RELEASE. AIIM, Philadelphia, May 15 th 2006 Embargo until, May 15 th 2006, at 5:40 p.m

Document Capture and Distribution

InstaFile. Complete Document management System

Samsung SmarThru Workflow 2 Digitize your print environment with secure, cost effective document workflow

Perfecting Advanced Rendering ADLIB PDF PRODUCT GUIDE

Nuance Power PDF is PDF uncompromised.

Introduction to WIPOScan Software

NUANCE The experience speaks for itself

Whitepaper Document Solutions

Navigate your workflow

Google Drive and more Image Capture Networked MFPs

AccuRead OCR. Administrator's Guide

Sharpdesk Solution Sharpdesk Document Management Solution

ABBYY PDF Transformer+ User s Guide

Connections to External File Sources

Intelligent Data Capturing and Indexing

ADOS 6.1. ADOS version 6.1 ADOS 6.1. Effective information archive and retrieval. Security and compliance. Return on investment.

Appendix A. Functional Requirements: Document Management

Intelligent Document Solutions

Connector Access License and Bundle Update

AccuRead OCR. Administrator's Guide

Nuance AutoStore route destinations

Nuance Power PDF Advanced.

Legal Solution: vendor invoice processing. Automating the Capture, Delivery, Approval and Accessibility of Invoices

Capture INTEGRATED SCANNING WORKFLOW CAPTURE PROCESS DISTRIBUTE

I want to run my business with state of the art technology.

INTEGRATED SCANNING WORKFLOW

I want to run my business with state of the art technology.

I.R.I.S. launches IRISPdf 5.0, the new version of its production OCR solution including, for the first time, an Arabic OCR add-on!

Document Management Solutions

How To Manage Documents On A Cloud On A Pc Or Mac Or Mac (For Pc Or Ipa)

Frequently Asked Questions

OneTouch 4.0 with OmniPage OCR Features. Mini Guide

Enterprise Printing Solutions. Secure, on-premise mobile printing platform. enterprise education public printing locations print simply anywhere

Veco User Guides. Document Management

WHITE PAPER. 3-Heights Scan to PDF Server Basics and Applications

Compare and Contrast OCR and Forms Recognition Technologies. Peter Lang and Scott Hamilton

3 C i t y C e n t e r D r i v e S u i t e S t. L o u i s, MO w w w. k n o w l e d g e l a k e. c o m P a g e 3

File Formats for Electronic Document Review Why PDF Trumps TIFF

EPSON PERFECTION SCANNING BASICS

Electronic Document Management System Specification Checklist

Document Management Release Notes

Leverage SharePoint with PSI:Capture

Image Gateway for Apeos 2.0

TeleForm v10 System Requirements. Cardiff White Paper. TeleForm

document sharing platform advanced document management user-friendly administration SOFTWARE SOLUTIONS

PDF solution comparison

ABBYY Version 12 User s Guide. FineReader ABBYY Production LLC. All rights reserved.

Using CONNECT to Outlook. CONNECT to Outlook ProductInfo. A strong team: DocuWare and Microsoft Outlook. Benefits

Dispatcher Phoenix is available in three distinct and customizable solutions to meet customer needs most effectively and efficiently:

Practice Management Installation Guide. Requirements/Prerequisites: Workstation Requirements. Page 1 of 5

The Paperless Office - You Can Do It!

EMC APPLICATIONXTENDER 8.0 Real-Time Document Management

for Windows Document Management Software

The Need for PDF Search Search and Index Overview IFilter Architecture Performance and Scalability Are Essential...

Switch to Electronic Document Management save time, space, money and frustration.

Customer Tips. Xerox Network Scanning TWAIN Configuration for the WorkCentre 7328/7335/7345. for the user. Purpose. Background

Are you ready for more efficient and effective ways to manage discovery?

Hardware & Software Requirements for BID2WIN Estimating & Bidding, the BUILD2WIN Product Suite, and BID2WIN Management Reporting

PDF/A Competence Center

Contract Management with RecFind 6

Sharepoint vs. inforouter

How To Use Pdf Files On A Pc Or Mac Or Mac With A Pdf File Manager On A Microsoft Powerbook Or Powerbook On A Pdf (Powerbook) On A Mac Or Powerintosh On A Powerbook With A Powerpoint 3D

Scan to PC Desktop Professional 10 Install Instructions

ivos Technical Requirements V For Current Clients as of June 2014

ADOBE ACROBAT X PRO SCAN AND OPTICAL CHARACTER RECOGNITION (OCR)

Common Questions and Concerns About Documentum at NEF

Electronic Document Management Small to Medium Enterprise Systems Overview. Technology by DOCOsoft

SOFT FLOW 2012 PRODUCT OVERVIEW

The Employ Florida Marketplace Document Management module provides the following capabilities:

System Requirements for Web Applications

Transcription:

DATASHEET www.docscorp.com/contentcrawler Make every document retrievable and searchable Improve access to businesscritical information Reduce non-compliance and non-discovery risk Increase efficiency with automated workflows Maximize the value of investment in DMS and search technology Integration with DMS Integration with MS Windows file systems THE RISKS ARE GREAT Access to information in the digital age is crucial. Businesses have invested heavily in Content Repositories such as Document Management Systems as well as in search technology to ensure they have instant access to business-critical documents. Despite this investment, 20% of documents in Content Repositories may be nonsearchable and therefore invisible to search technology. SOURCE OF NON-SEARCHABLE CONTENT Scanned images saved as TIFF or image PDF Emails with TIFF or image-based PDF attachments Electronic faxes saved as TIFF or PDF Legacy image, PDF or email documents from business acquisitions or litigation file ingestion Failure to locate a business-critical document can undermine efficiency and productivity as well as put your organization s reputation and financial well-being at risk when it cannot comply with discovery requests. THE SOURCES ARE MANY Image-based files such as faxes, image PDFs and scanned documents often get profiled in the DMS, or saved to a MS Windows folder through a variety of workflow loopholes; email attachments, legacy documents, mobile technology, documents ingested from acquisitions and imported litigation files. These documents are invisible to your search technology. RISKS OF NON-SEARCHABLE CONTENT Non-discovery of critical documents for a case, project, matter Failure to comply with Court orders to produce documents Productivity loss searching for missing documents Investment in DMS and search technologies not being maximized User confidence in the DMS system is lost when content is not found THE SOLUTION IS SIMPLE contentcrawler give document management professionals confidence that their content is 100% searchable. contentcrawler has uncovered a range of documents, including PDFs that had previously not been searchable within our DMS. The solution has greatly enhanced our ability to find documents quickly with the use of our DMS search functionality. Mark Turner: Lubbock Fine - Managing Partner We use contentcrawler to ensure that newly profiled and legacy PDFs are fully text-searchable. DocsCorp has worked closely with us and has been very responsive to our requests for program enhancements. Jeff Hutchinson: Mendes & Mount, LLP - Director of Information Technology RETRIEVABLE AND SEARCHABLE contentcrawler is a framework for searching Content Repositories such as Windows folders, an entire DMS database or a subset of documents based on specific queries. contentcrawler identifies non-searchable content, converts it to a text-searchable PDF using DocsCorp OCR technology and saves it back into the Content Repository. For Document Management Systems, documents identified as being image documents are saved as either New Versions, Attachments, New Renditions or Related Documents (depends on DMS). A text layer is added to the document to facilitate search.

FLEXIBLE CONFIGURATION Create any number of definable Services to gain access to a Content Repository Assess the text searchability of documents in the Content Repository OCR documents that meet the text searchability threshold Add a layer of hidden text to a PDF, saving it back into the Content Repository DMS SEARCH Identify image documents, image PDFs and email message files in the DMS by default Supports TIFF, JPG, PNG and BMP image types Caters for multiple databases or libraries WINDOWS FILE SYSTEM Searches MS Windows folders for non-searchable content Searches for image-based PDFs, JPEG, TIFF, PNG, BMP and email messages ASSESS TEXT- SEARCHABILITY MAKE SEARCHABLE (OCR) SAVE TO DMS AUDIT AND REPORTING ACTIVE MONITORING Checks all non-searchable content such as image files, image PDFs and emails with attachments Identifies PDFs with little or no text. Text-based PDFs will not be processed as they are already text-searchable Checks emails stored in a Content Repository to assess text-searchability of the attachments Replaces the email attachments with OCR d PDFs if appropriate Documents are processed using OCR technology to generate a new PDF with a hidden text layer No requirement for a text file separate to the image or PDF file The text layer is searchable Use search feature in Adobe Acrobat or Reader to find and review content Integrates with Autonomy imanage 8.2 and higher Integrates with OpenText edocs DM 5.1.05 and higher, and OpenText Content Server 9.7.1 and 10 Uses DMS API for all connectivity all security models and privileges honored Save the OCR PDF into DMS as a New Version, Related Document, New Rendition or as an Attachment (depends on DMS) Rich administrative dashboard to monitor, configure and report on progress Maximum control with Hold for Review options prior to OCR and/or Save to Content Repository steps Embedded Microsoft SQL database for access to richer reporting if required Automate the protection of contentcrawler with the Active Monitoring service Assess and OCR newly-profiled or edited document profiles on a regular schedule of your choosing Automate workflows to make documents in the Content Repository searchable PATENT PENDING contentcrawler and contentcrawler OCR are trademarks of the DocsCorp International Unit. contentcrawler 2011 DocsCorp International Unit Trust The application makes use of the following recognition technologies: ABBYY FineReader Engine 9.0 2008. FINEREADER, ABBYY & ABBYY FineReader are registered trademarks of ABBYY Software Ltd. SYSTEM REQUIREMENTS OPERATING SYSTEMS Windows XP Professional (SP3) 32-bit Windows 7 32 and 64-bit Windows 2008 R2 64-bit MS.NET Framework 3.5 and 4.0 Extended INTEGRATION HP imanage MS Windows file systems OpenText edocs DM OpenText Content Server ProLaw Worldox WebDAV MS SharePoint SYDNEY LONDON NEW YORK PITTSBURGH PORTLAND MANILA info@docscorp.com www.docscorp.com

DATASHEET www.docscorp.com/ocrdesktop ocrdesktop Digitize business-critical documents No retyping of documents Increase productivity and efficiency ocrdesktop for a mobile workforce Fastest, most reliable OCR engine Cost-effective OCR solution Supports business processes and workflow Access to information can be a huge challenge for many businesses, especially if that information is locked away in paper documents or image files created from a scanner or a fax machine. Documents can be misplaced, lost or stored offsite; image files cannot be searched; there are costs associated with storage, lost productivity searching for or retyping documents; customer service can suffer as a result of not being able to locate or having to request a document from storage. INFORMATION AT YOUR FINGERTIPS INTEGRATED OCR SOLUTION ocrdesktop provides businesses with an OCR solution that digitizes critical business information contained in paper documents and image files, which can be converted to an editable document format such as MS Word, or to a text-searchable PDF. Never retype another document again. pdfdocs OCR Desktop converts PDF documents to Word files. Convert PDF documents to Word for easy editing and formatting. FOR PEOPLE ON THE MOVE ocrdesktop is ideal for individuals who are mobile and don t have access to a network, or who work at a branch office where they have to endure lengthy processing times due to queued jobs on the server. ocrdesktop allows you to OCR documents quickly and easily from your laptop. All in all, pdfdocs Desktop can serve as an adequate substitute for Acrobat Professional (and a lot less expensively). The pdfdocs OCR Server is also worth a serious look. TechnoLawyer: John Heckman, legal technology consultant I highly recommend this product to any law firm that wants efficient software to organize its PDF documents this is the best I have seen to date. Legal Assistant Today: Milton Hooper, litigation support specialist Converting paper documents and image files to textsearchable PDF documents gives everyone instant access to all the critical business information you need to make profitable business decisions. INCREASE EFFICIENCY It is important from an efficiency perspective that people have ready access to the documents they need when they need them. This is increasingly important as people collaborate more on document content. NEVER LOSE ANOTHER DOCUMENT Locating documents misfiled on a network is a timeconsuming and costly exercise. Converting image files to text searchable PDF documents can help solve this problem. Never lose another document again. ARCHIVING AND COMPLIANCE Document management is at the heart of any organization s compliance and risk management strategies. Law firms, corporate legal departments and government agencies must have complete and total control over their documents if they are to comply with laws and regulations on document management and archiving. You need to be able to search and find any documents that contains a certain keyword or phrase. Certainly, all searchable documents will be returned that meet the criteria. But, what about documents such as image PDFs, JPEGs and TIFFs, which are not text-searchable?

INTEGRATION Integrates with pdfdocs and comparedocs Save to Word directly from your Document Management System PDF CONVERSION Convert graphic files produced by fax machines, scanners and Document Management Systems to image PDF documents Convert image PDFs to text-searchable PDFs Convert documents to 100%-compliant PDF/A documents PDF OUTPUT Image only Image on text Text and images Text on images Font embedding PDF MRC PDF/A 1-a PDF/A 1-b SYSTEM REQUIREMENTS OPERATING SYSTEMS PDF (PDF/A) MRC COMPRESSION SAVE TO WORD AUTOMATION Mixed Raster Content (MRC) minimizes the PDF and PDF/A file size - up to 8 times smaller than JPEG Produces more precise character outlines for better readability Convert any PDF document to Microsoft Word. You can edit and modify the document for reuse throughout the organization. Output (RTF, DOC, DOCX, TXT) ocrdesktop integrates with scanners using pdfdocs Watchfolders. Image files in Watchfolders are automatically converted to text searchable PDF documents. OCR can be performed on one page, one document or a collated document set. Microsoft Windows XP with SP2 or 3 (32-bit and 64-bit) Microsoft Windows Vista with or without SP1 (32-bit and 64-bit) Microsoft Windows 7 (32-bit and 64-bit) Microsoft Windows Server 2003 with SP2 (32-bit and 64-bit) running Terminal Services/Citrix Metaframe Microsoft Windows Small Business Server 2003 with SP1 (32-bit and 64-bit) Microsoft Windows Server 2008 (32-bit and 64-bit) ACCURACY ADVANCED IMAGE PROCESSING FOREIGN LANGUAGE RECOGNITION Over 99% accurate. Uses cutting-edge technology to recognize and convert text Dot-matrix recognition Automatically deskew pages for alignment Removes speckles from images created using a scanner for better recognition OCR for 198 languages, including Arabic, Asian (Chinese, Japanese, Korean, Taiwanese, Thai, Vietnamese) European Multi-lingual document recognition. Can recognize multiple languages in the same document INTEGRATION HP imanage HP TRIM Objective OpenText edocs DM OpenText Content Server MS SharePoint NetDocuments ProLaw Worldox SEARCHING DOCUMENT RETRIEVAL Using the free Adobe Reader software, you can search individual PDF documents for specific information. Search highlights each instance of the word in the document in its exact location. This is useful when you need to see the words in context. Supports text searching using third party products such as Windows Desktop searching, Document Management software or Google Desktop search. Locate misfiled or missing files on your system quickly and easily by searching on a word or string of words in the document. SYDNEY LONDON NEW YORK WASHINGTON DC PORTLAND MANILA info@docscorp.com www.docscorp.com

DATASHEET www.docscorp.com/ocrserver ocrserver Increase productivity and efficiency Fastest, most reliable OCR engine No retyping of documents Advanced document management integration Cost-effective OCR solution Supports business processes and workflow ocrserver provides businesses with an OCR solution that integrates with their existing business applications to capture critical business information locked away in image files, which can then be made accessible to all in a format that is accessible and searchable. DOCUMENT MANAGEMENT INTEGRATION ocrserver integrates via desktop clients with HP imanage, HP TRIM, OpenText edocs DM, OpenText Content Server, MS SharePoint, NetDocuments, ProLaw and Worldox document management systems. INTEGRATED OCR SOLUTION When used in conjunction with pdfdocs, users may choose to OCR specific documents, or administrators can leverage multi-function devices and pdfdocs Watchfolders to provide automated workflows whereby all documents are converted to text-searchable PDFs. INFORMATION AT YOUR FINGERTIPS Converting image files to text-searchable PDF documents delivered to every desktop gives everyone instant access to all the critical business information they need to make the right business decisions. All in all, pdfdocs Desktop can serve as an adequate substitute for Acrobat Professional (and a lot less expensively). The pdfdocs OCR Server is also worth a serious look. I highly recommend this product to any law firm that wants efficient software to organize its PDF documents this is the best I have seen to date. Legal Assistant Today: Milton Hooper, litigation support specialist LOCATE DOCUMENTS INSTANTLY Never lose another document again. Locating documents misfiled on a network is a time-consuming and costly exercise. Converting image files to text-searchable PDF documents can help solve this problem. Once converted, you can search your network for specific document content using Google Desktop Search or other third party products. PDF TO WORD Never type another document again. pdfdocs OCR Server converts PDF documents to MS Word files. pdfdocs OCR Server enables users to convert PDF documents to MS Word documents, allowing free-flowing text for easy editing and formatting, and PDF/A for archiving. BRINGING IT ALL TOGETHER ocrserver integrates with your scanners, MFDs, document management systems and other businesscritical applications to produce seamless, secure business documents that can be safely distributed inside and outside the organization. TechnoLawyer: John Heckman, legal technology consultant

PDF CONVERSION Convert graphic files produced by fax machines, scanners and Document Management Systems to PDF. Adds an invisible layer of searchable text behind the original image, which remains unchanged. OCR RECOGNITION TEMPLATES Provides the ability to configure various output templates to control the options used to OCR or publish documents using pdfdocs Desktop. These options include the type of output document, languages that the source documents are typed in as well as options for manipulating scanned documents SAVE TO WORD Convert any PDF image document to MS Word. You can edit and modify the document for reuse throughout the organization. SEARCHING DOCUMENT RETRIEVAL AUTOMATION ACCURACY Using the free Adobe Reader software, you can search individual PDF documents for specific information. Search highlights each instance of the word in the document in its exact location. This is useful when you need to see the words in context. Supports text searching using third party products such as Windows Desktop searching, Document Management software or Google Desktop search. Locate misfiled or missing files on your system quickly and easily by searching on a word or string of words in the document. OCR processing is performed on your file server reducing the need for complex desktop software installations or expensive high-powered workstations. ocrserver integrates with scanners using pdfdocs Desktop Watchfolders. Image files in Watchfolders are automatically converted to text-searchable PDF documents. OCR can be performed on one page, one document or a collated document set. Searchable PDF document is delivered to every pdfdocs user. Converts 119 languages with 99% accuracy. Uses cutting-edge technology to recognize and convert text. Despeckle modules enhances difficult to read documents to help recognition. Financial, legal and medical dictionaries enhance recognition. Asian language and handwriting conversion optional extra. SYSTEM REQUIREMENTS OPERATING SYSTEMS Windows XP with Service Pack 3 (or) Windows 7 (or) Windows Server 2008 (or) Windows Server 2008 R2 32 bit or 64 bit 4GB RAM 2GB disk space for program files and document cache Microsoft.NET Framework 3.5 (this is already included in Windows 7 and Microsoft Server 2008 R2). 3.5 SP1 recommended. Microsoft.NET Framework 4.0 Extended ocrserver can be run on a virtualized Windows environment such as VMware. INTEGRATION HP imanage HP TRIM Objective OpenText edocs DM OpenText Content Server MS SharePoint NetDocuments ProLaw Worldox SYDNEY MELBOURNE LONDON NEW YORK WASHINGTON DC PORTLAND info@docscorp.com www.docscorp.com