THE FORMS INPUT PROJECT



Similar documents
ABBYY recognition technologies ideal alternative to manual data entry. Automating processing of exam tests.

Improvements of the Census Operation of Japan by Using Information Technology

Visual Scoring the 360 View: 5 Steps for Getting Started with Easier, Faster and More Effective Lead Scoring

Chapter 2. My Early Days Trading Forex

Lakeside Medical Clinic

How 2D Scanning Can Benefit your Business

GE Healthcare. Centricity Physician Office Electronic Medical Records

Novo Mail

CONTINIA DOCUMENT CAPTURE FACTSHEET ENGLISH LANGUAGE

Web Banner Design Tips

AUTOMATED FORMS PROCESSING

Compare and Contrast OCR and Forms Recognition Technologies. Peter Lang and Scott Hamilton

The National Number and the Automation of the Civil Records Project Ministry of Interior Administration of Civil Records


Hiring Position Recommendation: AdMind Technology

Image Optimization GUIDE

DICTATION & TRANSCRIPTION. INFO@DOLBEY.com

Conquering the Myths of Inventory Management with MobileInventory.

ITA Dynamics Waste & Recycling Document Management System enwis) DOCMA

Small Business CRM; Who Can Leverage it & is it Affordable?

Paying off a debt. Ethan D. Bolker Maura B. Mast. December 4, 2007

Show me the Money! Meg Eynon, Vice President

Corporate Recruiter Tells All

Software development in distributed environments

SAVING PLANS SAVING PLANS

Testing a Handprint Data Capture System (with Two Different Form Designs)

Doing Break/Fix Well By Chris Martin MSP Business Management

Archiving. Paper Documents

PDF Forms Advantages and application possibilities of electronic forms in PDF format

The Classes P and NP

Best Practices: Inventory Management for the Small to Medium-Sized Business

Document Management: Finding it in the Cloud By Steven J. Best

Our code for collecting council tax

TC Max Asset Management Software

Optical Character Recognition (OCR)

Getting Started with Neat

Achievements and challenges of the single market. S&D responses to citizens top 10 concerns

White Paper Barcoding

WHY SOFTWARE IS SO HARD TO USE: HOW CUSTOMIZED SOLUTIONS CAN HELP

Values of Healthcare

7 Secrets To Websites That Sell. By Alex Nelson

Jeffrey S. Krause, Esq.

The 9 Ugliest Mistakes Made with Data Backup and How to Avoid Them

6 HUGE Website Mistakes that could Cost You Thousands

Opinion and recommendations on challenges raised by biometric developments

Top Challenges in Payroll & HR

The Value of Intelligent Capture in Accounts Payable Automation. White Paper

Using Logistics to Grow Your Business

7 Myths of Direct Mailing Think all your mail is delivered equally? Think again.

Sage HRMS Stay in control: The benefits of in-house payroll software

Customer-Focused Service Enhancements to Online Tax Filing System (E-file) Category: Digital Government: Government to Citizen Washington

BUSINESS MOVES. Moving Guide. for Office Moves. With Bournes, it s personal.

Mobile Device Management: Are You In Control?

EPSON PERFECTION SCANNING BASICS

How to Increase Customer Satisfaction By Outsourcing Your Online Order Fulfillment

Harnessing the Untapped Potential of EHR Systems

For years, Réal s mom has been asking him, What is it you do, again?

PERSONAL WEBSITE DEVELOPMENT 101. Brought to you by the UB Graduate Student Association webmasters: Holly Keily & John Beverley

Advisers 10 best protection selling points

Think about how often you check your own and find tons of messages with subject lines like this:

MFC Mikrokomerc OFFER

The Revolution in Patent Information Publishing: Making International Full-Text Databases a Reality

Exercise 4A: What Info Do You Need for a Loan?

International money transfer 101

Resource Monitoring During Performance Testing. Experience Report by Johann du Plessis. Introduction. Planning for Monitoring

Employer Web-based Basic Pilot Survey. Login Name: Password:

SPECIAL REPORT. How To. Sell Your Home. In 9 Days Or Less. No Commissions! No Fees!

A Guide to Developing a Workflow System for Your Financial Advisory Firm

Going Paperless The Utah Experience. Mike Pecorelli Project Manager Utah DEQ

ISI Debtor Testimonials. April 2015 ISI. Tackling problem debt together

Keywords: inspection; nuclear material control and accountancy (NMC&A); quality management system

Active RFID Solutions for Asset Tracking and Inventory Management

Technology for the Benefit of Individuals with Special Needs

CardScan Executive for Mac Reviewer s Guide

marketing guide AWR www broadbean technology VOIP

Secrets to Automation Success. A White Paper by Paul Merrill, Consultant and Trainer at Beaufort Fairmont, LLC

Everything you need to know: repatriating funds

Chair: Stephen Darvill (Logica) Raporteur: Edward Phelps (EURIM) SUMMARY OF ROUND TABLE STATEMENTS AND DISCUSSION

SPECIAL REPORT: 4 BIG REASONS YOU CAN T AFFORD TO IGNORE BUSINESS CREDIT!

The kinder term for collections is compliance. But the exercise is still the same. Financial obligations unmet and somebody is in charge of

NextStar with Ribbon Scanning The Film Conversion Approach of the 21 st Century

January Dear Colleague,

Focusing on you. Focusing on the future.

2012 NASCIO Recognition Award Nomination

Individual Electoral Registration Frequently asked questions

B.Com(Computers) II Year RELATIONAL DATABASE MANAGEMENT SYSTEM Unit- I

Autism Speaks Sees a 4x Increase in Online Revenue with IPM Advancement s Integrated Fundraising Solutions

Find your perfect office space, in three easy steps.

Summary: Presenter: Start Time: Duration: Winifred Robinson: Les Matheson, Chief Executive of personal and business banking at RBS and NatWest:

Why Your CRM Process is Destroying Your Team s Prospecting and How to Fix It

A Dam with a View» BETH WILSON. Displayed with permission The American Surveyor October 2014 Copyright 2014 Cheves Media

Revolutionary Tabletop Automation. Prescription Validation, Counting and Filling System

Website Design Checklist

Calibration Control. Calibration Management Software. Tools for Management Systems

1. Name of system and ESIG category for which you are applying (Enterprise System or Single Process System).

Foreword by Martin Fowler *

Agent s Handbook. Your guide to satisfied customers

Testing, What is it Good For? Absolutely Everything!

Manage What You Measure Become More Efficient in 45 Days or Less

Transcription:

LINGVO FORM READER FINE READER FINE READER FORM READER LINGVO CASE STUDY ABBYY FORMREADER THE FORMS INPUT PROJECT IN THE STATE TAX SERVICE OF RUSSIA Can you imagine a country with citizens who don t know what is a tax return and have never seen a tax officer? The Russian Federation before 1996 was an example of such a country. At that time Russia was making its first steps on the way to market economics, but a civilized country with market economics cannot exits without an effective tax service. So the law about natural persons taxation was passed by the parliament in 1996. According to this law, every citizen must submit tax returns and income reports (the so-called W-2 forms) to the State Tax Service, and the law was to come into force as of January the 1 st, 1997. So in 1998 the overall volume of tax return and income report forms amounted to 4 000 000 documents from Moscow region citizens only. As there are 7 pages on average per tax return form, the amount of paper is 28 000 000 pages (30 five-ton full-loaded trucks are required to transport this pile of paper!). The data from these forms should be input into information databases of the State Tax Service (now the Ministry of Taxes and Duties). It went without saying that is was impossible to input such an amount of data in time manually, because it would require hundreds of operators doing an unskilled labor retyping forms for many months, as it was before with balance accounts input. And so they would have been typing in the forms instead of raising taxes from solvent citizens and thus guaranteeing the salary for physicians, teaches, scientists, etc., but an automatic solution was already on its way. By that time, ABBYY had already developed a number of data input projects for local offices of the State Tax Service. So in 1997 the State Tax Service contracted ABBYY to find a solution for this data input task. When one develops such projects in Russia, one usually encounters many difficulties of both technical and administrative nature that make the development age-long. But due to the extraordinary efforts made by Information Resources Department of the State Tax Service of Russia and ABBYY Software House the first automatic form data input system was developed and installed in less than 6 months. The project details The primary concerns of the developers were, naturally, the data input issues, but the scale of the project was much larger, because nothing of the kind has been done before in the State Tax Service. There were many organizational issues that should have been taken into account, from the question of where and how to print the forms through the procedures of their distribution among the local tax inspectorates to the questions of how and where to keep the forms archives and so on. So the task of ABBYY was not confined to the development of the data input system; ABBYY was to develop the whole technology of data collection and input. And ABBYY performed most successfully. A single example: a set of special folders and bags to be used for transportation of tax return forms from the local tax offices to the data input centers was designed ABBYY, in order to prevent the possible damage to the tax return forms. A lot of other technology 1

components were also designed, such as information banners telling the people how to fill in the forms, the system of signs in the tax offices telling the people where to get empty forms and where to put the filled ones, etc. One of the main features of this project that made it a difficult and unique task was its revolutionary character. Not a single similar project has ever been run in Russia, but that s not all as yet, there weren t a single country in the whole world who ever tried to develop a project of this kind. Moreover, this project was so serious that not a single mistake in the form input could be tolerated. So, it was decided that a pilot system should be developed and tested in Moscow, and only after the testing results are analyzed should the system be spread across the whole Russia. And the work began. The first step was to develop the tax return form itself, as such kind of document had never been used before in Russia. The new automatic system of processing documents required the machine-readable forms to be used. The machine-readable forms were new for our country. The only institution where such forms were used was the Pension Fund of Russia (semi-governmental organization). And in any case the tax return form were the first multi-page machine-readable forms. Another issue is that the Russian law requires a great number of official approvals from various governmental bodies to be got before a form of an official document may be used officially, and the final approval of the whole tax return form would have obviously taken much time. But the deadline was too close; so we ve made the decision only to receive approval of the future tax return form content, while the decisions on the overall appearance and structure of the form were to be made by the regional tax service offices. So, the content of the machinereadable tax return form was approved by the Ministry of Taxes and Duties, and ABBYY was entrusted to create a sample form layout. ABBYY specialists did everything possible to develop a layout that would suit the task best. First, red was taken as the background color. All inscriptions were in red too, a little darker than background. Using this color makes scanning easier and better. Second, each page was given its own unique barcode and unique letter. These features let the system choose the required form template automatically. The automatic template matching is crucial for the automatic recognition of multi-page forms. So, the tax return form was made ready for using. And then some additional problems emerged. As it was already mentioned, the forms contained several pages. So there was a problem of collecting the different pages of the single form of a single person together. There were some interesting suggestions, for example to assign each citizen a unique identification number, but unfortunately there was no time to do it, so the only way was to transport forms with all possible accuracy to avoid page mix. That was another reason for developing special folders and bags for forms transportation. Another difficulty was that the people themselves, the taxpayers. For the most part, they were to see a tax return form for the first time in their lives, and they tended to make mistakes in form filling because they were unable to follow the instructions properly, as it was all too unusual for them. So the tax officers were to verify all incoming information to prevent mistakes in the tax return data. 2

Technology of centralized tax return form processing This technology was developed with the help of ABBYY experts. All information about the income of each person is kept in a united regional database. The following channels are used to put information into the database: Manual data input from paper documents Automatic data input from scanning stations Data input from electronic media Information is archived periodically and kept in special archives from which it can be easily got at any moment. First ten thousand pages In January 1998 the first tax return forms were collected by in the tax services. The FormReader system was already installed on 33 workstations in the Moscow Tax Office 39 by that time. The system could process more than 10000 forms per day. According to the law, all citizens were to report about their income until the end of April, so a huge river of documents began flooding the tax offices in spring. And they had only 120 days after May 1 to process all of them! Such time limits made it impossible to input all forms manually. But the FormReader system eliminated this problem for at least one Tax Office in Moscow. 3

Igor Popov, Chief of the Data Processing Center of the Tax Ministry of Russia. The advantages of the system developed together with ABBYY are evident considering the season-dependent conditions of our work, says Igor Popov, former Chief of Tax Office 39, now Chief of the Moscow Data Processing Center of the Ministry of Taxes and Duties of Russia, If we tried to input 4 000 000 documents in 120 days manually, we would have to employ more than 1000 professional typists and to buy the same amount of computers, and rent the workspace of about 6000 m 2. And what we would have to do with all these people and workspace during the rest 9 months of the year? Results The pilot project was declared a success, and the technology of automatic data input spread all over Russia. It was planned to install automatic form processing systems in 5 biggest regions of Russia in 1999. The Moscow Tax Office 39 became the Moscow Data Processing Center of the Ministry of Taxes and Duties of Russia. As of now, there are 3 systems of Automatic Data Input installed there, each consists of more than 30 workstations, and the tax return forms from almost every Moscow tax office are brought there for processing. The scanning capacity of the system is 80-100 pages per minute on average. The scanner used is the BancTec S-185S scanner. One operator processes 3-6 pages per minute on average and the verification speed grows as the operator becomes more and more experienced. Further development and modification of the system performed by ABBYY s specialists each year increase the recognition quality and make the verification process more fast and convenient. Effectiveness and cost efficiency of automatic data input technology are now evident to everybody. The input quality featured by FormReader is much greater than the quality of manual input. Figure 1. Pilot project scheme installed in the Moscow Tax Office 39 in 1998 4

Even without verification FormReader makes about 5 mistakes per 1000 handprinted characters * and only 1 mistake per 1000 printed ** characters. It is 4 times less mistakes than a professional typist does in the morning and almost 6 times less mistakes than the same typist does in the evening. Besides, the application automatically checks input information, using reference files, databases, compares sum in figures and sum in words, etc., thus achieving the 100% reliable recognition results. Besides, it goes without saying that FormReader is faster than any, even super professional, typist. Verification is the only stage when the throughput of the whole system is affected by human productivity. That is why ABBYY s experts paid special attention to this stage. We provided a whole set of tools and techniques to be used separately or in combination to organize efficient verification. By changing template settings and adding rules it is possible to fine-tune the verification process to minimize human efforts and solve specific tasks. The three-step verification technology implemented in FormReader (group, context, and in-form verification), customizable error display level and checking level, etc. provide the flexibility that will help you to build the most efficient processing technology. Thanks to all these tools, an operator needs only to check some uncertainly recognized characters. It allows to process 800-1000 pages per day instead of 250-300 pages per day in case of manual input. As of now, 8 largest regions of Russia make effective use of the enterprise version of FormReader system. High form processing speed, perfect quality of recognition, automatic control of recognition results and low cost make FormReader the most efficient automatic data capture system. One of the most important results of this project is the increase of the total sum of taxes raised. The FormReader made the State Tax Service of Russia one of the first tax services in the whole world to use an automatic bulk input system of tax return forms. * for neatly written characters without corrections ** for documents with good printing quality 5