OCRopus Addons. Internship Report. Submitted to:
|
|
|
- Kenneth Patrick
- 9 years ago
- Views:
Transcription
1 OCRopus Addons Internship Report Submitted to: Image Understanding and Pattern Recognition Lab German Research Center for Artificial Intelligence Kaiserslautern, Germany Submitted by: Ambrish Dantrey, B. Tech. III year, E&CE Indian Institute of Technology, Roorkee Roorkee, India Supervisors: Faisal Shafait, Illya Mezhirov Reviewer: prof. Dr. Thomas Breuel Start Date for Internship: 15 th May, 2007 End Date for Internship: 27 th July, 2007
2 Report Date: 27 th July, 2007 Preface This report documents the work done during the summer internship at Image Understanding and Pattern Recognition(IUPR) Lab, Deutsche Forschungszentrum für Künstliche Intelligenz(DFKI), Germany under the supervision of Prof. Dr. Thomas Breuel. The report first shall give an overview of the tasks completed during the period of internship with technical details. Then the results obtained shall be discussed and analyzed. Report shall also elaborate on the the future works which can be persuaded as an advancement of the current work. I have tried my best to keep report simple yet technically correct. I hope I succeed in my attempt. Ambrish Dantrey
3 Acknowledgments Simply put, I could not have done this work without the lots of help I received cheerfully from whole IUPR. The work culture in IUPR really motivates. Everybody is such a friendly and cheerful companion here that work stress is never comes in way. I would specially like to thank Dr. Thomas Breuel and Dr. Daniel keysers for proving the nice ideas to work upon. Not only did they advised about my project but listening to their discussions in IPeT meeting have evoked a good interest in Image analysis. I am also highly indebted to my supervisors Faisal Shafait and Ilya Mezhirov, who seemed to have solutions to all my problems. Author
4 Abstract The report presents the three tasks completed during summer internship at IUPR which are listed below: 1. Detection of headlines in document images with black run lengths and OCRopus performance evaluation in detecting headlines 2. Re engineering the zone classification module 3. Evaluation of different segmentation algorithms performance All these tasks have been completed successfully and results were according to expectations. The detection of headlines achieved a low error rate of 2.85% as against 6.52 of previously used methods. During evaluation of segmentation algorithms XY cut was found to gain a lot by noise cleanup, which is an interesting result as it strengthen the claim of XY cut segmentation algorithm as a suitable method for OCRopus. The re engineering and porting of zone classification module to OCRopus makes it possible for OCRopus to have a text/image segmentation if it is required in future. Author
5 OCRopus : Introduction Though the field of optical character recognition(ocr) is considered to be widely explored, the development of an efficient system for use in real world situations still remains a challenge for developers. OCRopus is a state of the art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, multi lingual capabilities and is being developed at IUPR. This being a very big project, I was assigned the tasks of developing tools for layout analysis and evaluation. The Goals: Following goals were set as I proceeded in my work: 1. Conversion of ground truth data in MARG database from XML format to hocr micro format[1]. 2. Development of a rule based headline detection method using the median black run length of the lines.
6 3. Development of segmentation classification module and evaluation of performance of different segmentation algorithms as against noise. 1. XML to hocr: hocr is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR related information co exist in the same file and survives editing and manipulation. hocr markup is independent of the presentation. Due to all above qualities of hocr format, it is highly desirable to have ground truth in this format. I was assigned the task of converting the MARG database ground truth into hocr format. For this purpose I have written following script. Script Name : xml to hocr Language Used: Python Command line argument form: xml to hocr FILE.XML FILE.XML : The file in XML format to be converted into hocr micro format. Note: The script does not take care of latex characters yet. It would be an improvement to incorporate this feature. 2. Headline detection Based on black run length and its integration into OCRopus: Detection of headlines in document images is one issue that is mostly overlooked but yet is highly desirable to properly format the output of OCR. OCRopus had till now used a rule based method which used space between lines as the criteria for detection of headlines. Though this method worked for many images, it also failed many times. It was an obvious observation that black run lengths of headlines are more than the black run length of the normal line, and we tried to build upon this
7 concept. We used median black run length of a line as the deciding criteria. The median was used instead of mean because mean run length could have easily been affected by the noise merging with text and would have produce errors. The whole approach is simple as discussed below: 1. Calculate the median black run length for the each line on page. 2. Compare this run length for each line with the lines below and above it. 3. If black run length for a line has been found K1(a parameter) times the median run length of line below it, and K2(another parameter) times the median run length of the line above it,set it as a headline. The value of parameters K1 and K2 was to be found experimentally. After many times evaluating the performance of the program, the value of K1 and K2 has been set to 1.5 and 1.1 respectively. We used histogram based method to find the median run length. A histogram of the number of occurrences versus run length was calculated, once we have such a histogram we normalize it with the largest value of occurrence. Then we calculated the cumulative distribution function for this normalized histogram. The point when cumulative distribution function reches a value of 0.5, corresponds to the median runlength. The program for detection of headlines was written in C++ and used standard OCRopus classes. The program has been successfully integrated into OCRopus and Evaluation: We also designed a tool which evaluates the performance of the OCRopus in detecting headlines. As according to OCRopus standards, this tool has been developed to work with files in hocr micro format. This tool comprises of two programs: 1. The first program takes the OCRopus output and the corresponding ground truth file in hocr format and outputs the total no of false positives and
8 false negatives which occurred in detection. It also outputs the total no of true headlines which are present in the ground truth. The command line form of this programs is: headline eval hocr true hocr actual 2. The second program is for parsing the file produced by running above program on a large no of files(or on a database) and counts the total no of false positives and false negatives occurred in whole database and tells the error rate of OCRopus on whole database. The command line form of this programs is count_errors FILE.TXT Both of the above programs were written in PYTHON. Criteria for evaluation: For evaluating the performance of OCRopus in detection of headlines we define the the error rate as: e = (fp+fn)/t e = percentage error fp = total no of false positives fn = total no of false negatives We evaluated the performance on standard University of Washington III (UW III) database [2]. The results for headline detection program showed clearly that median black run length criteria is better than the space between lines criteria, yet errors were still present. While visually analyzing the output, an observation was made that run length based criteria and space based criteria both produced different false negatives and positives. Hence it was clear that one of the method can be used to remove the errors produced by other. So we tried to combine the both approaches in such a way that space based criteria is used as a filter to detect false positives produced by the run length based criteria. The rule which was used to combine them was as follows: 1. Use run length based criteria to find the headlines. 2. Calculate the median black run length for whole page
9 3. Compare the median black run length of all lines found to be headline in step 1 with the median black run length of the page. Since median black runlength of the page represents just the simple line not a headline, if any headline found in step 1 has a run length less than or equal to the run length for whole page, it is a suspicious case. Recheck for this line with space based criteria. Results: The results were as expected. Only run length based criteria performed better than only space based criteria and a combination of both the criteria as described above outperformed the both. The error rates on standard UW3 database for different approaches are as follows: Space based headline detection: total no of text lines: total no of false positives: total no of false negatives: % error = 6.52% Black Run length based headline detection: total no of text lines: total no of false positives: total no of false negatives: % error = 4.14% Both approaches combined (using space based approach as a filter to remove false positives) total no of text lines: total no of false positives:
10 total no of false negatives: % error = 2.85% Next we show some of the examples:
11
12 3. Text/Image Segmentation and Classification Document image layout analysis is a crucial step in many applications related to document images, like text extraction using optical character recognition (OCR), reflowing documents, and layout based document retrieval. Layout analysis is the process of identifying layout structures by analyzing page images. Layout structures can be physical (text, graphics, pictures,... ) or logical (titles, paragraphs, captions, headings,... ). The identification of physical layout structures is called physical or geometric layout analysis, while assigning different logical roles to the detected regions is termed as logical layout analysis [3]. The task of a geometric layout analysis system is to segment the document image into homogeneous zones, each consisting of only one physical layout structure, and to identify their spatial relationship (e.g. reading order). Therefore, the performance of layout analysis methods depends heavily on the page segmentation algorithm used. A detailed explanation of defferent segmentation algorithms and their performance comparison can be found in [4,5]. Also, another important subtask of document image analysis in the classification of physically segmented blocks into one of the predefined classes. In most of the cases the classification steps follows the segmentation and it is highly desirable to evaluate the system performance on whole segmentation/classification task. With the help of such an evaluation, it is easy to decide if the incorporation of these step in OCRopus would result in improved performance. also it would be easy to decide which segmentation algorithm to use. For classification step we used method as described in [6] this being the best classification method. We used only two classes text and non text which were relevent to OCRopus, instead of eight classes as described in this paper. We already had an implementation of various segmentation algorithms and classification step. The task included re engineering the classification step's code and porting the whole segmentation classification module into OCRopus, making it use standard OCRopus classes and functions. The task has been completed
13 successfully and now we have a version of whole segmentation classification module in OCR repository and it can be integrated with OCRopus if the results and experiments comes positive. The command line form of the program is : ocr classify and display i IMAGE b BOUNDING BOX FILE o OUTPUT IMAGE IMAGE : The image to be classified BOUNDING BOX FILE : The bounding box file produced by segmentation algorithms OUTPUT IMAGE : The name of output image to be written Evaluation As discussed earlier the evaluation of both segmentation and classification steps combined together is highly desirable. The purpose of developing a evaluation module was to decide which segmentation algorithm would best suite the need of OCRopus. We developed a evaluation program which evaluates the performance of two steps as against the ground truth. Our criteria for the evaluation is the hamming distance between the text/non text zone image produced from ground truth and that from the Zone classification module. The error rate is defined as follows: e = HD*100/T e = error rate HD = Hamming distance between ground truth text non text image and actual text non text image T = Total no of pixels present in image %efficiency = 100 e This program was developed in C++. The command line argument form of the program is :
14 ocr evaluate gt GROUND TRUTH IMAGE ai ACTUAL IMAGE GROUND TRUTH IMAGE : Text/non text image produced from ground truth ACTUAL IMAGE : Text/non text image produced from actual program Issue of noise cleanup: Document Image Noise affects the performance of segmentation algorithms greatly. It was our view that the performance of all the algorithms should improve after noise cleanup. A better explanation can be found in [5]. We used noise cleanup system as explained in [7] Also we expected improvement in performance of simple segmentation algorithms like XYcut to be more than that of complex algorithms like voronoi, reason being XYcut gets more affected by noise than voronoi does and as we evaluated the performance of these algorithms with and without noise, we proved correct. Results: Three segmentation algorithms (Voronoi, Docstrum and XYcut )performance was evaluated by our program. The results were as we had expected and hence were quite encouraging. Below are the error rates for all these algorithms with and without noise cleanup. Algorithm Percentage efficiency without noise cleanup Percentage efficiency with noise cleanup Voronoi Docstrum XYcut As evident the performance of all the algorithms increase with noise cleanup, but the improvement was much more for XYcut compared to other algorithms. After noise cleanup XYcut has an efficiency much close to that of Voronoi and being a simple algorithms XYcut can be an optimum choice for the OCRopus. Conclusion:
15 The whole experience of working at IUPR was great. This organization has a superb work culture, great minds and very high quality of work. I learned a lot of about image processing and analysis. The work I could complete here was very satisfactory. I have tried to develop as many add ons as possible for OCRopus and even got very encouraging results with some of them. I hope my work on OCRopus helps it meet its goals.
16 References 1. T. M. Breuel: The hocr Microformat for OCR Workflow and Results : ICDAR,2007, accepted for publication 2. I. Guyon, R.M. Haralick, J.J. Hull and I.T. Phillips: Data sets for OCR and document image understanding research. In: Handbook of character recognition and document image analysis, World Scientific, (1997) R. Cattoni, T. Coianiz,, Messelodi, S. Modena, C.M.: Geometric layout analysis techniques for document image understanding: a review. Technical report, IRST, Trento, Italy (1998) * 4. F. Shafait, D. Keysers, and T. M. Breuel : Performance Comparison of Six Algorithms for Page Segmentation: 7 th IAPR Workshop on Document Analssis Systems(DAS),pages F. Shafait, D. Keysers, T. M. Breuel : Pixel Accurate Representation and Evaluation of Page Segmentation in Document Images : ICPR 2006, International Conference on Pattern Recognition, pages * 6. T. M. Breuel, D. Keysers, F. Shafait : Document Image Zone Classification A Simple High Perfomance Approach : VISAPP 2007, pages T.Gupta: OCRopus addons: tech reports, IUPR, 2007
Page Frame Detection for Marginal Noise Removal from Scanned Documents
Page Frame Detection for Marginal Noise Removal from Scanned Documents Faisal Shafait 1, Joost van Beusekom 2, Daniel Keysers 1, and Thomas M. Breuel 2 1 Image Understanding and Pattern Recognition (IUPR)
The OCRopus Open Source OCR System
The OCRopus Open Source OCR System Thomas M. Breuel DFKI and U. Kaiserslautern Kaiserslautern, Germany [email protected] ABSTRACT OCRopus is a new, open source OCR system emphasizing modularity, easy extensibility,
Adobe Acrobat 9 Pro Accessibility Guide: PDF Accessibility Overview
Adobe Acrobat 9 Pro Accessibility Guide: PDF Accessibility Overview Adobe, the Adobe logo, Acrobat, Acrobat Connect, the Adobe PDF logo, Creative Suite, LiveCycle, and Reader are either registered trademarks
Text versus non-text Distinction in Online Handwritten Documents
Text versus non-text Distinction in Online Handwritten Documents Emanuel Indermühle Horst Bunke Institute of Computer Science and Applied Mathematics University of Bern, CH-3012 Bern, Switzerland {eindermu,bunke}@iam.unibe.ch
2. Distributed Handwriting Recognition. Abstract. 1. Introduction
XPEN: An XML Based Format for Distributed Online Handwriting Recognition A.P.Lenaghan, R.R.Malyan, School of Computing and Information Systems, Kingston University, UK {a.lenaghan,r.malyan}@kingston.ac.uk
Visual Structure Analysis of Flow Charts in Patent Images
Visual Structure Analysis of Flow Charts in Patent Images Roland Mörzinger, René Schuster, András Horti, and Georg Thallinger JOANNEUM RESEARCH Forschungsgesellschaft mbh DIGITAL - Institute for Information
Machine Learning: Overview
Machine Learning: Overview Why Learning? Learning is a core of property of being intelligent. Hence Machine learning is a core subarea of Artificial Intelligence. There is a need for programs to behave
IFS-8000 V2.0 INFORMATION FUSION SYSTEM
IFS-8000 V2.0 INFORMATION FUSION SYSTEM IFS-8000 V2.0 Overview IFS-8000 v2.0 is a flexible, scalable and modular IT system to support the processes of aggregation of information from intercepts to intelligence
Creating Accessible PDF Documents with Adobe Acrobat 7.0 A Guide for Publishing PDF Documents for Use by People with Disabilities
Creating Accessible PDF Documents with Adobe Acrobat 7.0 A Guide for Publishing PDF Documents for Use by People with Disabilities 2005 Adobe Systems Incorporated. All rights reserved. Adobe, the Adobe
The Role of Reactive Typography in the Design of Flexible Hypertext Documents
The Role of Reactive Typography in the Design of Flexible Hypertext Documents Rameshsharma Ramloll Collaborative Systems Engineering Group Computing Department Lancaster University Email: [email protected]
FPGA Implementation of Human Behavior Analysis Using Facial Image
RESEARCH ARTICLE OPEN ACCESS FPGA Implementation of Human Behavior Analysis Using Facial Image A.J Ezhil, K. Adalarasu Department of Electronics & Communication Engineering PSNA College of Engineering
2 SYSTEM DESCRIPTION TECHNIQUES
2 SYSTEM DESCRIPTION TECHNIQUES 2.1 INTRODUCTION Graphical representation of any process is always better and more meaningful than its representation in words. Moreover, it is very difficult to arrange
Classifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
Web Data Extraction: 1 o Semestre 2007/2008
Web Data : Given Slides baseados nos slides oficiais do livro Web Data Mining c Bing Liu, Springer, December, 2006. Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008
FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,
Data Mining & Data Stream Mining Open Source Tools
Data Mining & Data Stream Mining Open Source Tools Darshana Parikh, Priyanka Tirkha Student M.Tech, Dept. of CSE, Sri Balaji College Of Engg. & Tech, Jaipur, Rajasthan, India Assistant Professor, Dept.
Kentico CMS 5 Developer Training Syllabus
Kentico CMS 5 Developer Training Syllabus June 2010 Page 2 Contents About this Course... 4 Overview... 4 Audience Profile... 4 At Course Completion... 4 Course Outline... 5 Module 1: Overview of Kentico
PDF Primer PDF. White Paper
White Paper PDF Primer PDF What is PDF and what is it good for? How does PDF manage content? How is a PDF file structured? What are its capabilities? What are its limitations? Version: 1.0 Date: October
StreamServe Persuasion SP4 StreamServe Connect for SAP - Business Processes
StreamServe Persuasion SP4 StreamServe Connect for SAP - Business Processes User Guide Rev A StreamServe Persuasion SP4StreamServe Connect for SAP - Business Processes User Guide Rev A SAP, mysap.com,
CSE 203 Web Programming 1. Prepared by: Asst. Prof. Dr. Maryam Eskandari
CSE 203 Web Programming 1 Prepared by: Asst. Prof. Dr. Maryam Eskandari Outline Basic concepts related to design and implement a website. HTML/XHTML Dynamic HTML Cascading Style Sheets (CSS) Basic JavaScript
Face detection is a process of localizing and extracting the face region from the
Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
The Re-emergence of Data Capture Technology
The Re-emergence of Data Capture Technology Understanding Today s Digital Capture Solutions Digital capture is a key enabling technology in a business world striving to balance the shifting advantages
PDF Accessibility Overview
Contents 1 Overview of Portable Document Format (PDF) 1 Determine the Accessibility Path for each PDF Document 2 Start with an Accessible Document 2 Characteristics of Accessible PDF files 4 Adobe Acrobat
ICS Technology. PADS Viewer Manual. ICS Technology Inc PO Box 4063 Middletown, NJ 077748 732-671-5400 www.icstec.com
ICS Technology PADS Viewer Manual ICS Technology Inc PO Box 4063 Middletown, NJ 077748 732-671-5400 www.icstec.com Welcome to PADS Viewer Page 1 of 1 Welcome to PADS Viewer Welcome to PADS (Public Area
Writing a Project Report: Style Matters
Writing a Project Report: Style Matters Prof. Alan F. Smeaton Centre for Digital Video Processing and School of Computing Writing for Computing Why ask me to do this? I write a lot papers, chapters, project
Adaptive Automated GUI Testing Producing Test Frameworks to Withstand Change
Adaptive Automated GUI Testing Producing Test Frameworks to Withstand Change Abstract Although QA and Development managers may see many challenges in creating an automated GUI testing framework, it is
Terms and Definitions for CMS Administrators, Architects, and Developers
Sitecore CMS 6 Glossary Rev. 081028 Sitecore CMS 6 Glossary Terms and Definitions for CMS Administrators, Architects, and Developers Table of Contents Chapter 1 Introduction... 3 1.1 Glossary... 4 Page
Connections to External File Sources
Connections to External File Sources By using connections to external sources you can significantly speed up the process of getting up and running with M-Files and importing existing data. For instance,
International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014
Efficient Attendance Management System Using Face Detection and Recognition Arun.A.V, Bhatath.S, Chethan.N, Manmohan.C.M, Hamsaveni M Department of Computer Science and Engineering, Vidya Vardhaka College
Web Document Clustering
Web Document Clustering Lab Project based on the MDL clustering suite http://www.cs.ccsu.edu/~markov/mdlclustering/ Zdravko Markov Computer Science Department Central Connecticut State University New Britain,
Internships and graduation jobs Development
Internships and graduation jobs Development We strongly believe in the power of students. Therefore we offer challenging internships and graduation projects to jumpstart your career. Your job not listed?
Deposit Identification Utility and Visualization Tool
Deposit Identification Utility and Visualization Tool Colorado School of Mines Field Session Summer 2014 David Alexander Jeremy Kerr Luke McPherson Introduction Newmont Mining Corporation was founded in
Contribution of Multiresolution Description for Archive Document Structure Recognition
Contribution of Multiresolution Description for Archive Document Structure Recognition Aurélie Lemaitre, Jean Camillerapp, Bertrand Coüasnon To cite this version: Aurélie Lemaitre, Jean Camillerapp, Bertrand
Euler Vector: A Combinatorial Signature for Gray-Tone Images
Euler Vector: A Combinatorial Signature for Gray-Tone Images Arijit Bishnu, Bhargab B. Bhattacharya y, Malay K. Kundu, C. A. Murthy fbishnu t, bhargab, malay, [email protected] Indian Statistical Institute,
CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS
TABLES, CHARTS, AND GRAPHS / 75 CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS Tables, charts, and graphs are frequently used in statistics to visually communicate data. Such illustrations are also a frequent
Smart Connection 9 Element Labels
08 Smart Connection 9 Element Labels This document is part of the documentation for Smart Connection 9 and is an extract from the former Smart Connection 9 User Guide for InDesign. For more information
Computational Geometry. Lecture 1: Introduction and Convex Hulls
Lecture 1: Introduction and convex hulls 1 Geometry: points, lines,... Plane (two-dimensional), R 2 Space (three-dimensional), R 3 Space (higher-dimensional), R d A point in the plane, 3-dimensional space,
Module 9. User Interface Design. Version 2 CSE IIT, Kharagpur
Module 9 User Interface Design Lesson 21 Types of User Interfaces Specific Instructional Objectives Classify user interfaces into three main types. What are the different ways in which menu items can be
Skills Inventory: Art/Media Communications. 1. Pre-employment Training/Career Development. A. Formal; e.g., certificates. Date Description Location
Skills Inventory: Art/Media Communications 1. Pre-employment Training/Career Development A. Formal; e.g., certificates Date Description Location Art/Design and Communication Skills Inventory: Art/Media
Lecture 14. Point Spread Function (PSF)
Lecture 14 Point Spread Function (PSF), Modulation Transfer Function (MTF), Signal-to-noise Ratio (SNR), Contrast-to-noise Ratio (CNR), and Receiver Operating Curves (ROC) Point Spread Function (PSF) Recollect
Client Overview. Engagement Situation. Key Requirements
Client Overview Our client is one of the leading providers of business intelligence systems for customers especially in BFSI space that needs intensive data analysis of huge amounts of data for their decision
Hypothesis testing. c 2014, Jeffrey S. Simonoff 1
Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there
Limitations of Human Vision. What is computer vision? What is computer vision (cont d)?
What is computer vision? Limitations of Human Vision Slide 1 Computer vision (image understanding) is a discipline that studies how to reconstruct, interpret and understand a 3D scene from its 2D images
Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform
Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform May 2015 Contents 1. Introduction... 3 2. What is BIM... 3 2.1. History of BIM... 3 2.2. Why Implement BIM... 4 2.3.
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
2009 10th International Conference on Document Analysis and Recognition Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment Ahmad Abdulkader Matthew R. Casey Google Inc. [email protected]
Novel Automatic PCB Inspection Technique Based on Connectivity
Novel Automatic PCB Inspection Technique Based on Connectivity MAURO HIROMU TATIBANA ROBERTO DE ALENCAR LOTUFO FEEC/UNICAMP- Faculdade de Engenharia Elétrica e de Computação/ Universidade Estadual de Campinas
Chapter 6 Experiment Process
Chapter 6 Process ation is not simple; we have to prepare, conduct and analyze experiments properly. One of the main advantages of an experiment is the control of, for example, subjects, objects and instrumentation.
Document Image Processing - A Review
Document Image Processing - A Review Shazia Akram Research Scholar University of Kashmir (India) Dr. Mehraj-Ud-Din Dar Director, IT & SS University of Kashmir (India) Aasia Quyoum Research Scholar University
IBM SPSS Direct Marketing
IBM Software IBM SPSS Statistics 19 IBM SPSS Direct Marketing Understand your customers and improve marketing campaigns Highlights With IBM SPSS Direct Marketing, you can: Understand your customers in
Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승
Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승 How much energy do we need for brain functions? Information processing: Trade-off between energy consumption and wiring cost Trade-off between energy consumption
Visual Studio 2008: Windows Presentation Foundation
Visual Studio 2008: Windows Presentation Foundation Course 6460A: Three days; Instructor-Led Introduction This three-day instructor-led course provides students with the knowledge and skills to build and
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
How To Create A Data Transformation And Data Visualization Tool In Java (Xslt) (Programming) (Data Visualization) (Business Process) (Code) (Powerpoint) (Scripting) (Xsv) (Mapper) (
A Generic, Light Weight, Pluggable Data Transformation and Visualization Tool for XML to XML Transformation Rahil A. Khera 1, P. S. Game 2 1,2 Pune Institute of Computer Technology, Affiliated to SPPU,
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,
Abdullah Mohammed Abdullah Khamis
Abdullah Mohammed Abdullah Khamis Jeddah, Saudi Arabia Email: [email protected] Mobile: +966 567243182 Tel: +966 2 6340699 (Yemeni) Research and Professional Objective To Complete my Ph.D. in Pattern
Computer Information Systems
Computer Information System Courses Description 0309331 0306331 0309332 0306332 0309334 0306334 0309341 0306341 0309353 0306353 Database Systems Introduction to database systems, entity-relationship data
Operational Risk Information Sensors for Unstructured Data
Operational Risk Information Sensors for Unstructured Data Abstract Banks and financial institutions use social media extensively and therefore need complex risk detection tools to monitor and understand
Search Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining CPSC 533c, Information Visualization Course Project, Term 2 2003 Fengdong Du [email protected] University of British Columbia
Formulas, Functions and Charts
Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to
Rotorcraft Health Management System (RHMS)
AIAC-11 Eleventh Australian International Aerospace Congress Rotorcraft Health Management System (RHMS) Robab Safa-Bakhsh 1, Dmitry Cherkassky 2 1 The Boeing Company, Phantom Works Philadelphia Center
AQA GCSE in Computer Science Computer Science Microsoft IT Academy Mapping
AQA GCSE in Computer Science Computer Science Microsoft IT Academy Mapping 3.1.1 Constants, variables and data types Understand what is mean by terms data and information Be able to describe the difference
EMC CAPTIVA 7 MODULES OVERVIEW
EMC CAPTIVA 7 MODULES OVERVIEW ABSTRACT This document provides an overview on the EMC Captiva Capture and Advanced Recognition modules, including the image capture, document classification, and data recognition
Enterprise Resource Planning Analysis of Business Intelligence & Emergence of Mining Objects
Enterprise Resource Planning Analysis of Business Intelligence & Emergence of Mining Objects Abstract: Build a model to investigate system and discovering relations that connect variables in a database
The Value of Intelligent Capture in Accounts Payable Automation. White Paper
The Value of Intelligent Capture in Accounts Payable Automation White Paper Contents Executive Summary... 2 Evolution of Capture in AP... 2 Intelligent Capture for AP... 3 Any Source or Format... 3 Integration
MATHEMATICAL INDUCTION. Mathematical Induction. This is a powerful method to prove properties of positive integers.
MATHEMATICAL INDUCTION MIGUEL A LERMA (Last updated: February 8, 003) Mathematical Induction This is a powerful method to prove properties of positive integers Principle of Mathematical Induction Let P
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Development and Implementation of the OpenLR Map Interface for Shapefiles
Mohammed Alhessi Development and Implementation of the OpenLR Map Interface for Shapefiles Duration of the Thesis: 6 months Completion: May 2013 Tutor: Rainer Schützle, MSc Examiner: Prof. Dr.-Ing. Volker
Module 6 Other OCR engines: ABBYY, Tesseract
Uwe Springmann Module 6 Other OCR engines: ABBYY, Tesseract 2015-09-14 1 / 20 Module 6 Other OCR engines: ABBYY, Tesseract Uwe Springmann Centrum für Informations- und Sprachverarbeitung (CIS) Ludwig-Maximilians-Universität
Automatic Extraction of Signatures from Bank Cheques and other Documents
Automatic Extraction of Signatures from Bank Cheques and other Documents Vamsi Krishna Madasu *, Mohd. Hafizuddin Mohd. Yusof, M. Hanmandlu ß, Kurt Kubik * *Intelligent Real-Time Imaging and Sensing group,
Image Processing Based Automatic Visual Inspection System for PCBs
IOSR Journal of Engineering (IOSRJEN) ISSN: 2250-3021 Volume 2, Issue 6 (June 2012), PP 1451-1455 www.iosrjen.org Image Processing Based Automatic Visual Inspection System for PCBs Sanveer Singh 1, Manu
Comparing Methods to Identify Defect Reports in a Change Management Database
Comparing Methods to Identify Defect Reports in a Change Management Database Elaine J. Weyuker, Thomas J. Ostrand AT&T Labs - Research 180 Park Avenue Florham Park, NJ 07932 (weyuker,ostrand)@research.att.com
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Best practices for producing high quality PDF files
University of Michigan Deep Blue deepblue.lib.umich.edu 2006-05-05 Best practices for producing high quality PDF files Formats Group, Deep Blue http://hdl.handle.net/2027.42/58005 Best practices for producing
ROBOTRACKER A SYSTEM FOR TRACKING MULTIPLE ROBOTS IN REAL TIME. by Alex Sirota, [email protected]
ROBOTRACKER A SYSTEM FOR TRACKING MULTIPLE ROBOTS IN REAL TIME by Alex Sirota, [email protected] Project in intelligent systems Computer Science Department Technion Israel Institute of Technology Under the
USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE
USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE Ria A. Sagum, MCS Department of Computer Science, College of Computer and Information Sciences Polytechnic University of the Philippines, Manila, Philippines
Guide to Writing a Project Report
Guide to Writing a Project Report The following notes provide a guideline to report writing, and more generally to writing a scientific article. Please take the time to read them carefully. Even if your
The application of image division method on automatic optical inspection of PCBA
1 1 1 1 0 The application of image division method on automatic optical inspection of PCBA Min-Chie Chiu Department of Automatic Control Engineering Chungchou Institute of Technology, Lane, Sec. 3, Shanchiao
Working With Templates in Web Publisher. Contributed by Paul O Mahony Developer Program
Working With Templates in Web Publisher Contributed by Paul O Mahony Developer Program Overview... 3 Template Options... 3 Web Publisher Editor Templates... 3 Advanced Content Editor... 3 ewebeditpro +
Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection
CSED703R: Deep Learning for Visual Recognition (206S) Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection Bohyung Han Computer Vision Lab. [email protected] 2 3 Object detection
Masters in Information Technology
Computer - Information Technology MSc & MPhil - 2015/6 - July 2015 Masters in Information Technology Programme Requirements Taught Element, and PG Diploma in Information Technology: 120 credits: IS5101
FreeForm Designer. Phone: +972-9-8309999 Fax: +972-9-8309998 POB 8792, Natanya, 42505 Israel www.autofont.com. Document2
FreeForm Designer FreeForm Designer enables designing smart forms based on industry-standard MS Word editing features. FreeForm Designer does not require any knowledge of or training in programming languages
Information Technology Career Field Pathways and Course Structure
Information Technology Career Field Pathways and Course Structure Courses in Information Support and Services (N0) Computer Hardware 2 145025 Computer Software 145030 Networking 2 145035 Network Operating
TU e. Advanced Algorithms: experimentation project. The problem: load balancing with bounded look-ahead. Input: integer m 2: number of machines
The problem: load balancing with bounded look-ahead Input: integer m 2: number of machines integer k 0: the look-ahead numbers t 1,..., t n : the job sizes Problem: assign jobs to machines machine to which
Strand: Reading Literature Topics Standard I can statements Vocabulary Key Ideas and Details
Strand: Reading Literature Key Ideas and Details Craft and Structure RL.3.1 Ask and answer questions to demonstrate understanding of a text, referring explicitly to the text as the basis for the answers.
ICDAR2015 Competition on Recognition of Documents with Complex Layouts RDCL2015
ICDAR2015 Competition on Recognition of Documents with Complex Layouts RDCL2015 A. Antonacopoulos, C. Clausner, C. Papadopoulos and S. Pletschacher Pattern Recognition and Image Analysis (PRImA) Research
Intrusion Detection System using Log Files and Reinforcement Learning
Intrusion Detection System using Log Files and Reinforcement Learning Bhagyashree Deokar, Ambarish Hazarnis Department of Computer Engineering K. J. Somaiya College of Engineering, Mumbai, India ABSTRACT
THE CERN/SL XDATAVIEWER: AN INTERACTIVE GRAPHICAL TOOL FOR DATA VISUALIZATION AND EDITING
THE CERN/SL XDATAVIEWER: AN INTERACTIVE GRAPHICAL TOOL FOR DATA VISUALIZATION AND EDITING Abstract G. Morpurgo, CERN As a result of many years of successive refinements, the CERN/SL Xdataviewer tool has
This manual assumes you already have Precurio installed and configured appropriately.
Precurio Workflows As you must already know, workflow applications help automate processes that require series of steps usually requiring human intervention, such as an approval from various people in
Increase your efficiency with maximum productivity and minimal work
>> ELO DocXtractor II FORM Increase your efficiency with maximum productivity and minimal work ELO DocXtractor II Form eliminates the high costs of capturing forms from day one. The intelligent module
Core Curriculum to the Course:
Core Curriculum to the Course: Environmental Science Law Economy for Engineering Accounting for Engineering Production System Planning and Analysis Electric Circuits Logic Circuits Methods for Electric
Building an Advanced Invariant Real-Time Human Tracking System
UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian
