A Method of Caption Detection in News Video

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "A Method of Caption Detection in News Video"

Transcription

1 3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information. However, it is an urgent problem to find the useful information from a huge amount of news video efficiently and correctly. The caption in news video highly summarizes the related news story and can be used for effective retrieval. In this paper, based on the feature of news video, a method of caption detection in news video is proposed. Firstly, the key frames with captions are detected by using color and edge information. Then, the caption text is extracted by Otsu algorithm. The experiment results show that all the caption frames can be detected and combined with OCR software, the proposed method can give an average recognition rate of 87.3%. Keywords: news video caption detection edge detection Otsu. Introduction As a typical video, news video is the major path through which people could get informed. However, with the accelerating pace of life and a significant increase in news events, the regularly news video on TV is no longer able to meet people s needs, and the internet is now growing so fast that it becomes the main way for people to watch the news. Although provided the freedom to choose any videos as we like, the traditional linear way to find what we need is time-consuming and low efficient. Moreover, important information is often missed. Therefore, it s He HUANG ( ) Information Engineering School, Communication University of China, Beijing, China Pin SHI( ) Information Engineering School, Communication University of China, Beijing, China 3. The authors - Published by Atlantis Press 5

2 necessary to establish news video library for content-based news video retrieval to solve this problem and caption detection is the most important part. The characteristics of news video make the establishment of news video library possible. First, the fixed news video production makes the boundary of each news story clear. Secondly, the title caption of news video providing the basis for video retrieval, since it serves not only as the summary of the news story, but also an important sign of the news video structure. Many research projects have engaged in detection and recognition of caption in recent years []-[6]. Huang at al. [] performs Harris corner detection on stroke map of detected text lines which is based on Log-Gabor filters. Then morphological operation is utilized to connect these corners into text regions. Zhao at al. [] uses a corner based approach which is inspired by the observation that there exist dense and orderly presences of corner points in captions. Leon et al.[3] develops a method combining texture and geometric features to detect captions and also takes advantage of the region-based image model. Sharma at al. [4] presents a new method based on dominant text pixel selection, text representatives and region growing arbitrarily-oriented text detection in video. Cai at al. [5] proposes an algorithm based on edge detection, threshold calculation and edge size limitations, filters non-text regions by the scope of the text pixel density. T.Sato at al.[6] first deals with an image with a 3 3 horizontal differential filter, then extracts vertical edge features with a suitable binary threshold, finally gets independent caption region by detecting aggregated regions and calculating the rectangles around. In this paper, we present an effective method to capture captions in news video: first picking up the key frames with captions by color statistic and edge detection and then obtaining the caption text with Otsu algorithm. Analysis of the caption text in news video The texts in news video can be divided into two categories [7] : scene texts and caption texts. Scene texts are the part of the image captured by the camera, such as texts in scenes and the license plate s, as shown in Fig.. Because of the unfixed location and shown time, it is generally difficult to detect scene texts. Yet caption texts summarize the main information of news story and can be extracted easily because of the fixed location and shown time, as shown in Fig.. There are some common properties of caption texts in most news video programs, as summarize below:. The caption texts in news video have fixed size and fonts in the same news video programs.. There is always a rectangle background behind the caption texts. 3. There is a strong contrast between the background color of caption texts and the image frame color. 4. The location of the caption texts in the same news video programs is fixed. 53

3 5. The caption texts stay in the screen for at least several seconds. According to rough statistics, they can last 5 seconds to seconds. Fig. Scene texts Fig. Caption texts 3. Caption detection In this section, we detail the process and algorithms of key frame detection and caption text extraction. 3. Caption detection process News video is composed of a sequence of image frames. Therefore, the image frames should be picked up from the news video, and the problem of caption detection in news video is then converted into caption detection in news images. The procedure of caption detection in news video is shown in Fig.3.The texts recognition is implemented in the OCR software, it s not included in this paper. News video input Detection of key frames Caption area location Caption segmentation Fig. 3 The procedure of caption detection Texts recognition 54

4 3. Detection of key frames This step is to detect the key frames which have captions and the different frames with the same captions should be abandoned. The most obvious characteristic of key frames is that there is caption border after edge detection [8]. Comparing results of several edge detections, we find the texts are clear but the caption border is incomplete after Roberts edge detection, which can be formulated as = {[ f ( x, y) f ( x +, y + )] + [ f ( x, y + ) f ( x, )] } () g ( x, y) + y Where f(x,y) is the input image. The texts are incomplete but the caption border is clear after Sobel and Prewitt edge detection. Prewitt edge detection operator has two operators which generally known as the template, one is horizontal, the other is vertical, each approaching a partial derivative, as shown in formulae (). p = v p h = () The difference between Sobel operator and Prewitt operator is that they use different templates. Sobel operator is shown in formulae (3). s = s = (3) In conclusion, we choose Prewitt edge to detect caption border and Roberts edge to get frame-to-frame differences. News videos contain two kinds of captions, one of which is the title caption which contains important desired semantic information, and the other is dialogue in an interview which should be abandoned. Differences of background color in these two kinds of captions could be applied as basis of detection. Therefore, we detect topic caption by statistical color characteristics based on region information. The title caption remains unchanged in the screen for at least 5 seconds. In order to accelerate operating speed, we detect the key frames at intervals of 5 seconds. This may give rise to redundancy when the duration of a caption is above seconds and the different frames with the same captions are left out by frameto-frame differences. The procedure of key frame detection in news video is shown in Fig.4. 55

5 Frame extraction every 5 seconds Gray-scale processing Color statistic Prewitt edge Roberts edge Frame-to-frame differences Color judgment Bolder judgment Save or not If all three conditions satisfied, it s key frame Fig. 4 The procedure of key frame detection 3.3 Locating of caption After comparing several algorithms of caption region localization, we find that the algorithm based on edge detection is easy to locate captions. But when the caption background color is similar to the frame color, the detection accuracy is lowered; In addition, the algorithm based on Fuzzy C-Means clustering is hard to find the appropriate initial cluster centres.at the same time the effectiveness is influenced by the caption color information, so it s not suit for all kinds of news video. Considering the location of captions in one kind of news video is fixed, we propose an easy method for one specific kind of News video, which is getting the location by experiments, and it will fit easily for other news video by simply modifying the location. 3.4 caption segmentation Caption segmentation is to divide the caption into two unique regions: texts and background. Among all the image segmentation algorithms, we find Otsu algorithm is appropriate for caption segmentation [9]. If the grayscale of one image is L, pixel gray value is[,,,l],the of pixels whose gray value is i is n i, so the of pixels is 56

6 N = L i= n (4) Probability of pixels with a gray value of i is P = n n (5) i i / Separate the image pixels with a gray threshold value of T into two categories: one class is the pixels with the grayscales of [,.,T], denoted as D, the other class is the pixels with the grayscales of [T+,..,L], denoted asd.the probability of D and D are P (T)and P (T), the grayscale average of D and D is µ ( ) and µ ( ).The variance of D and D are δ ( ) and δ ( ).The gray T T value of the whole image can be formulated as L i T T µ = pi = Po ( T ) µ ( T ) + P ( T ) µ ( T ) (6) i The squared distance between the two classes can be formulated as δ ( T ) = P ( T )( µ ( T ) µ ) + P ( T )( µ ( T ) µ ) b In order to improve processing speed and at the same time combine the results, the formulate can be simplified as ( µ ( T ) µ ( T )) δ b ( T ) = (8) P ( T ) P( T ) The result is shown in Figure 5. (7) (a) Grayscale (b) Binary image Fig. 5 The result of Otsu algorithm 4. Results and Analysis In order to test the effectiveness of the algorithm, five CCTV News programs are selected to be tested. The recall (R) and precision (P), which are defined as follow, are used to evaluate performance of the proposed method. RA P = (9) R + R A B 57

7 R () A R = RA + RC Where R A, R B, R C indicate the of total key frames, the of error detected frames, and the of missed frames respectively. The result is shown in Table. After caption segmentation, the binary captions are taken into OCR software for recognition. The result of recognition is shown in Table. As shown in Table, no frame of the detected video is missed in key frame detection, yet errors exist. The reason lies in cases where there is no caption in the frame, yet other region contains borders after edge detection. In caption text recognition, the average recognition rate is 87.3%. The error is always at the end of caption where the background color is similar to the frame. Table The result of key frame detection Sequence Total frame Key frame Detected frame Missed frame Error frame Recall P Precision R % % % % % % % % % % Table The result of text extraction Sequence Number of words False detected words Recognition rate % % % % % 5. Conclusion Many algorithms of caption detection in news video have been proposed in recent years. But the difference of caption color, size and location between different news programs makes it difficult to obtain an efficient approach for all the news video. While the approach in this paper is fit for one news program with a set of parameters, it will fit easily for other news programs by simply modifying the parameters. 58

8 Acknowledgments: This work is supported by "863" national project, No. AA7 References. X. H. Huang, H. D. Ma "Automatic Detection and Localization of Natural Scene Text in Video" International Conference on Pattern Recognition X. Zhao, K. H. Lin, and Y. Fu. "Text From Corners: A Novel Approach to Detect Text and Caption in Videos" IEEE transactions on image processing, vol., No. 3, Mar..pp M. Leon, V. Vilaplana, A. Gasull and F. Marques. "Region-based Caption Text Extraction, "Proc. IEEE Symp. th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS). IEEE Press. Nov.. pp Sharma, N.Shivakumara, P,Pal, U. ; Blumenstein, M,Tan, C.L. "A New Method for Arbitrarily-Oriented Text Detection in Video" th IAPR International Workshop on Document Analysis Systems (DAS),.9/DAS B.Cai, D.R.Zhou, H.B.Hu "the study and implementation of caption detection and extraction in digital video" Journal of Computer-Aided Design&Computer Graphics 3,5(7): T.Sato,T.Kanade, E.K.Jughes, M.A.Smith and S.Satoh. "Video OCR:indexing digital news libraries by recognition of superimposed captions "ACM Multimedia Systems: Special Issue on Video Libraries. Vo.7.N.5, 999, PP M.Li, B.C.Li, D.W.Su. "Caption Detection and text content extraction algorithm Video Engineering -869(5) H.Su, H.X.Zhou, Z.H.Li. The study of edge detection in image processing [J].Computer Development & Applications.,5(): L.N.Qi, B.Zhang, Z.K.Wang " The application of Otsu in image processing " Radio Engineering 6.36(7) 59

A New Robust Algorithm for Video Text Extraction

A New Robust Algorithm for Video Text Extraction A New Robust Algorithm for Video Text Extraction Pattern Recognition, vol. 36, no. 6, June 2003 Edward K. Wong and Minya Chen School of Electrical Engineering and Computer Science Kyungpook National Univ.

More information

A Dynamic Approach to Extract Texts and Captions from Videos

A Dynamic Approach to Extract Texts and Captions from Videos Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Face detection is a process of localizing and extracting the face region from the

Face detection is a process of localizing and extracting the face region from the Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.

More information

Comparison of Text Extraction Techniques- A Review

Comparison of Text Extraction Techniques- A Review Comparison of Text Extraction Techniques- A Review Divya gera 1, Neelu Jain 2 ME Scholar, Dept of E & C, PEC University of Technology, Chandigarh, India 1 Associate Professor, Dept of E & C, PEC University

More information

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic

More information

Signature Region of Interest using Auto cropping

Signature Region of Interest using Auto cropping ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 1 Signature Region of Interest using Auto cropping Bassam Al-Mahadeen 1, Mokhled S. AlTarawneh 2 and Islam H. AlTarawneh 2 1 Math. And Computer Department,

More information

Keywords Gaussian probability, YCrCb,RGB Model

Keywords Gaussian probability, YCrCb,RGB Model Volume 4, Issue 7, July 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Skin Segmentation

More information

REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING

REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING Ms.PALLAVI CHOUDEKAR Ajay Kumar Garg Engineering College, Department of electrical and electronics Ms.SAYANTI BANERJEE Ajay Kumar Garg Engineering

More information

Thresholding technique with adaptive window selection for uneven lighting image

Thresholding technique with adaptive window selection for uneven lighting image Pattern Recognition Letters 26 (2005) 801 808 wwwelseviercom/locate/patrec Thresholding technique with adaptive window selection for uneven lighting image Qingming Huang a, *, Wen Gao a, Wenjian Cai b

More information

Method for Extracting Product Information from TV Commercial

Method for Extracting Product Information from TV Commercial Method for Extracting Product Information from TV Commercial Kohei Arai Information Science Department Saga University Saga, Japan Herman Tolle Software Engineering Department Brawijaya University Malang,

More information

The Design and Implementation of Traffic Accident Identification System Based on Video

The Design and Implementation of Traffic Accident Identification System Based on Video 3rd International Conference on Multimedia Technology(ICMT 2013) The Design and Implementation of Traffic Accident Identification System Based on Video Chenwei Xiang 1, Tuo Wang 2 Abstract: With the rapid

More information

Automatic Traffic Estimation Using Image Processing

Automatic Traffic Estimation Using Image Processing Automatic Traffic Estimation Using Image Processing Pejman Niksaz Science &Research Branch, Azad University of Yazd, Iran Pezhman_1366@yahoo.com Abstract As we know the population of city and number of

More information

Online Play Segmentation for Broadcasted American Football TV Programs

Online Play Segmentation for Broadcasted American Football TV Programs Online Play Segmentation for Broadcasted American Football TV Programs Liexian Gu 1, Xiaoqing Ding 1, and Xian-Sheng Hua 2 1 Department of Electronic Engineering, Tsinghua University, Beijing, China {lxgu,

More information

Handwritten Character Recognition from Bank Cheque

Handwritten Character Recognition from Bank Cheque International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Special Issue-1 E-ISSN: 2347-2693 Handwritten Character Recognition from Bank Cheque Siddhartha Banerjee*

More information

Lecture Video Indexing and Analysis Using Video OCR Technology

Lecture Video Indexing and Analysis Using Video OCR Technology Lecture Video Indexing and Analysis Using Video OCR Technology Haojin Yang, Maria Siebert, Patrick Lühne, Harald Sack, Christoph Meinel Hasso Plattner Institute (HPI), University of Potsdam P.O. Box 900460,

More information

Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features

Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features Semantic Video Annotation by Mining Association Patterns from and Speech Features Vincent. S. Tseng, Ja-Hwung Su, Jhih-Hong Huang and Chih-Jen Chen Department of Computer Science and Information Engineering

More information

Video OCR for Sport Video Annotation and Retrieval

Video OCR for Sport Video Annotation and Retrieval Video OCR for Sport Video Annotation and Retrieval Datong Chen, Kim Shearer and Hervé Bourlard, Fellow, IEEE Dalle Molle Institute for Perceptual Artificial Intelligence Rue du Simplon 4 CH-190 Martigny

More information

A ROBUST BACKGROUND REMOVAL ALGORTIHMS

A ROBUST BACKGROUND REMOVAL ALGORTIHMS A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING ABSTRACT S.Lakshmi 1 and Dr.V.Sankaranarayanan 2 1 Jeppiaar Engineering College, Chennai lakshmi1503@gmail.com 2 Director, Crescent

More information

AN AUTOMATED APPROACH FOR BACTERIAL COLONY COUNTER Shruti Nagpal

AN AUTOMATED APPROACH FOR BACTERIAL COLONY COUNTER Shruti Nagpal AN AUTOMATED APPROACH FOR BACTERIAL COLONY COUNTER Shruti Nagpal Abstract Counting of bacterial colonies is complex task for microbiologist. To a large extent, accurate colony counting depends on the ability

More information

Locating and Decoding EAN-13 Barcodes from Images Captured by Digital Cameras

Locating and Decoding EAN-13 Barcodes from Images Captured by Digital Cameras Locating and Decoding EAN-13 Barcodes from Images Captured by Digital Cameras W3A.5 Douglas Chai and Florian Hock Visual Information Processing Research Group School of Engineering and Mathematics Edith

More information

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network Proceedings of the 8th WSEAS Int. Conf. on ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING & DATA BASES (AIKED '9) ISSN: 179-519 435 ISBN: 978-96-474-51-2 An Energy-Based Vehicle Tracking System using Principal

More information

FACIAL EXPRESSION RECOGNITION BASED ON EDGE DETECTION

FACIAL EXPRESSION RECOGNITION BASED ON EDGE DETECTION FACIAL EXPRESSION RECOGNITION BASED ON EDGE DETECTION Xiaoming CHEN and Wushan CHENG College of Mechanical Engineering, Shanghai University of Engineering Science, Shanghai 0160, China ABSTRACT Relational

More information

Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I

Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} PSNR_Y

More information

Image Content-Based Email Spam Image Filtering

Image Content-Based Email Spam Image Filtering Image Content-Based Email Spam Image Filtering Jianyi Wang and Kazuki Katagishi Abstract With the population of Internet around the world, email has become one of the main methods of communication among

More information

Morphological segmentation of histology cell images

Morphological segmentation of histology cell images Morphological segmentation of histology cell images A.Nedzved, S.Ablameyko, I.Pitas Institute of Engineering Cybernetics of the National Academy of Sciences Surganova, 6, 00 Minsk, Belarus E-mail abl@newman.bas-net.by

More information

An Efficient Geometric feature based License Plate Localization and Stop Line Violation Detection System

An Efficient Geometric feature based License Plate Localization and Stop Line Violation Detection System An Efficient Geometric feature based License Plate Localization and Stop Line Violation Detection System Waing, Dr.Nyein Aye Abstract Stop line violation causes in myanmar when the back wheel of the car

More information

Automatic Extraction of Direction Information from Road Sign Images Obtained by a. Mobile Mapping System. Abstract

Automatic Extraction of Direction Information from Road Sign Images Obtained by a. Mobile Mapping System. Abstract Automatic Extraction of Direction Information from Road Sign Images Obtained by a Mobile Mapping System Junhee Youn 1) Gi Hong Kim 2) Kyusoo Chong 3) 1) Senior Researcher, Korea Institute of Construction

More information

Analecta Vol. 8, No. 2 ISSN 2064-7964

Analecta Vol. 8, No. 2 ISSN 2064-7964 EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,

More information

Laser Gesture Recognition for Human Machine Interaction

Laser Gesture Recognition for Human Machine Interaction International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-04, Issue-04 E-ISSN: 2347-2693 Laser Gesture Recognition for Human Machine Interaction Umang Keniya 1*, Sarthak

More information

Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies

Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Paper / Case Study Available online at: www.ijarcsms.com Extraction

More information

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,

More information

ECE 533 Project Report Ashish Dhawan Aditi R. Ganesan

ECE 533 Project Report Ashish Dhawan Aditi R. Ganesan Handwritten Signature Verification ECE 533 Project Report by Ashish Dhawan Aditi R. Ganesan Contents 1. Abstract 3. 2. Introduction 4. 3. Approach 6. 4. Pre-processing 8. 5. Feature Extraction 9. 6. Verification

More information

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition Send Orders for Reprints to reprints@benthamscience.ae The Open Electrical & Electronic Engineering Journal, 2014, 8, 599-604 599 Open Access A Facial Expression Recognition Algorithm Based on Local Binary

More information

Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis

Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis 482 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 2, APRIL 2012 Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis Xiaoqian Liu, Member, IEEE, and Weiqiang Wang,

More information

IMAGE MODIFICATION DEVELOPMENT AND IMPLEMENTATION: A SOFTWARE MODELING USING MATLAB

IMAGE MODIFICATION DEVELOPMENT AND IMPLEMENTATION: A SOFTWARE MODELING USING MATLAB Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 8, August 2015,

More information

Interactive Flag Identification using Image Retrieval Techniques

Interactive Flag Identification using Image Retrieval Techniques Interactive Flag Identification using Image Retrieval Techniques Eduardo Hart, Sung-Hyuk Cha, Charles Tappert CSIS, Pace University 861 Bedford Road, Pleasantville NY 10570 USA E-mail: eh39914n@pace.edu,

More information

UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah

UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003 Yun Zhai, Zeeshan Rasheed, Mubarak Shah Computer Vision Laboratory School of Computer Science University of Central Florida, Orlando, Florida ABSTRACT In this

More information

Model-based Chart Image Recognition

Model-based Chart Image Recognition Model-based Chart Image Recognition Weihua Huang, Chew Lim Tan and Wee Kheng Leow SOC, National University of Singapore, 3 Science Drive 2, Singapore 117543 E-mail: {huangwh,tancl, leowwk@comp.nus.edu.sg}

More information

A Fast Algorithm for Multilevel Thresholding

A Fast Algorithm for Multilevel Thresholding JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 17, 713-727 (2001) A Fast Algorithm for Multilevel Thresholding PING-SUNG LIAO, TSE-SHENG CHEN * AND PAU-CHOO CHUNG + Department of Electrical Engineering

More information

Multiple Object Tracking Using SIFT Features and Location Matching

Multiple Object Tracking Using SIFT Features and Location Matching Multiple Object Tracking Using SIFT Features and Location Matching Seok-Wun Ha 1, Yong-Ho Moon 2 1,2 Dept. of Informatics, Engineering Research Institute, Gyeongsang National University, 900 Gazwa-Dong,

More information

Time and Date OCR in CCTV Video

Time and Date OCR in CCTV Video Time and Date OCR in CCTV Video Ginés García-Mateos 1, Andrés García-Meroño 1, Cristina Vicente-Chicote 3, Alberto Ruiz 1, and Pedro E. López-de-Teruel 2 1 Dept. de Informática y Sistemas 2 Dept. de Ingeniería

More information

Saving Mobile Battery Over Cloud Using Image Processing

Saving Mobile Battery Over Cloud Using Image Processing Saving Mobile Battery Over Cloud Using Image Processing Khandekar Dipendra J. Student PDEA S College of Engineering,Manjari (BK) Pune Maharasthra Phadatare Dnyanesh J. Student PDEA S College of Engineering,Manjari

More information

Keywords image processing, signature verification, false acceptance rate, false rejection rate, forgeries, feature vectors, support vector machines.

Keywords image processing, signature verification, false acceptance rate, false rejection rate, forgeries, feature vectors, support vector machines. International Journal of Computer Application and Engineering Technology Volume 3-Issue2, Apr 2014.Pp. 188-192 www.ijcaet.net OFFLINE SIGNATURE VERIFICATION SYSTEM -A REVIEW Pooja Department of Computer

More information

Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode Value

Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode Value IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode

More information

International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014

International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014 Efficient Attendance Management System Using Face Detection and Recognition Arun.A.V, Bhatath.S, Chethan.N, Manmohan.C.M, Hamsaveni M Department of Computer Science and Engineering, Vidya Vardhaka College

More information

Enhanced LIC Pencil Filter

Enhanced LIC Pencil Filter Enhanced LIC Pencil Filter Shigefumi Yamamoto, Xiaoyang Mao, Kenji Tanii, Atsumi Imamiya University of Yamanashi {daisy@media.yamanashi.ac.jp, mao@media.yamanashi.ac.jp, imamiya@media.yamanashi.ac.jp}

More information

Automatic Caption Localization in Compressed Video

Automatic Caption Localization in Compressed Video IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 22, NO. 4, APRIL 2000 385 Automatic Caption Localization in Compressed Video Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE

More information

Degree Reduction of Interval SB Curves

Degree Reduction of Interval SB Curves International Journal of Video&Image Processing and Network Security IJVIPNS-IJENS Vol:13 No:04 1 Degree Reduction of Interval SB Curves O. Ismail, Senior Member, IEEE Abstract Ball basis was introduced

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Using MATLAB Second Edition Rafael C. Gonzalez University of Tennessee Richard E. Woods MedData Interactive Steven L. Eddins The MathWorks, Inc. Gatesmark Publishing A Division

More information

Barcode Based Automated Parking Management System

Barcode Based Automated Parking Management System IJSRD - International Journal for Scientific Research & Development Vol. 2, Issue 03, 2014 ISSN (online): 2321-0613 Barcode Based Automated Parking Management System Parth Rajeshbhai Zalawadia 1 Jasmin

More information

Web Page Layout Via Visual Segmentation

Web Page Layout Via Visual Segmentation Web Page Layout Via Visual Segmentation Ayelet Pnueli, Ruth Bergman, Sagi Schein, Omer Barkol HP Laboratories HPL-2009-160 Keyword(s): Layout understanding, Layout analysis, Web page segmentation, HTML,

More information

Research on News Video Multi-topic Extraction and Summarization

Research on News Video Multi-topic Extraction and Summarization International Journal of New Technology and Research (IJNTR) ISSN:2454-4116, Volume-2, Issue-3, March 2016 Pages 37-39 Research on News Video Multi-topic Extraction and Summarization Di Li, Hua Huo Abstract

More information

Semantic classification of business images

Semantic classification of business images Semantic classification of business images Berna Erol and Jonathan J. Hull Ricoh California Research Center 2882 Sand Hill Rd. Suite 115, Menlo Park, California, USA {berna_erol, hull}@rii.ricoh.com ABSTRACT

More information

Speed Performance Improvement of Vehicle Blob Tracking System

Speed Performance Improvement of Vehicle Blob Tracking System Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu, nevatia@usc.edu Abstract. A speed

More information

IMPLEMENTATION OF IMAGE PROCESSING IN REAL TIME CAR PARKING SYSTEM

IMPLEMENTATION OF IMAGE PROCESSING IN REAL TIME CAR PARKING SYSTEM IMPLEMENTATION OF IMAGE PROCESSING IN REAL TIME CAR PARKING SYSTEM Ms.SAYANTI BANERJEE Ajay Kumar Garg Engineering College, Department of electrical and electronics Ms.PALLAVI CHOUDEKAR Ajay Kumar Garg

More information

Recognition Method for Handwritten Digits Based on Improved Chain Code Histogram Feature

Recognition Method for Handwritten Digits Based on Improved Chain Code Histogram Feature 3rd International Conference on Multimedia Technology ICMT 2013) Recognition Method for Handwritten Digits Based on Improved Chain Code Histogram Feature Qian You, Xichang Wang, Huaying Zhang, Zhen Sun

More information

Research of Digital Character Recognition Technology Based on BP Algorithm

Research of Digital Character Recognition Technology Based on BP Algorithm Research of Digital Character Recognition Technology Based on BP Algorithm Xianmin Wei Computer and Communication Engineering School of Weifang University Weifang, China wfxyweixm@126.com Abstract. This

More information

Canny Edge Detection

Canny Edge Detection Canny Edge Detection 09gr820 March 23, 2009 1 Introduction The purpose of edge detection in general is to significantly reduce the amount of data in an image, while preserving the structural properties

More information

Circle Object Recognition Based on Monocular Vision for Home Security Robot

Circle Object Recognition Based on Monocular Vision for Home Security Robot Journal of Applied Science and Engineering, Vol. 16, No. 3, pp. 261 268 (2013) DOI: 10.6180/jase.2013.16.3.05 Circle Object Recognition Based on Monocular Vision for Home Security Robot Shih-An Li, Ching-Chang

More information

Blog Post Extraction Using Title Finding

Blog Post Extraction Using Title Finding Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School

More information

ISSN: 2348 9510. A Review: Image Retrieval Using Web Multimedia Mining

ISSN: 2348 9510. A Review: Image Retrieval Using Web Multimedia Mining A Review: Image Retrieval Using Web Multimedia Satish Bansal*, K K Yadav** *, **Assistant Professor Prestige Institute Of Management, Gwalior (MP), India Abstract Multimedia object include audio, video,

More information

Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object

More information

ESE498. Intruder Detection System

ESE498. Intruder Detection System 0 Washington University in St. Louis School of Engineering and Applied Science Electrical and Systems Engineering Department ESE498 Intruder Detection System By Allen Chiang, Jonathan Chu, Siwei Su Supervisor

More information

Email Spam Detection Using Customized SimHash Function

Email Spam Detection Using Customized SimHash Function International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email

More information

Line Separation for Complex Document Images Using Fuzzy Runlength

Line Separation for Complex Document Images Using Fuzzy Runlength Line Separation for Complex Document Images Using Fuzzy Runlength Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition(CEDAR) State University of New York at Buffalo,

More information

Object tracking & Motion detection in video sequences

Object tracking & Motion detection in video sequences Introduction Object tracking & Motion detection in video sequences Recomended link: http://cmp.felk.cvut.cz/~hlavac/teachpresen/17compvision3d/41imagemotion.pdf 1 2 DYNAMIC SCENE ANALYSIS The input to

More information

The Dynamic Background Generation Scheme Using an Image Frame

The Dynamic Background Generation Scheme Using an Image Frame The Dynamic Background Generation Scheme Using an Image Frame Statistical Comparison Method *1, Corresponding Author Wen-Yuan Chen, Department of Electronic Engineering, National Chin-Yi University of

More information

Automatic Extraction of Signatures from Bank Cheques and other Documents

Automatic Extraction of Signatures from Bank Cheques and other Documents Automatic Extraction of Signatures from Bank Cheques and other Documents Vamsi Krishna Madasu *, Mohd. Hafizuddin Mohd. Yusof, M. Hanmandlu ß, Kurt Kubik * *Intelligent Real-Time Imaging and Sensing group,

More information

3D Scanner using Line Laser. 1. Introduction. 2. Theory

3D Scanner using Line Laser. 1. Introduction. 2. Theory . Introduction 3D Scanner using Line Laser Di Lu Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute The goal of 3D reconstruction is to recover the 3D properties of a geometric

More information

Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences

Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences Byoung-moon You 1, Kyung-tack Jung 2, Sang-kook Kim 2, and Doo-sung Hwang 3 1 L&Y Vision Technologies, Inc., Daejeon,

More information

Template-based Eye and Mouth Detection for 3D Video Conferencing

Template-based Eye and Mouth Detection for 3D Video Conferencing Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer

More information

An Implementation of Leaf Recognition System using Leaf Vein and Shape

An Implementation of Leaf Recognition System using Leaf Vein and Shape An Implementation of Leaf Recognition System using Leaf Vein and Shape Kue-Bum Lee and Kwang-Seok Hong College of Information and Communication Engineering, Sungkyunkwan University, 300, Chunchun-dong,

More information

Automatic License Plate Recognition using Python and OpenCV

Automatic License Plate Recognition using Python and OpenCV Automatic License Plate Recognition using Python and OpenCV K.M. Sajjad Department of Computer Science and Engineering M.E.S. College of Engineering, Kuttippuram, Kerala me@sajjad.in Abstract Automatic

More information

EXTRACTION OF UNCONSTRAINED CAPTION TEXT FROM GENERAL-PURPOSE VIDEO

EXTRACTION OF UNCONSTRAINED CAPTION TEXT FROM GENERAL-PURPOSE VIDEO The Pennsylvania State University The Graduate School Department of Computer Science and Engineering EXTRACTION OF UNCONSTRAINED CAPTION TEXT FROM GENERAL-PURPOSE VIDEO A Thesis in Computer Science and

More information

SIGNATURE VERIFICATION

SIGNATURE VERIFICATION SIGNATURE VERIFICATION Dr. H.B.Kekre, Dr. Dhirendra Mishra, Ms. Shilpa Buddhadev, Ms. Bhagyashree Mall, Mr. Gaurav Jangid, Ms. Nikita Lakhotia Computer engineering Department, MPSTME, NMIMS University

More information

Visual Structure Analysis of Flow Charts in Patent Images

Visual Structure Analysis of Flow Charts in Patent Images Visual Structure Analysis of Flow Charts in Patent Images Roland Mörzinger, René Schuster, András Horti, and Georg Thallinger JOANNEUM RESEARCH Forschungsgesellschaft mbh DIGITAL - Institute for Information

More information

2D GEOMETRIC SHAPE AND COLOR RECOGNITION USING DIGITAL IMAGE PROCESSING

2D GEOMETRIC SHAPE AND COLOR RECOGNITION USING DIGITAL IMAGE PROCESSING 2D GEOMETRIC SHAPE AND COLOR RECOGNITION USING DIGITAL IMAGE PROCESSING Sanket Rege 1, Rajendra Memane 2, Mihir Phatak 3, Parag Agarwal 4 UG Student, Dept. of E&TC Engineering, PVG s COET, Pune, Maharashtra,

More information

Real Time Eye Tracking and Mouse Control for Physically Disabled

Real Time Eye Tracking and Mouse Control for Physically Disabled Real Time Eye Tracking and Mouse Control for Physically Disabled Sourabh Kanwar VIT University Keywords: Glint, Mouse control, ROI, Tracking Abstract: In the cases of paralysis a person s ability to move

More information

Lecture 4: Thresholding

Lecture 4: Thresholding Lecture 4: Thresholding c Bryan S. Morse, Brigham Young University, 1998 2000 Last modified on Wednesday, January 12, 2000 at 10:00 AM. Reading SH&B, Section 5.1 4.1 Introduction Segmentation involves

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK EFFICIENT FATIGUE DETECTION USING EFFECTIVE FACE TRACKING ALGORITHM MISS. KANCHAN

More information

A New Image Edge Detection Method using Quality-based Clustering. Bijay Neupane Zeyar Aung Wei Lee Woon. Technical Report DNA #2012-01.

A New Image Edge Detection Method using Quality-based Clustering. Bijay Neupane Zeyar Aung Wei Lee Woon. Technical Report DNA #2012-01. A New Image Edge Detection Method using Quality-based Clustering Bijay Neupane Zeyar Aung Wei Lee Woon Technical Report DNA #2012-01 April 2012 Data & Network Analytics Research Group (DNA) Computing and

More information

Arrowsmith: Automatic Archery Scorer Chanh Nguyen and Irving Lin

Arrowsmith: Automatic Archery Scorer Chanh Nguyen and Irving Lin Arrowsmith: Automatic Archery Scorer Chanh Nguyen and Irving Lin Department of Computer Science, Stanford University ABSTRACT We present a method for automatically determining the score of a round of arrows

More information

AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES

AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES In: Stilla U et al (Eds) PIA11. International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences 38 (3/W22) AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES

More information

Text Information Extraction in Images and Video: A Survey. Keechul Jung, Kwang In Kim, Anil K. Jain

Text Information Extraction in Images and Video: A Survey. Keechul Jung, Kwang In Kim, Anil K. Jain Text Information Extraction in Images and Video: A Survey Keechul Jung, Kwang In Kim, Anil K. Jain Abstract Text data present in images and video contain useful information for automatic annotation, indexing,

More information

Binary Image Scanning Algorithm for Cane Segmentation

Binary Image Scanning Algorithm for Cane Segmentation Binary Image Scanning Algorithm for Cane Segmentation Ricardo D. C. Marin Department of Computer Science University Of Canterbury Canterbury, Christchurch ricardo.castanedamarin@pg.canterbury.ac.nz Tom

More information

Natural Language Querying for Content Based Image Retrieval System

Natural Language Querying for Content Based Image Retrieval System Natural Language Querying for Content Based Image Retrieval System Sreena P. H. 1, David Solomon George 2 M.Tech Student, Department of ECE, Rajiv Gandhi Institute of Technology, Kottayam, India 1, Asst.

More information

Research on Chinese financial invoice recognition technology

Research on Chinese financial invoice recognition technology Pattern Recognition Letters 24 (2003) 489 497 www.elsevier.com/locate/patrec Research on Chinese financial invoice recognition technology Delie Ming a,b, *, Jian Liu b, Jinwen Tian b a State Key Laboratory

More information

Build 3D Scanner System based on Binocular Stereo Vision

Build 3D Scanner System based on Binocular Stereo Vision JOURNAL OF COMPUTERS, VOL. 7, NO., FEBRUARY 01 399 Build 3D Scanner System based on Binocular Stereo Vision Zhihua Lv College of Information Engineering, Northwest A&F University, Yangling, 71100, China

More information

Human behavior analysis from videos using optical flow

Human behavior analysis from videos using optical flow L a b o r a t o i r e I n f o r m a t i q u e F o n d a m e n t a l e d e L i l l e Human behavior analysis from videos using optical flow Yassine Benabbas Directeur de thèse : Chabane Djeraba Multitel

More information

Chess Vision. Chua Huiyan Le Vinh Wong Lai Kuan

Chess Vision. Chua Huiyan Le Vinh Wong Lai Kuan Chess Vision Chua Huiyan Le Vinh Wong Lai Kuan Outline Introduction Background Studies 2D Chess Vision Real-time Board Detection Extraction and Undistortion of Board Board Configuration Recognition 3D

More information

Plant Identification Using Leaf Images

Plant Identification Using Leaf Images Plant Identification Using Leaf Images Sachin D. Chothe 1, V.R.Ratnaparkhe 2 P.G. Student, Department of EE, Government College of Engineering, Aurangabad, Maharashtra, India 1 Assistant Professor, Department

More information

SbLRS: Shape based Leaf Retrieval System

SbLRS: Shape based Leaf Retrieval System SbLRS: Shape based Leaf Retrieval System Komal Asrani Department of Information Technology B.B.D.E.C., Lucknow, India Renu Jain Deptt. of C.S.E University Institute of Engineering and Technology, Kanpur,

More information

Questioned Document Examination using CEDAR-FOX. Center for Excellence in Document Analysis and Recognition (CEDAR) August 24, 2007

Questioned Document Examination using CEDAR-FOX. Center for Excellence in Document Analysis and Recognition (CEDAR) August 24, 2007 Questioned Document Examination using CEDAR-FOX Center for Excellence in Document Analysis and Recognition (CEDAR) August 24, 2007 1 CEDAR-FOX is a versatile system for analyzing scanned, handwritten documents

More information

Character Image Patterns as Big Data

Character Image Patterns as Big Data 22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,

More information

The MPEG Standard. MPEG-1 (1992) actually a video player. plays out audio/video streams same type of access as home VCR

The MPEG Standard. MPEG-1 (1992) actually a video player. plays out audio/video streams same type of access as home VCR The MPEG Standard MPEG-1 (1992) actually a video player plays out audio/video streams same type of access as home VCR MPEG-2 (1995) introduced for compression and transmission of digital TV signals still

More information

Vision based distance measurement system using single laser pointer design for underwater vehicle

Vision based distance measurement system using single laser pointer design for underwater vehicle Indian Journal of Marine Sciences Vol. 38(3), September 2009, pp. 324-331 Vision based distance measurement system using single laser pointer design for underwater vehicle Muljowidodo K 1, Mochammad A

More information

Cloud tracking with optical flow for short-term solar forecasting

Cloud tracking with optical flow for short-term solar forecasting Cloud tracking with optical flow for short-term solar forecasting Philip Wood-Bradley, José Zapata, John Pye Solar Thermal Group, Australian National University, Canberra, Australia Corresponding author:

More information

An Improved Segmentation Method for Spray Painting Characters on Slab Surface Chun-Hui HUANG1, Qi-Jie ZHAO1, 2,a*, Zhen-Nan KE1, Xian-Fa LI1

An Improved Segmentation Method for Spray Painting Characters on Slab Surface Chun-Hui HUANG1, Qi-Jie ZHAO1, 2,a*, Zhen-Nan KE1, Xian-Fa LI1 Advances in Engineering Research, volume 103 Proceedings of the 3rd International Conference on Material Engineering and Application (ICMEA 2016) An Improved Segmentation Method for Spray Painting Characters

More information

* Mohit Mudgil Research Scholar, PDM College of Engineering, Bahadurgarh, Distt. Jhajjar (HARYANA).

* Mohit Mudgil Research Scholar, PDM College of Engineering, Bahadurgarh, Distt. Jhajjar (HARYANA). Multi-Scale Distance Matrix for leaf Recognition using MATLAB * Mohit Mudgil Research Scholar, PDM College of Engineering, Bahadurgarh, Distt. Jhajjar (HARYANA). ** Rajiv Dahiya H.O.D. PDM College of Engineering,

More information

A Study of Automatic License Plate Recognition Algorithms and Techniques

A Study of Automatic License Plate Recognition Algorithms and Techniques A Study of Automatic License Plate Recognition Algorithms and Techniques Nima Asadi Intelligent Embedded Systems Mälardalen University Västerås, Sweden nai10001@student.mdh.se ABSTRACT One of the most

More information