Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I

Size: px

Start display at page:

Download "Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I"

Barrie Golden
10 years ago
Views:

1 Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I Multimedia Computing, Universität Augsburg [email protected]

2 PSNR_Y Goal: Text Extraction Locate text of any size at any position in images, web pages and videos Segment and recognize text Encode extracted text as rigid foreground object in MPEG4 (with Yen-Kuang Chen) Signle VOP KBits/sec Multiple VOP 2

as rigid foreground object in MPEG4 (with Yen-Kuang Chen) 27.5 31.5 31 30.

3 Related Work 1. Y. Zhong, K. Karu and A. K. Jain. Locating Text in Complex Color Images. Pattern Recognition, Vol. 28, No. 10, pp , October Rainer Lienhart and Frank Stuber. Automatic Text Recognition in Digital Videos. In Image and Video Processing IV 1996, Proc. SPIE , pp , Jan. 1996; also TR , Dec B.-L. Yeo, B. Liu. Visual Content Highlightning via Auromatic Extraction of Embedded Captions on MPEG Compressed Video. IS&T / SPIE Digital Video Compression: Algorithms and Technologies, Feb Rainer Lienhart. Automatic Text Recognition for Video Indexing. Proc. ACM Multimedia 96, Boston, MA, Nov. 1996, pp S. Sato and T. Kanade. NAME-IT: Association of Face and Name in Video. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Juan, Puerto Rico, June, Sato, T., Kanade, T., Hughes, E., Smith, M. Video OCR for Digital News Archives. IEEE Workshop on Content- Based Access of Image and Video Databases (CAIVD'98), Bombay, India, January, Anil K. Jain and Bin Yu. Automatic Text Location in Images and Video Frames. Pattern Recognition, Vol. 31, No. 12, pp , H. Li, O. Kia and D. Doermann. Text Enhancement In Digital Videos. In Proceedings of SPIE99, Document Recognition and Retrieval, Rainer Lienhart and Wolfgang Effelsberg. Automatic Text Segmentation and Text Recognition for Video Indexing. ACM/Springer Multimedia Systems Magazine, Vol. 8, pp , Jan Huiping Li, David Doemann, Omid Kia. Automatic text detection and tracking in digital video. IEEE Transactions on Image Processing, Vol. 9, No. 1, Jan Daniel Loprestie and JiangYing Zhou. Locating and Recognizing Text in WWW Images. Information Retrieval 2 (Kluwer Academic Publishers.), , (2000). 12. Axel Wernicke and Rainer Lienhart. On the Segmentation of Text in Videos. IEEE Int. Conference on Multimedia and Expo (ICME2000), Vol.3, pp , July More information at Rainer Lienhart, Axel Wernicke. Localizing and Segmenting Text in Images and Videos. IEEE Transactions on Circuits and Systems for Video Technology, pp , April ,

Visual Content Highlightning via Auromatic Extraction of Embedded Captions on MPEG Compressed Video. IS&T / SPIE Digital Video Compression: Algorithms and Technologies, Feb. 1996. 4. Rainer Lienhart.

4 Design Decisions What kind of text occurrences? Scene text Overlay text With what style attributes? Font size Font type Text color In what kind of media data? Image-based Video-based any both What should be achieved? Localization Segmentation Recognition Integrated recognition How will the results be used? Indexing both Object-based video encoding 4

Font size Font type Text color In what kind of media data?

5 Overview OCR result: Dec

6 Text Localization (1/2) 6

7 Text Box Consolidation (2/2) Derive initial text bounding boxes Refine bounding boxes Remove text boxes which are Too small/large, or Have a bad width-to-height aspect ratio 7

8 Monitoring + Tracking Result: Text Objects 8

9 Background Removal Temporal alignment of text lines 3 bitmap at t, t+45, t+90 Low variance image Border floodfilling Binarized image 9

10 Experimental Results Text localization Image-based: 69.5% (boxes) / 85% (pixels) Video-based: 94.9% (boxes) Text segmentation 79.6% correctly segmented 7.6% damaged, but still recognizable Text recognition 70% (over all steps) 10

11 Demo 11

A Dynamic Approach to Extract Texts and Captions from Videos

A Dynamic Approach to Extract Texts and Captions from Videos Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,