Industrial Challenges for Content-Based Image Retrieval



Similar documents
Open issues and research trends in Content-based Image Retrieval

Content-Based Image Retrieval

Steven C.H. Hoi School of Information Systems Singapore Management University

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

T-REDSPEED White paper

Information Technology Systems (2012)

GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING

Florida International University - University of Miami TRECVID 2014

Online Credit Card Application and Identity Crime Detection

Similarity Search in a Very Large Scale Using Hadoop and HBase

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

Introduction. A. Bellaachia Page: 1

Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval

technical notes trimble realworks software

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Industry 4.0 and Big Data

A Statistical Text Mining Method for Patent Analysis

Trimble Realworks Software

Concept and Project Objectives

Using Data Mining for Mobile Communication Clustering and Characterization

Advanced Analytics for Audit Case Selection

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) ( ) Roman Kern. KTI, TU Graz

Virtual Environments - Basics -

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

Rich Media & HD Video Streaming Integration with Brightcove

Digital Identity & Authentication Directions Biometric Applications Who is doing what? Academia, Industry, Government

Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features

Master s Program in Information Systems

Massive Labeled Solar Image Data Benchmarks for Automated Feature Recognition

WebSphere Business Monitor

Web Browsing Quality of Experience Score

PSG College of Technology, Coimbatore Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

Monitoring Web Browsing Habits of User Using Web Log Analysis and Role-Based Web Accessing Control. Phudinan Singkhamfu, Parinya Suwanasrikham

A MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS

Symantec Enterprise Vault for Lotus Domino

The Need for Training in Big Data: Experiences and Case Studies

Visual Structure Analysis of Flow Charts in Patent Images

Delivering Smart Answers!

Tracking and Recognition in Sports Videos

A Tool for Exploratory Visualization of Bus Mobility and Ridership: A case study of Lisbon, Portugal

Big Data: Image & Video Analytics

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics

Location Analytics for Financial Services. An Esri White Paper October 2013

Video Analytics A New Standard

Data Visualization and Business Insights Using SAS Visual Analytics. University of Connecticut Dan Sokol Thulasi Kumar 1/13/2015

Sentiment Analysis on Big Data

Spam Filtering in Online Social Networks Using Machine Learning Technique

White Paper. The Assurance Checklist for Branch Networks A pragmatic guide for building high performance branch office networks.

IBM Data Security Services for endpoint data protection endpoint data loss prevention solution

Document Image Retrieval using Signatures as Queries

Semantic Search in Portals using Ontologies

DIGITAL ASSET MANAGEMENT

Search and Information Retrieval

Analecta Vol. 8, No. 2 ISSN

M3039 MPEG 97/ January 1998

Deep Learning Meets Heterogeneous Computing. Dr. Ren Wu Distinguished Scientist, IDL, Baidu

IBM Data Security Services for endpoint data protection endpoint data loss prevention solution

An Overview of Knowledge Discovery Database and Data mining Techniques

Feasibility Study of Searchable Image Encryption System of Streaming Service based on Cloud Computing Environment

Complex, true real-time analytics on massive, changing datasets.

IBM Content Analytics: Rapid insight for crime investigation

Pixels Description of scene contents. Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.) William T. Freeman (MIT) Banksy, 2006

WebSphere Business Monitor

Module II: Multimedia Data Mining

Data Mining Algorithms Part 1. Dejan Sarka

Database Marketing, Business Intelligence and Knowledge Discovery

The Implementation of Face Security for Authentication Implemented on Mobile Phone

Big Data Text Mining and Visualization. Anton Heijs

Digital Asset Management (DAM) Protecting, preserving, retrieving and distributing digital assets

NUIX WHITE PAPER THE INVESTIGATIVE LAB: A MODEL FOR EFFICIENT COLLABORATIVE DIGITAL INVESTIGATIONS WHITE PAPER

Filling the Semantic Gap: A Genetic Programming Framework for Content-Based Image Retrieval

Face Recognition: Some Challenges in Forensics. Anil K. Jain, Brendan Klare, and Unsang Park

Master Technology Teacher Standards

Securing and Accelerating Databases In Minutes using GreenSQL

Steven C.H. Hoi. School of Computer Engineering Nanyang Technological University Singapore

Scalability and Performance Report - Analyzer 2007

Small Steps Towards Big Data Ric Clarke, Australian Bureau of Statistics

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS

Patent data mining and effective patent portfolio management

Template-based Eye and Mouth Detection for 3D Video Conferencing

FOTOSTATION. The best media file organizer. It s organized

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

Reporting and Incident Management for Firewalls

Server Consolidation with SQL Server 2008

Final Project Report

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Big Data & Scripting Part II Streaming Algorithms

Transcription:

Title Slide Industrial Challenges for Content-Based Image Retrieval Chahab Nastar, CEO Vienna, 20 September 2005 www.ltutech.com LTU technologies Page 1

Agenda CBIR what is it good for? Technological challenges of CBIR Evaluating CBIR About LTU technologies Q&A LTU technologies Page 2

CBIR... what is it good for? Historically, (generic) CBIR is a cool technology... - convergence of information retrieval and computer vision - «user in the loop», link to web search engines - «query by example» approach - feature extraction, fast search, relevance feedback a lot of academic papers... in search of a useful application in the «real» world - worst-case scenario for entrepreneurship: technology preceding market traction LTU technologies Page 3

LTU technologies Page 4 Not interesting: Similar sunsets!

LTU technologies Page 5 Google does the job: category search

CBIR s redemption a powerful paradigm for object/scene recognition finding a series of images of the same scene/object user is in the loop linking the unknown world to the known world LTU technologies Page 6

CBIR in the «real» world Unknown world (query) Known world (reference) duplicate images B clone images similar images Image mining LTU technologies Page 7

User scenarios Verification: 1:1 - unrelated to information retrieval Identification: 1:N - target search - applications: stolen art, counterfeiting, media monitoring Investigation: q:n - find common ground between images - applications: child exploitation, IT forensics, intelligence Classification: A K LTU technologies Page 8

LTU s technology generalized CBIR Input Any digital image format, e-mail attachement, video frame, Real-time Analysis Visual Feature Extraction color, shape, texture, arrangements, moments, 1 1 0 0 1 1 1 1 0 0 1 1 Image DNA Image Database / Knowledge Base Similarity Retrieval Clone Detection Output Categorization face 100% indoor 80% LTU technologies Page 9

Technological challenges of CBIR Application «Clone» detection Visual retrieval Image annotation Problem Matching: A = B? Similarity: A ~ B? Classification: A K? Metric Matching distance Visual/semantic distance Hamming distance where A and B are images, and K is a concept LTU technologies Page 10

Technological challenges of CBIR Application «Clone» detection Visual retrieval Image annotation Problem Matching: A = B? Similarity: A ~ B? Classification: A K? Metric Matching distance Visual/semantic distance Hamming distance where A and B are images, and K is a concept LTU technologies Page 11

Clone Detection Image clones same visual content, different graphical rendering (typically Photoshop manipulation) original old B&W negatized bad scan cropped inverted titled face erased face erased stretched reencoded LTU technologies Page 12

Technological challenges of CBIR Application «Clone» detection Visual retrieval Image annotation Problem Matching: A = B? Similarity: A ~ B? Classification: A K? Metric Matching distance Visual/semantic distance Hamming distance where A and B are images, and K is a concept LTU technologies Page 13

Cross-modal retrieval: http://corbis.ltutech.com LTU technologies Page 14

Technological challenges of CBIR Application «Clone» detection Visual retrieval Image annotation Problem Matching: A = B? Similarity: A ~ B? Classification: A K? Metric Matching distance Visual/semantic distance Hamming distance where A and B are images, and K is a concept LTU technologies Page 15

Annotation by supervised learning Classifier bank Fusion module AA 1 1 0 0 1 1 1 1 0 0 1 1 BB CC Keyword DD Finite vocabulary Binary or multi-class classification LTU technologies Page 16

LTU technologies Page 17 Annotation by case-based reasoning

Evaluation of CBIR depends upon Image Content - Scenes/objects represented - Image Quality - Domain-specific knowledge User scenario - identification benchmarking database? - investigation using a series of images - classification Usage scenario - Search Acceptable max rank of «retrieved images»? precision or recall? - Classification LTU technologies Page 18

Evaluation within a customer project Large database with Ground Truth - provided by customer or - put together by LTU User scenario & usage scenario taken into account - in most mission-critical scenarios, maximizing recall is the main goal - precision is unimportant LTU technologies Page 19

The LTUtech dataset LTU has collected (mostly over the web) more than 80,000 images of 267 visual categories A visual category is a set of examples of an object - animals, CD covers, gloves, watches etc. An average of 300 images per visual category The total database size is 1.3 GB The images vary in quality, size, format We do not known the copyright of the images The database is shared with the WP3 members of the MUSCLE network LTU technologies Page 20

About LTU Technologies Golden Award for Technology The world s top 100 innovators MIT Technology Review Founded: 1999 from our group s research at INRIA (imedia research group) Mission: Bringing image/video indexing/retrieval to the market Europe s Top 100 Companies The Red Herring Headquartered: Paris & Washington D.C. Acquired by company JASTEC in 2005 European Information Society Technologies Prize LTU technologies Page 21

Value Prop: Law Enforcement & Intelligence Providing tools to automatically mine information from large, unknown sets of images Data Collection Data Analysis Content Management Seized HD Database of images w/ case information Internet Database of suspect Images P2P E-mail LTU technologies evidence Page 22

Key-Clients (1) Government, Police, International agencies Investigations Child protection, Stolen art Computer forensics Counterfeiting, ID theft Intelligence and monitoring Computer, Network or Media LTU technologies Page 23

Value Prop: Media & Intellectual Property Providing efficient access to images and tools for monitoring their usage Enroll Manage Distribute Internet E-Mail 3 Keywording 1 Search 2 Monitoring LTU technologies Page 24

Key-Clients (2) Media and IP protection Image bank management e-commerce, media Patent and design protection Search, classification of designs Online IP protection Logos, visual assets Partners DAM software, text search engines Content filtering, Antivirus software LTU technologies Page 25

Summary Challenges in bringing CBIR to the market - mission-critical applications Evaluation is case-by-case - domain-specific - customer-specific LTU technologies Page 26

Questions & Answers Chahab Nastar CEO Paris, France cn AT LTUtech.com LTU technologies Page 27