Face Identification by Human and by Computer: Two Sides of the Same Coin, or Not? Tsuhan Chen tsuhan@cmu.edu

Similar documents

Teaching Methodology for 3D Animation

Face Recognition. George Lovell. (Based on Roth & Bruce)

A Comparison of Photometric Normalisation Algorithms for Face Verification

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION

School of Computer Science

Big datasets - promise or Big data, shmig data. D.A. Forsyth, UIUC

A General Framework for Tracking Objects in a Multi-Camera Environment

Taking Inverse Graphics Seriously

A Learning Based Method for Super-Resolution of Low Resolution Images

Methods of psychological assessment of the effectiveness of educational resources online

Masters in Artificial Intelligence

Virtual Environments - Basics -

Lecture 2, Human cognition

Masters in Information Technology

ROBOTRACKER A SYSTEM FOR TRACKING MULTIPLE ROBOTS IN REAL TIME. by Alex Sirota, alex@elbrus.com

Face Model Fitting on Low Resolution Images

Bayesian Image Super-Resolution

Masters in Human Computer Interaction

Masters in Advanced Computer Science

Learning Styles and Aptitudes

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Robust Real-Time Face Detection

Frequently Asked Questions About VisionGauge OnLine

Overview Basic Design Studio A (MCD1330) Visual Arts Studio A (MCD1340) Drawing A (MCD1270)... 4

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Feature Tracking and Optical Flow

Introduction to Computer Graphics

TAMALPAIS UNION HIGH SCHOOL DISTRICT Larkspur, California. GRAPHIC DESIGN (Beginning)

Face Recognition For Remote Database Backup System

Masters in Networks and Distributed Systems

Masters in Computing and Information Technology

Diploma/BA (Hons) Digital Arts - GI401

BRAIN DOMINANCE. By Shaleene Lemke Period 4

Speed Performance Improvement of Vehicle Blob Tracking System

Understanding The Face Image Format Standards

Parametric Comparison of H.264 with Existing Video Standards

Simultaneous Gamma Correction and Registration in the Frequency Domain

Masters in Human Computer Interaction

Extracting a Good Quality Frontal Face Images from Low Resolution Video Sequences

Speech Signal Processing: An Overview

Annual Report H I G H E R E D U C AT I O N C O M M I S S I O N - PA K I S TA N

Introduction. Selim Aksoy. Bilkent University

3D U ser I t er aces and Augmented Reality

Performance Comparison of Visual and Thermal Signatures for Face Recognition

A STATISTICS COURSE FOR ELEMENTARY AND MIDDLE SCHOOL TEACHERS. Gary Kader and Mike Perry Appalachian State University USA

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE.

Case Study: Real-Time Video Quality Monitoring Explored

T-REDSPEED White paper

The Scientific Data Mining Process

Video compression: Performance of available codec software

CS 534: Computer Vision 3D Model-based recognition

Face Recognition: Some Challenges in Forensics. Anil K. Jain, Brendan Klare, and Unsang Park

A Survey of Video Processing with Field Programmable Gate Arrays (FGPA)

Course Outline. Course Information. Course Code and Title: Course Section: Department: Program: Total Hours: 180. Course Description:

Arkansas Teaching Standards

B-bleaching: Agile Overtraining Avoidance in the WiSARD Weightless Neural Classifier

2012 VISUAL ART STANDARDS GRADES K-1-2

Designing and Developing Web Applications by using the Microsoft.NET Framework

Particles, Flocks, Herds, Schools

Customer Success Story

Ware Public Schools VISUAL ARTS Grades 5-7

What is Artificial Intelligence?

A secure face tracking system

Goal We want to know. Introduction. What is VoIP? Carrier Grade VoIP. What is Meant by Carrier-Grade? What is Meant by VoIP? Why VoIP?

Digital Photography 1

Design Philosophy. Should the School Building be Part of the Pedagogy? (From the Perspective of an Architect)

Child Psychology and Education with Technology

Computer Science Electives and Clusters

Blackboard Exemplary Course Program Rubric

Rendering Microgeometry with Volumetric Precomputed Radiance Transfer

Face Recognition in Low-resolution Images by Using Local Zernike Moments

Concept-Mapping Software: How effective is the learning tool in an online learning environment?

School of Computer Science

Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model

Problem-Based Group Activities for a Sensation & Perception Course. David S. Kreiner. University of Central Missouri

USING COMPUTER VISION IN SECURITY APPLICATIONS

TExES Art EC 12 (178) Test at a Glance

PASSIVE DRIVER GAZE TRACKING WITH ACTIVE APPEARANCE MODELS

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

FPGA Implementation of Human Behavior Analysis Using Facial Image

I-Max Touch Range. PAN / CEPH / 3D digital panoramic unit. Evolutive 3 in 1 panoramic unit

Dan French Founder & CEO, Consider Solutions

How To Use The Dc350 Document Camera

Chapter 1. Animation. 1.1 Computer animation

GLOVE-BASED GESTURE RECOGNITION SYSTEM

COURSE TITLE: Elementary Art (Grades 1 5) PREREQUISITE:

UNIT: PSYCHOLOGICAL RESEARCH

An Iterative Image Registration Technique with an Application to Stereo Vision

Selecting a Master s Thesis Research Theme (Ideas for Thesis Research) DR. ANTHONY FAIOLA DR. KARL MACDORMAN

Transcription:

Face Identification by Human and by Computer: Two Sides of the Same Coin, or Not? Tsuhan Chen tsuhan@cmu.edu Carnegie Mellon University Pittsburgh, USA What do you see? 1

What do you see? 2

What do you see? [http://www.palmyra.demon.co.uk] [Tony Karp, Illusion of Beauty ] 3

[Adam Finkelstein, Mona ] What do you see? [http://www.palmyra.demon.co.uk] 4

From Human to Computer Face Identification: A Generalization Problem Single gallery image Pattern Recognition Need to generalize for all variations, without observing those variations Single probe image 5

Computer vs. Human Feng-Shui ( 風水 ) as an example Ancient Chinese room arrangement technique Way 1: Write down all the rules Too many and do not generalize Way 2: Imagine how a dragon would move through the room to arrange it in a livable manner Intuitive and creative Done by Feng-Shui masters Biomimetics? Neural networks Among the first biologically motivated PR; some success in face detection and recognition Limited by training data poor generalization 6

Lesson from Deep Blue November 5, 1997: Deep Blue beat Kasparov (2-w, 1-l, 3-d), the first time in history Deep blue was not designed to mimic humans. Instead, Kasparov it said was it best, designed quantity to take is sometimes quality advantage of a computer s strengths, i.e. speed and memory Deep Blue beat Kasparov by memorizing a large amount of information and table lookup Quantity is not always quality Unfortunately, most object recognition work is under the Deep Blue paradigm Deeper search, more data, faster computers, etc. Object recognition requires some attributes in computers that will be more human-like Generalization (intuition) Adapting from the past Bounds on knowledge Face perception as example Initial overall examination of external features, followed by a sequential analysis of internal features [Matthews, 1978] [Fraser and Parker, 1986] 7

Generalization Banca Database (ICPR2004) Controlled Degraded Adverse - Studio lighting - High quality camera - Minimal pose variation - Varying lighting - Low quality web camera - Some pose variation - Varying lighting - High quality camera - Noticeable pose and other variations 8

Parts Representation Of The Face DCT/Gabor Transform Estimating Parametric Model EM Learner GMM DCT/Gabor Transform 9

Combining Representations Hypothesis Monolithic representation: low-frequency information Parts representation: both low- and high- frequency representation Weighting can be view-dependent Combine the scores with sum rule LDA-COS Input face + COMB FSC-GMM Comparative Results Algorithm/Protocol Mc Ud Ua P LDA-NC [1] 4.93 15.99 20.24 14.79 ORG-SVM [1] 5.43 25.43 30.11 20.33 PCA-MAH 10.2 17.84 26.63 21.57 LDA-COS 6.46 10.99 20.39 14.96 FSC-GMM 2.14 24.78 17.06 21.97 COMB 1.42 9.65 16.51 12.52 [1] M. Sadeghi, J. Kittler, A. Kostin, and K.Messer, A comparative study of automatic face verification algorithms on the BANCA database, in AVBPA, pp. 35 43, 2003. 10

Some Motivations Holistic vs. Parts [Young et al.,1987, Valentine,1995] 11

Thatcher Illusion [Thomson, 1980] Thatcher Illusion [Thomson, 1980] 12

Holistic vs. Parts Parts of faces are [Tanaka and Farah, 1993] easily recognized in typical whole-face configuration less easily in new configuration most poorly recognized in isolation Chins differences detected first [Sargent, 1984] Not as obvious when faces are inverted These suggest Face perception is holistic and by parts Orientation is important Bounds on Knowledge and Adapt from Past 13

Bounds on Knowledge Socrates (470-399 B.C.) "The only true wisdom is in knowing you know nothing." Computer is no where near this yet. It thinks it knows every conceivable variation (but in fact only limited to what has been programmed to it) Adapt from Past Humans are good at adapting using past experience. Can computers do the same? Yes, it is called relevance adaptation (RA) Previously used in speech recognition Obtains a subject-dependent model from a subject-independent average distribution (the past), using a small amount of adaptation data 14

No Relevance Adaptation Relevance Adaptation 15

Another Aspect: 3D/Video 3/4-View Frontal/profile views result in poorer recognition by human than 3/4-view for unfamiliar faces [Baddeley & Woodhead, 1981; Bruce, 1982] profile view ¾ view frontal view ¾ view profile view 3/4-view looks good too! 16

Face Mosaic m v 1 v 2 w m 17

Face in Video Moving faces are significantly better recognized by human than still images Movement provides 3D structure of the face and allows recognition of facial gestures [Knight and Johnston, 1997] To pixelating or blurring, moving images of faces are recognized better than still images [Lander, et al. 1999] (perhaps masking or super-resolution ) Face Recognition from Video Computer can use video too More than simple majority voting or frame selection Integration of temporal/motion/geometry information Updating over time Most variations are continuous (at 30Hz): pose, illumination, expression, registration, etc. 18

Face-in-Action (FiA) Database Other aspects to be explored... 19

Stages of Face Identification Face Identity Name [Young et al, 1982] Common situations: Case 1: Can not recognize the face Case 2: The face looks familiar without identity Case 3: Identify the face (e.g., occupation) but can t recall the name Own-Race Bias We are better at identifying faces belonging to races with which we are familiar [Shapiro and Penrod, 1986] 20

Own-Race Bias Independent Modules Facial expression identified independently of face identity [Bruce, 1986, Young et al.,1986] Prosopagnosia patients can still identify facial emotion Some patients cannot identify facial emotion, but could identify famous faces 21

Independent Modules McGurk effect [McGurk and MacDonald 76] Audio + Visual Perceived ba ga da pa ga ta ma ga na Internet Psychology Lab http://kahuna.psych.uiuc.edu//ipl A prosopagnosia patient can still experience McGurk effect [Campbell et al., 1986], suggesting that holistic face recognition is affected, but not the by-part A few words on sampling 22

How many samples for a face? Reconstruct One Single Image 16 12 8 Number of all possible 16 12 images = 2 >> number of all possible face images [Baker and Kanade, Hallucinating Faces ] >> 30 60 60 24 365 human history world population Power of prior; adapt from past Some Art Work 12 x 16 LEDs, 8-bit Grayscale [Jim Campbell, Portrait of a Portrait of Harry Nyquist ] 23

More 12 x 16 LEDs, 8-bit Grayscale [Jim Campbell, Portrait of a Portrait of Claude Shannon ] Finally The most compelling shapes are those near to our hearts: people s faces, a gracefully moving body, a natural scene with rustling leaves and flowing water. Evolution has tuned us to these sights. By combining vision and graphics, capturing and creating images of these scenes may soon be within reach. [Lengyel, 1998] 24

Try this [http://www.palmyra.demon.co.uk] Advanced Multimedia Processing Lab Please visit us at: http://amp.ece.cmu.edu 25