Active Audition and Sensorimotor Integration for Sound Source Localization



Similar documents
What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing

Tonal Detection in Noise: An Auditory Neuroscience Insight

Lecture 2, Human cognition

MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL

Lecture 4: Jan 12, 2005

Room Acoustic Reproduction by Spatial Room Response

A Sound Localization Algorithm for Use in Unmanned Vehicles

Vision: Receptors. Modes of Perception. Vision: Summary 9/28/2012. How do we perceive our environment? Sensation and Perception Terminology

The Design and Implementation of Multimedia Software

Type-D EEG System for Regular EEG Clinic

Hearing and Deafness 1. Anatomy & physiology

THE problem of tracking multiple moving targets arises

A mixed structural modeling approach to personalized binaural sound rendering

Function (& other notes)

OAE hearing screening at your fingertips OAE+

Learners Who are Deaf or Hard of Hearing Kalie Carlisle, Lauren Nash, and Allison Gallahan

Development of sound localization

Analecta Vol. 8, No. 2 ISSN

Analog Representations of Sound

PURE TONE AUDIOMETRY Andrew P. McGrath, AuD

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

Sound Perception. Sensitivity to Sound. Sensitivity to Sound 1/9/11. Not physically sensitive to all possible sound frequencies Range

Speech-Language Pathology Curriculum Foundation Course Linkages

Glossary of commonly used Occupational Therapy terms

Summary Table Voluntary Product Accessibility Template

Il est repris ci-dessous sans aucune complétude - quelques éléments de cet article, dont il est fait des citations (texte entre guillemets).

Summary Table Voluntary Product Accessibility Template

CHAPTER 6 PRINCIPLES OF NEURAL CIRCUITS.

Nature of the Sound Stimulus. Sound is the rhythmic compression and decompression of the air around us caused by a vibrating object.

Artificial Neural Network for Speech Recognition

Accessibility-MiVoice-Business.docx Page 1

Anatomy and Physiology of Hearing (added 09/06)

Chapter 14: The Cutaneous Senses

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

Colorado School of Mines Computer Vision Professor William Hoff

Self Organizing Maps: Fundamentals

Development of a distributed recommender system using the Hadoop Framework

So, how do we hear? outer middle ear inner ear

Structural Health Monitoring Tools (SHMTools)

Introduction to Robotics Analysis, Systems, Applications

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN AUDIOLOGY (MSc[Audiology])

AP Psychology ~ Ms. Justice

Information visualization examples

A Human Factors Approach to understanding Slip, Trip and Fall. Change in Elevation

Using artificial intelligence for data reduction in mechanical engineering

Time-domain Beamforming using 3D-microphone arrays

Emotion Recognition Using Blue Eyes Technology

How To Get A Computer Engineering Degree

Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승

Problem-Based Group Activities for a Sensation & Perception Course. David S. Kreiner. University of Central Missouri

Nikki White Children s Occupational Therapist Barnet Community Services

Your Hearing ILLUMINATED

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

Cirrus 0.2T. MRI for Everyone. North America, Asia, Europe. contact:

AUDIO POINTER V2 JOYSTICK & CAMERA IN MATLAB

Convention Paper Presented at the 112th Convention 2002 May Munich, Germany

Appendices master s degree programme Artificial Intelligence

ISO xx STEP. Sommaire. étendue. STandard for the Exchange of Product model data. Hervé Panetto CRAN nancy.

Practice Standards for Hearing Service Providers

The Scientific Data Mining Process

Canalis. CANALIS Principles and Techniques of Speaker Placement

SPATIAL IMPULSE RESPONSE RENDERING: A TOOL FOR REPRODUCING ROOM ACOUSTICS FOR MULTI-CHANNEL LISTENING

THREE-DIMENSIONAL (3-D) sound is becoming increasingly

ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION

CHAPTER 5 PREDICTIVE MODELING STUDIES TO DETERMINE THE CONVEYING VELOCITY OF PARTS ON VIBRATORY FEEDER

Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features

Voltage. Oscillator. Voltage. Oscillator

Multisensor Data Fusion and Applications

Summary Table Voluntary Product Accessibility Template

DIGITAL IMAGE PROCESSING AND ANALYSIS

Summary Table Voluntary Product Accessibility Template. Not Applicable- Not a web-based application. Supports. Not Applicable

The Visualization Pipeline

The Role of the Efferent System in Auditory Performance in Background Noise

Aircraft cabin noise synthesis for noise subjective analysis

HEARING SCREENING (May 2006)

Summary Table Voluntary Product Accessibility Template. Criteria Supporting Features Remarks and explanations

VPAT Voluntary Product Accessibility Template

Microcontrollers, Actuators and Sensors in Mobile Robots

Bachelorarbeit. Sylvia Sima HRTF Measurements and Filter Design for a Headphone-Based 3D-Audio System

Welcome to the United States Patent and TradeMark Office

Diagnostic Tests for Vestibular Problems By the Vestibular Disorders Association

Convention Paper Presented at the 133rd Convention 2012 October San Francisco, USA

PCL - SURFACE RECONSTRUCTION

Light wear for a powerful hearing. Bone Conduction Headset

QAM Demodulation. Performance Conclusion. o o o o o. (Nyquist shaping, Clock & Carrier Recovery, AGC, Adaptive Equaliser) o o. Wireless Communications

Recent advances in Digital Music Processing and Indexing

T Automatic Speech Recognition: From Theory to Practice

Expanding Performance Leadership in Cochlear Implants. Hansjuerg Emch President, Advanced Bionics AG GVP, Sonova Medical

3D sound in the telepresence project BEAMING Olesen, Søren Krarup; Markovic, Milos; Madsen, Esben; Hoffmann, Pablo Francisco F.; Hammershøi, Dorte

5th Congress of Alps-Adria Acoustics Association NOISE-INDUCED HEARING LOSS

Transcription:

Active Audition and Sensorimotor Integration for Sound Source Localization Mathieu Bernard 25 novembre 2011 1/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Introduction CIFRE thesis. Co-direction : Patrick Pirim, Brain Vision Systems Bruno Gas, ISIR, UPMC Alain de Cheveigné, LPP, Paris Descartes Outline Artificial auditory system for sound localization, Audio-tactile model for texture recognition, Active audition and sensorimotor integration. 2/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Autonomous robotics Psikharpax, the robot rat 3/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Artificial auditory system Bioinspired sound localization Outer ear Outer ear Inner ear Inner ear Binaural localiation cues ITD - ILD Auditory model Implementation robot sound capture core library real time - c++ standalone programs wav files robotic simulation simulated sound sources 4/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Outer ear model Pinna Auditory fovea Microphone Oriented toward the fovea Support Servomotor Outer ear = Pinna, microphone and software capture. Spectral and directional cues. 5/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Inner ear model (1/2) Inner ear = cochlear model and pulse train generation. Cochlear model Gammatone filterbank Usually 30 channels from 300Hz to 8kHz at 20kHz 6/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Inner ear model (2/2) Inner ear = cochlear model and pulse train generation. Pulse train generation Cochlear channel output Pulse train Discrete representation, noise suppression. A pulse = temporal and amplitude information. 7/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

ILD computation (1/2) Energy for each pulse p(t), the energy is : E(t) = u=t u=t T p(u) 2 (1) 8/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

ILD computation (2/2) ILD = Interaural Level Difference For each channel i : ILD i (t) = 2E left,i (t) E left,i (t)+e right,i (t) 1 9/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

ITD computation (1/3) ITD = Onset extraction and delay lines. Onset extraction from a pulse train Comparison with a dynamic threashold, 2 parameters. 10/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

ITD computation (2/3) Delay lines model ITD = Onset extraction and delay lines. 11/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

ITD computation (3/3) ITD = Onset extraction and delay lines. 12/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Auditory and tactile model for texture perception (1/3) Similar model for transduction and processing Audition and touch support fine texture discrimination skills. Rat vibrissae and cochlea transduction based on resonance. Strong interaction for auditory/tactile spectral processing. 13/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Auditory and tactile model for texture perception (2/3) Whiskers filterbank adapted from gammatone cochlear model : Feature extraction = Instantaneous Mean Power : 14/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Auditory and tactile model for texture perception (3/3) Texture classification : 8 textures, 3 layer perceptron Response to a pure tone (whisker at 630 Hz) Influence of the number of channels 15/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Auditory Evoked Behaviors (1/2) Outer ear Inner ear Energy ILD Outer ear Inner ear Energy Auditory model Motor control Neck and body Motor control Neck ILD minimization ILD < 0 turn right, ILD > 0 turn left. Video! Wheels Follow neck orientation Constant speed, Smooth rotations. Video! 16/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Auditory Evoked Behaviors (2/2) Phonotaxis trajectories. Front-back disambiguation. 17/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Sensorimotor Approach (1/3) Environment state e E and motor state m M, Sensory state s S, we have s = Φ(m, e), Φ is called a sensorimotor law. 18/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Sensorimotor Approach (2/3) Quand on dit [...] que nous localisons tel objet en tel point de l espace [...] cela signifie simplement que nous nous représentons les mouvements qu il faut faire pour atteindre cet objet. [...] Nous nous représentons les sensations musculaires qui accompagnent [ces mouvements] et qui n ont aucun caractère géométrique, qui par conséquent n impliquent nullement la préexistance de la notion d espace. H. Poincaré, L espace et la géométrie, 1845. Proposed formalization for localization Find the motor state m such as m = argmin Φ(m, e) Φ(m 0, e 0 ). m M 19/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Sensorimotor Approach (3/3) Sound source localization Two ways for m estimation Orienting behavior After completion we have m = m end s end = Φ(m end, e) = Φ(m 0 + δm, e 0 + δe), Manifold learning Sensory space S lies on a low-dim manifold R, Dimension reduction technique : S R, Same topology as the embodying space (at least locally). 20/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Manifold learning (1/3) CAMIL database, ILD vectors : 21/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Manifold learning (2/3) CAMIL database, ITD vectors : d=690 d=3 22/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Manifold learning (3/3) 2 outer ears models : HRTF and directive filters Front-back disambiguation with spectral cues : 23/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Sensorimotor integration (1/2) Learning algorithm Iterative learning of R, Self-supervised. 24/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Sensorimotor integration (2/2) Simulation results over 400 experiments 25/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Intrinsic Dimension Estimation Simulation (ILD, azimtuh 180 & 360) CAMIL dataset (ILD & ITD, azimtuh 360, elevation [-30, 30]) 26/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration

Perspectives SensoriMOTOR 27/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration