Analog VLSI Circuits for Attention-Based, Visual Tracking



Similar documents
WINNER-TAKE-ALL ABSTRACT

A Current-Mode Hysteretic Winner-take-all Network, with Excitatory and Inhibitory Coupling

History of eye-tracking in psychological research

MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL

ISSCC 2003 / SESSION 13 / 40Gb/s COMMUNICATION ICS / PAPER 13.7

THE HUMAN BRAIN. observations and foundations

Visibility optimization for data visualization: A Survey of Issues and Techniques

Low-resolution Image Processing based on FPGA

Abstract. Cycle Domain Simulator for Phase-Locked Loops

Next Generation Artificial Vision Systems

Agent Simulation of Hull s Drive Theory

Levels of Analysis and ACT-R

CHAPTER 6 PRINCIPLES OF NEURAL CIRCUITS.

Analecta Vol. 8, No. 2 ISSN

ON THE K-WINNERS-TAKE-ALL NETWORK

Visual & Auditory Skills Lab

Software & systems for the neuromorphic generation of computing. Peter Suma co-ceo peter.suma@appliedbrainresearch.

TDMS Test & Data Management Software

Sensor-Based Robotic Model for Vehicle Accident Avoidance

PASSENGER/PEDESTRIAN ANALYSIS BY NEUROMORPHIC VISUAL INFORMATION PROCESSING

Video-Based Eye Tracking

Wireless Security Camera

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.7

Sensory-motor control scheme based on Kohonen Maps and AVITE model

The Binding Problem Solutions to the spatial binding problem

Electrical Resonance

Power Reduction Techniques in the SoC Clock Network. Clock Power

CURRICULUM VITAE. Dept. of Mechanical Engineering and Industrial Design Τ.Ε.Ι. of Western Macedonia KOZANI, GREECE

Neural Network Design in Cloud Computing

Test & Data Management Software

The Development of a Pressure-based Typing Biometrics User Authentication System

PRODUCT SHEET.

Accurate Measurement of the Mains Electricity Frequency

Building an Advanced Invariant Real-Time Human Tracking System

A Cognitive Approach to Vision for a Mobile Robot

Assistant Professor Division of Psychology Office: (65) School of Humanities and Social Science Fax: (65)

Psychological Research Springer-Verlag 1989

MPC 4. Machinery Protection Card Type MPC 4 FEATURES. Continuous on-line Machinery Protection Card

RF Etch Monitor. Introduction

Computer Controlled Generating Stations Control and Regulation Simulator, with SCADA SCE

It's Great But Not Necessarily About Attention

A System for Capturing High Resolution Images

Spike-Based Sensing and Processing: What are spikes good for? John G. Harris Electrical and Computer Engineering Dept

Programmable Single-/Dual-/Triple- Tone Gong SAE 800

Learning to classify complex patterns using a VLSI network of spiking neurons

Chapter 9 Latches, Flip-Flops, and Timers

CHARGE pumps are the circuits that used to generate dc

Impact of Reflectors on Solar Energy Systems

A Surveillance Robot with Climbing Capabilities for Home Security

Technical Aspects of Creating and Assessing a Learning Environment in Digital Electronics for High School Students

CHAPTER 11: Flip Flops

FRC WPI Robotics Library Overview

Eye movement trajectories and what they tell us

TRAINING A LIMITED-INTERCONNECT, SYNTHETIC NEURAL IC


Technical Trading Rules as a Prior Knowledge to a Neural Networks Prediction System for the S&P 500 Index

Static Environment Recognition Using Omni-camera from a Moving Vehicle

FSI Machine Vision Training Programs

DISPLAYING SMALL SURFACE FEATURES WITH A FORCE FEEDBACK DEVICE IN A DENTAL TRAINING SIMULATOR

Stock Prediction using Artificial Neural Networks

Chapter 6: From Digital-to-Analog and Back Again

ANN Based Modeling of High Speed IC Interconnects. Q.J. Zhang, Carleton University

Depth and Excluded Courses

A 10,000 Frames/s 0.18 µm CMOS Digital Pixel Sensor with Pixel-Level Memory

Rhea R. Kimpo, PhD. Education. Grants, Fellowships and Awards. Research University of California, San Francisco Ph.D.

A true low voltage class-ab current mirror

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

The Psychonomic Society, Inc. Vanderbilt University, Nashville, Tennessee

Research-Grade Research-Grade. Capture

Electronic Pressure Switch EDS 3000

E27 SPRING 2013 ZUCKER PROJECT 2 PROJECT 2 AUGMENTED REALITY GAMING SYSTEM

A Visualization System and Monitoring Tool to Measure Concurrency in MPICH Programs

MSAN-001 X-Band Microwave Motion Sensor Module Application Note

Room Temperature based Fan Speed Control System using Pulse Width Modulation Technique

System-Level Display Power Reduction Technologies for Portable Computing and Communications Devices

Barcode Based Automated Parking Management System

Design and analysis of flip flops for low power clocking system

A 1-GSPS CMOS Flash A/D Converter for System-on-Chip Applications

2.6. In-Laboratory Session QICii Modelling Module. Modelling Module Description

CELL PHONE INDUCED PERCEPTUAL IMPAIRMENTS DURING SIMULATED DRIVING

ARTIFICIAL NEURAL NETWORKS FOR ADAPTIVE MANAGEMENT TRAFFIC LIGHT OBJECTS AT THE INTERSECTION

Table 1 SDR to DDR Quick Reference

SMART SENSOR COLLECTION

Development of a high-resolution, high-speed vision system using CMOS image sensor technology enhanced by intelligent pixel selection technique

Movement/monitoring 1307/1.0 (6325xx)

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

ISSCC 2006 / SESSION 2 / BIOMEDICAL SYSTEMS / 2.5

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS

The role of context in Müller-Lyer illusion: The case of negative Müller-Lyer illusion

How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm

FPGA Design of Reconfigurable Binary Processor Using VLSI

Chapter 3 SYSTEM SCANNING HARDWARE OVERVIEW

Using the NI 17xx Smart Camera Direct Drive Lighting Controller

Transcription:

Analog VLSI Circuits for Attention-Based, Visual Tracking Timothy K. Horiuchi Computation and Neural Systems California Institute of Technology Pasadena, CA 91125 timmer@klab.caltech.edu Christof Koch Computation and Neural Systems California Institute of Technology Pasadena, CA 91125 Tonia G. Morris lectrical and Computer ngineering Georgia Institute of Technology Atlanta, GA, 3332-25 tmorris@eecom.gatech.edu Stephen P. DeWeerth lectrical and Computer ngineering Georgia Institute of Technology Atlanta, GA, 3332-25 Abstract A one-dimensional visual tracking chip has been implemented using neuromorphic, analog VLSI techniques to model selective visual attention in the control of saccadic and smooth pursuit eye movements. The chip incorporates focal-plane processing to compute image saliency and a winner-take-all circuit to select a feature for tracking. The target position and direction of motion are reported as the target moves across the array. We demonstrate its functionality in a closed-loop system which performs saccadic and smooth pursuit tracking movements using a one-dimensional mechanical eye. 1 Introduction Tracking a moving object on a cluttered background is a difficult task. When more than one target is in the field of view, a decision must be made to determine which target to track and what its movement characteristics are. If motion information is being computed in parallel across the visual field, as is believed to occur in the middle temporal area (MT) of primates, some mechanism must exist to preferentially extract the activity of the neurons associated with the target at the appropriate

Analog VLSI Circuits for Attention-Based, VISual Tracking 77 Photoreceptors Temporal Spatial Direction of Motion Hysteretic winner-take-all ST Target -...----...----...--- Position ---~~---~----~-. Sa~ade Trigger Figure 1: System Block Diagram: P = adaptive photoreceptor circuit, TD = temporal derivative circuit, SD = spatial derivative, DM = direction of motion, HYS WTA = hysteretic winner-take-all, P2V = position to voltage, ST = saccade trigger. The TD and SD are summed to form the saliency map from which the WTA finds the maximum. The output of the WTA steers the direction-of-motion information onto a common output line. Saccades are triggered when the selected pixel is outside a specified window located at the center of the array. time. Selective visual attention is believed to be this mechanism. In recent years, many studies have indicated that selective visual attention is involved in the generation of saccadic [1] [7] [12] [15] and smooth pursuit eye movements [9] [6] [16]. These studies have shown that attentional enhancement occurs at the target location just before a saccade as well as at the target location during smooth pursuit. In the case of saccades, attempts to dissociate attention from the target location has been shown to disrupt the accuracy or latency. Koch and Ullman [11] have proposed a model for attentional selection based on the formation of a saliency map by combining the activity of elementary feature maps in a topographic manner. The most salient locations are where activity from many different feature maps coincide or at locations where activity from a preferentiallyweighted feature map, such as temporal change, occurs. A winner-take-all (WTA) mechanism, acting as the center of the attentional "spotlight," selects the location with the highest saliency. Previous work on analog VLSI-based, neuromorphic, hardware simulation of visual tracking include a one-dimensional, saccadic eye movement system triggered by temporal change [8] and a two-dimensional, smooth pursuit system driven by visual motion detectors [5]. Neither system has a mechanism for figure-ground discrimination of the target. In addition to this overt form of attentional shifting, covert

78 T. Horiuchi, T. G. Morris, C. Koch and S. P. DeWeerth g.t::. C. 1.5 V > ~ 6 C/) CD t: ::J f- CD t: ::J,t it ~ F? ' "~...-r,..,,..,..,. 4 7 1 13 16 19 22 Pixel Position Vphoto I Spatial I I Temporal I Direction of Motion Figure 2: xample stimulus - Traces from top to bottom: Photoreceptor voltage, absolute value of the spatial derivative, absolute value of the temporal derivative, and direction-of-motion. The stimulus is a high-contrast, expanding bar, which provides two edges moving in opposite directions. The signed, temporal and spatial derivative signals are used to compute the direction-of-motion shown in the bottom trace. attentional shifts have been modeled using analog VLSI circuits [4] [14], based on the Koch and Ullman model. These circuits demonstrate the use of delayed, transient inhibition at the selected location to model covert attentional scanning. In this paper we describe an analog VLSI implementation of an attention-based, visual tracking architecture which combines much of this previous work. Using a hardware model of the primate oculomotor system [8], we then demonstrate the use of the tracking chip for both saccadic and smooth pursuit eye movements. 2 System Description The computational goal of this chip is the selection of a target, based on a given measure of saliency, and the extraction of its retinal position and direction of motion. Figure 1 shows a block diagram of the computation. The first few stages of processing compute simple feature maps which drive the WTA-based selection of a target to track. The circuits at the selected location signal their position and the computed direction-of-motion. This information is used by an external saccadic and smooth pursuit eye movement system to drive the eye. The saccadic system uses the position information to foveate the target and the smooth pursuit system uses the motion information to match the speed of the target. Adaptive photoreceptors [2] (at the top of Figure 1) transduce the incoming pattern of light into an array of voltages. The temporal (TD) and spatial (SD) derivatives are computed from these voltages and are used to generate the saliency map and direction of motion. Figure 2 shows an example stimulus and the computed features. The saliency map is formed by summing the absolute-value of each derivative (ITDI + ISD I) and the direction-of-motion (DM) signal is a normalized product of the two

Analog VLSI Circuits for Attention-Based, VISual Tracking 79 3.5 3. 2.5 2. 1.5 Target Position 1.+---r-~---+---r--~--+---r-~---+---r--~ -.1..1.2.3.4.5.6.7.8.9 1. Time (seconds) Figure 3: xtracting the target's direction of motion: The WTA output voltage is used to switch the DM current onto a common current sensing line. The output of this signal is seen in the top trace. The zero-motion level is indicated by the flat line shown at 2.9 volts. The lower trace shows the target's position from the position-to-voltage encoding circuits. The target's position and direction of motion are used to drive saccades and smooth pursuit eye movements during tracking. d TD SD erlvatives. ITDI+lsDI In the saliency map, the temporal and spatial derivatives can be differentially weighted to emphasize moving targets over stationary targets. The saliency map provides the input to a winner-take-all (WTA) computation which finds the maximum in this map. Spatially-distributed hysteresis is incorporated in this winnertake-all computation [4] by adding a fixed current to the winner's input node and its neighbors. This distributed hysteresis is motivated by the following two ideas: 1) once a target has been selected it should continue to be tracked even if another equally interesting target comes along, and 2) targets will typically move continuously across the array. Hysteresis reduces oscillation of the winning status in the case where two or more inputs are very close to the winning input level and the local distribution of hysteresis allows the winning status to freely shift to neighboring pixels rather than to another location further away. The WTA output signal is used to drive three different circuits: the position-tovoltage (P2V) circuit [3], the DM-current-steering circuit (see Figure 3), and the saccadic triggering (ST) circuit. The only circuits that are active are those at the winning pixel locations. The P2V circuit drives the common position output line to a voltage representing it's position in the array, the DM-steering circuit puts the local DM circuit's current onto the common motion output line, and the ST circuit drives a position-specific current onto a common line to be compared against an externally-set threshold value. By creating a "V" shaped profile of ST currents centered on the array, winning pixels away from the center will exceed the threshold

71 T. Horiuch~ T. G. Mo"is, C. Koch and S. P. DeWeerth "2 o 2.7 (/) o Q. a; 2.6.~ Q. -; 2.5 N ' > 2.4 2.1+----+----+---~----~--_4----~--~~--~ o 5 1 15 2 25 3 35 4 Time (msec) Figure 4: Position vs. time traces for the passage of a strong edge across the array at five different speeds. The speeds shown correspond to 327, 548, 142, 1578,2294 pixels/sec. and send saccade requests off-chip. Figure 3 shows the DM and P2V outputs for an oscillating target. To test the speed of the tracking circuit, a single edge was passed in front of the array at varying speeds. Figure 4 shows some of these results. The power consumption of the chip (23 pixels and support circuits, not including the pads) varies between.35 m Wand.6 m W at a supply voltage of 5 volts. This measurement was taken with no clock signal driving the scanners since this is not essential to the operation of the circuit. 3 System Integration The tracking chip has been mounted on a neuromorphic, hardware model of the primate oculomotor system [8] and is being used to track moving visual targets. The visual target is mounted to a swinging apparatus to generate an oscillating motion. Figure 5 shows the behavior of the system when the retinal target position is used to drive re-centering saccades and the target direction of motion is used drive smooth pursuit. Saccades are triggered when the selected pixel is outside a specified window centered on the array and the input to the smooth pursuit system is suppressed during saccades. The smooth pursuit system mathematically integrates retinal motion to match the eye velocity to the target velocity. 4 Acknowledgements T. H. is supported by an Office of Naval Research AASRT grant and by the NSF Center for Neuromorphic Systems ngineering at Caltech. T. M. is supported by the Georgia Tech Research Institute.

Analog VLSI Circuits for Attention-Based, VISual Tracking 711 4 Analog VLSI Human Subject ~ --~--~~~ --~--~~oo 2 From..5 1. 1.5 2. 2.5 3. 3.S Time (seconds) Collewijn and Tamminga, 1984 Figure 5: Saccades and Smooth Pursuit: In this example, a swinging target is tracked over a few cycles. Re-centering saccades are triggered when the target leaves a specified window centered on the array. For comparison, on the right, we show human data for the same task [1]. References [1] H. Collewijn and. Tamminga, "Human smooth and saccadic eye movements during voluntary pursuit of different target motions on different backgrounds" J. Physiol., VoL 351, pp. 217-25. (1984) [2] T. Delbriick, Ph.D. Thesis, Computation and Neural Systems Program California Institute of Technology (1993) [3] S. P. DeWeerth, "Analog VLSI Circuits for Stimulus Localization and Centroid Computation" Inti. 1. Compo Vis. 8(3), pp. 191-22. (1992) [4] S. P. DeWeerth and T. G. Morris, "CMOS Current Mode Winner-Take-All with Distributed Hysteresis" lectronics Letters, Vol. 31, No. 13, pp. 151-153. (1995) [5] R. tienne-cummings, J. Van der Spiegel, and P. Mueller "A Visual Smooth Pursuit Tracking Chip" Advances in Neural Information Processing Systems 8 (1996) [6] V. Ferrara and S. Lisberger, "Attention and Target Selection for Smooth Pursuit ye Movements" J. Neurosci., 15(11), pp. 7472-7484, (1995) [7] J. Hoffman and B. Subramaniam, "The Role of Visual Attention in Saccadic ye Movements" Perception and Psychophysics, 57(6), pp. 787-795, (1995) [8] T. Horiuchi, B. Bishofberger, and C. Koch, "An Analog VLSI Saccadic System" Advances in Neural Information Processing Systems 6, Morgan Kaufmann, pp. 582-589, (1994) [9] B. Khurana, and. Kowler, "Shared Attentional Control of Smooth ye Movement and Perception" Vision Research, 27(9), pp. 163-1618, (1987)

712 T. Horiuchi, T. G. Morris, C. Koch and S. P. DeWeerth 1 Analog VLSI Monkey 5 Ci) Cl Q) -5 :s Q) C> c: «-1-15 Target Position... ye Position 1 msec saccadic delay added -2 -I---+--+---+-----1--+--_+_---.,r-----+---+----i 2.8 2.9 3. 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 nme (seconds) From Lisberger et ai., 1987 Figure 6: Step-Ramp xperiment: In this experiment, the target jumps from the fixation point to a new location and begins moving with constant velocity. On the left, the analog VLSI system tracks the target. For comparison, on the right, we show data from a monkey performing the same task [13]. [1]. Kowler,. Anderson, B. Dosher,. Blaser, "The Role of Attention in the Programming of Saccades" Vision Research, 35(13), pp. 1897-1916, (1995) [11] C. Koch and S. Ullman, "Shifts in selective visual attention: towards the underlying neural circuitry" Human Neurobiology, 4:219-227, (1985) [12] R. Rafal, P. Calabresi, C. Brennan, and T. Scioltio, "Saccade Preparation Inhibits Reorienting to Recently Attended Locations" 1. xp. Psych: Hum. Percep. and Perf., 15, pp. 673-685, (1989) [13] S. G. Lisberger,. J. Morris, and L. Tychsen, "Visual motion processing and sensory-motor integration for smooth pursuit eye movements." In Ann. Rev. Neurosci., Cowan et al., editors. Vol. 1, pp. 97-129, (1987) [14] T. G. Morris and S. P. DeWeerth, "Analog VLSI Circuits for Covert Attentional Shifts" Proc. 5th Inti. Conf. on Microelectronics for Neural Networks and Fuzzy Systems - MicroNeur96, Feb 12-14, 1996. Lausanne, Switzerland, I Computer Society Press, Los Alamitos, CA, pp. 3-37, (1996) [15] S. Shimojo, Y. Tanaka, O. Hikosaka, and S. Miyauchi, "Vision, Attention, and Action - inhibition and facilitation in sensory motor links revealed by the reaction time and the line-motion." In Attention and Performance XVI, T. Inui & J. L. McClelland, editors. MIT Press, (1995) [16] W. J. Tam and H. Ono, "Fixation Disengagement and ye-movement Latency" Perception and Psychophysics, 56(3) pp. 251-26, (1994)