Multi-Modal Acoustic Echo Canceller for Video Conferencing Systems
|
|
|
- Sylvia French
- 10 years ago
- Views:
Transcription
1 Multi-Modal Acoustic Echo Canceller for Video Conferencing Systems Mario Gazziro,Guilherme Almeida,Paulo Matias, Hirokazu Tanaka and Shigenobu Minami ICMC/USP, Brazil Wernher von Braun Center, Brazil IFSC/USP, Brazil Toshiba Semiconductors, Japan Abstract In video conferencing, if two people talk at the same time howling noises can appear due to failures in the echo cancellation system. In order to reduce these noise situations, a novel AEC (Acoustic Echo Cancellation) system is proposed by using audio and video information. Through image processing techniques we have developed a new DTD (Double Talk Detector) based on video information (video-dtd). A novel Acoustic Echo Cancellation system was successfully developed, integrating audio and video information. In simulated results using data from a real scenary the misalignment of the new AEC (with video-dtd) was smaller than a traditional AEC. Index Terms Acoustic Echo Cancellation, Double Talk Detection, Multi-Modal, Video Conference, Image Processing. Far-end Talker Open during double talk Echo Double Talk Detector Echo Path I. INTRODUCTION With the advent of internet, communication has become easier in such a way that people can hold a video conference session at their own houses. But if two people talk at the same time, howling noises can appear due to failures in the echo cancelation system [3]. In order to reduce these noise situations a novel AEC (Acoustic Echo Canceller) system is proposed by using audio and video information about the user. Murata et al [1] attempted to integrate sound and image before solving echo cancellation problems, but using only binarized lip image processing [2] and performing a low integration with the AEC. The implementation of an high integrated system is the objective of this paper. e(k) x(k) Fig. 1. Fig. 2. Acoustic echo canceller structure. G(k) Comp ADJ Minami DTD algorithm block diagram. Near-end Talker Inhibition II. ACOUSTIC ECHO CANCELLER In acoustic echo cancellation, a measured microphone signal contains two signals: the Near-end speech signal and the Farend echoed speech signal. The goal is to remove the Far-end echoed speech signal from the microphone signal so that only the Near-end speech signal is transmitted. The block diagram of an AEC is showed in Figure 1. In a full-duplex conversation, the presence of signals other than the Far-end signal convolves with the echo path, thus inhibiting the ability of the adaptive algorithm to model the system. The presence of the Near-end talker during the Far-end speech is a source of disruption in the adaptation of the filter. Therefore, adaptation of the filter must be prevented during this double talk scenario via a double talk detector (DTD). A. Double Talk Detector There are many approaches to determining when a double talk scenario occurs, but all of them follow the same general procedure, which says that a detection statistic, η, can be formulated from the excitation, desired, and/or error signals. Then this detection statistic is compared to a threshold, in order to determine if a double talk can be declared. The classical implementation of DTD uses Geigel [4] or NCR [5] (Normalized Cross Correlation) algorithms. But in this paper we will focus on an algorithm proposed by [6]. With this algorithm we can perform the comparison between the Far-end signal and the output. A block diagram is showed Figure 2. The output from the e(k) and the Far-end
2 y(k) A B Comparator O B>A A>B O=B O=A y(k-1) Gain Delay Fig. 3. Details of (Level Detector) block in Minami DTD. Fig. 5. Simulated results using WGN samples with a small DTD duration. Fig. 4. Level detector results with voice sample. Signal x(k) pass through a level detector, which converts the signal as if it was an envelope detector. The block diagram of this level detector can be seen in Figure 3. It compares the current sample of the signal with the previous one with a small delay and a small degradation. If the current sample is higher than the previous one, the output will be the current sample. Otherwise, it will be the previous one. This block was implemented with a degradation of.999 and it was tested with an audio sample, resulting in the plots from Figure 4. After both signals have passed through this level detector, they follow to a comparator, which will decide if there is a double talk situation through the Equation 1. L e (k) > L x (k) + G(k) (1) Where: L e (k) = Adaptive filter output signal after level detector L x (k) = Far-End signal after passing through level detector G(k) = Detection Gain If this condition is satisfied, it means a double talk situation was detected and G(k) is the value which determines accuracy of this algorithm. Considering a range of variation from to 4dB, the gain G(k) varies because of the block adjustment Fig. 6. Simulated results using voice samples with many DTD situations. (ADJ) and according to the detection results previously obtained. In the case of a double talk situation, G(k) is increased at the rate of equation 2, considering a maximum double talk duration of 3 seconds. δg(k) = 4dB/3s = dB/s (2) In the case of a non-double talk situation, G(k) is decreased at the rate of equation 3, considering that 1s is sufficient time for the filter to adapt. δg(k) = 4dB/1s = 4dB/s (3) After being implemented, this algorithm was simulated by using White Gaussian Noise (WGN) as input for both Far- End and Near-end talkers, simulating 3 seconds of double talk situation. The results can be seen in Figure 5 and the simulation using audio samples with several double talk situations are represented in Figure 6.
3 ADJ Far-end Talker G -> Gain e(k) > x(k) + G? Filter NO YES DTD OFF Echo Path DTD ON Turn DTD OFF Decrease G Echo Open during double talk NO Turn DTD ON Increase G Video_DTD? DTD ON YES Turn DTD ON G = Lower Limit Video-DTD Double Talk Detector Fig. 9. Flowchart of DTD adjustment (ADJ) block. Near-end Talker FACE DETECTION Fig. 7. e(k) x(k) MOUTH EXTRACTION FACE DETECTION LIP MOVEMENT LIP MOVEMENT FAR END Advanced echo canceller with Video-DTD. MOUTH EXTRACTION Comp Inhibition NEAR END G(k) TALKING yes DTD yes TALKING Video-DTD ADJ Fig. 8. Video DTD Fig. 1. Video DTD system flowchart. Novel double talk detection with Video-DTD integration. III. P ROPOSED A DVANCED AEC In this paper a novel type of AEC has been proposed, based on the double talk detector algorithm proposed by Minami and using video information to enhance the DTD performance. The integration of this novel DTD into the AEC structure is depicted in Figure 7. In this new proposal we improve the original Minami DTD with changes in the adjustment block, which now should modify the detection gain G(k) by using a new piece of information: video double talk detection (Video-DTD). This changes are depicted in Figure 8. Figure 9 presents the flowchart of the adjustment block. The DTD is the output of this block. The Gain and DTD condition are set based on Filter output and Far-end signals. If a VideoDTD signal is received, the DTD is turned ON and the Gain is set at the lower limit. IV. E XPERIMENTAL E VALUATION OF THE N OVEL AEC In this section we present the results of our novel AEC, using Video-DTD. A video was recorded from a previous established scenery, and the image frames and stereo audio were extracted from the camera. The original video was recorded in Full-HD (192x188) with an interlaced scan video camera. It has 3 frames per second originally, but it was resampled to 15 frames per second in order to produce a de-interlaced video (progressive). The images are then reduced about 5% (96x544 pixels) in order All Faces Initially Detected Cluster Analysis Fig. 11. Wrong Detections Elimination Mean Value of the correct Detections Face detection stages. to reduce the simulation time needed. In this video there are sequences of Silent, Opening, Closing and Roll mouth periods. Roll mouth are sequenced frames with a slight displacement of the whole face, but without producing any sound. In a multi-modal approach we wanted to use video and audio information mixed in a novel technique to evaluate a double talk situation. Figure 1 shows the proposed video DTD system flowchart. For both sides (Near and Far end) we have three stages: Face Detection, Mouth Extraction and Lip Movement. On each side we determine if the speakers are talking or not. When two speakers are talking at the same time, a double-talking condition is detected by the video system and then informed to the novel DTD algorithm. A. Face Detection To perform face detection in a robust and reliable way, we combine the recent Viola-Jones algorithm [8] with a classical cluster analysis method called K-Means [9]. Figure 11 shows the stages in the face detection flowchart. After applying the Viola-Jones results into a K-Means
4 2 cluster 1 cluster 2 cluster (a) (b) Fig. 12. Cluster being sort out using the K-Means algorithm. X and Y axes: bottom-left coordinates of square frames; Z axis: size of the square frame. (c) (d) Fig. 14. Mouth situations: (a) Silent, (b) Opening, (c) Closing and (d) Roll using Lucas-Kanade algorithm. Mouth Region of Interest (a) (b) Bottom Half of Face Detection Frame Fig. 13. Mouth extraction. cluster algorithm, the frames detected by Viola-Jones will be sorted in different clusters. This is an unsupervised analysis and it is capable of classifying as many clusters as those initially found. Figure 12 shows the K-Means output for the frames in the first picture of Figure 11. Three clusters were classified. The cluster number one (green/cross) contains 2 frames, cluster number two (red/star) contains 31 frames and cluster number three (blue/circle) contains only 1 frame. The second picture of Figure 11 depicts the frames classified according to the cluster analysis (red, green and blue). Afterwards, only the frames from the small clusters are removed, which is show in the third picture of Figure 11. Finally, a mean value calculation is made in order to detect the main face. B. Mouth Extraction The mouth area search is the simplest technique in all proposed Video-DTD. We only compute the bottom half of the face frame detected, as shown in Figure 13. C. Lip Movement In order to detect the mouth movement, we will adopt the approach developed by Tamura et al [2] using optical flow analysis. The optical flow vectors (horizontal and vertical) are added, which generates an increase of their intensity when the mouth is opening, as well as a decrease when the mouth is closing. The optical flow was computed only in the Region of Interest, around the mouth, acquired according to the descriptions (c) Fig. 15. Mouth situations: (a) Silent, (b) Opening, (c) Closing and (d) Roll using Horn & Schunck algorithm. in the section IV-B. We use three algorithms to compute the optical flow: Lucas-Kanade (LK) [1], Horn & Schunck (HS) [11] and Brox (BROX) [7]. Each one of the four situations (Silent, Opening, Closing and Roll) to be classified was tested with these four algorithms, and the results are showed in Figures 14, 15 and 16. Figure 17 shows the optical flow variance for each mouth condition using all algorithms. In this figure we see that it is not possible to use variance only as a classification factor. However, the variance of the Silent condition was close to zero (as presented by Tamura); the variance of the Roll condition is higher, even bigger than the Opening or Closing situations (for most algorithms). An extension to Tamura s proposal was performed here, in order to also classify periods of face rolling. When the face is inclined a little (clockwise or counter-clockwise), the optical flow suffers small increases or decreases as a whole. An analysis of maximum and minimum peaks above the mean was performed. Figure 18 depicts the final comparison between results for three algorithms. If a threshold was established in order to separate which algorithm correctly classifies each situation, (d)
5 (a) (b) (c) (d) Fig. 16. Mouth situations: (a) Silent, (b) Opening, (c) Closing and (d) Roll using Brox algorithm. 3 Fig. 19. DTD result using Video-DTD with WGN. 2,5 2 1,5 1 Silent Opening Closing Roll 1.8,5 BROX HS LK.6 Fig. 17. Variance analysis of the optical flow algorithms in the four situations TALKING BROX 3 HS LK 2 Threshold Fig. 2. Misaligment of White Gaussian Noise simulation. 1 SILENT Silent Opening Closing Roll V. RESULTS Fig. 18. Peak above mean using optical flow algorithms. we can see that only Horn & Schunck (HS) is wrong. Silent and Roll are non-talking conditions, indicated in the region below the threshold (SILENT). Opening and Closing are talking conditions, indicated in the region above the threshold. With this in mind one can see that Horn & Schunck algorithm is not suitable for detecting talking conditions. There is no significant difference between the Silent and Opening mouth conditions. Brox and Lucas-Kanade are suitable for detecting talking conditions. Brox is faster than LK in simulations, but LK is more suitable to speed up when implemented in hardware. Brox provides a better classification for Silent conditions than LK. As described in detail in section II-A, a White Gaussian Noise analysis is the first step in order to test the new Echo Cancellation System. Figure 19 shows the behavior of Gain and DTD output when DTD-Video signal is set. The small delay between the start of DTD situation and the application of DTD-Video signal is related with the inherent delay of the image processing. Results using real voice data are depicted in the Figures 21 and 22. The main result of the system can be seen in Figure 22. The misalignment of the novel AEC, using the Video-DTD is represented in the red (thin) line. The same audio samples and room impulse response were used in a traditional AEC using NCR-DTD algorithm and is represented by the blue (thicker) line. The novel DTD is slightly better than audio-only DTD.
6 Fig DTD result using Video-DTD with voice data. [3] Ahgren, P.; Jakobsson, A.; A study of double-talk detection performance in the presence of acoustic echo path changes, Acoustics, Speech, and Signal Processing, 25. Proceedings. (ICASSP 5). IEEE International Conference on, vol.3, no., pp. iii/141- iii/144 Vol. 3, March 25 [4] Duttweiler, D.; A Twelve-Channel Digital Echo Canceler, Communications, IEEE Transactions on, vol.26, no.5, pp , May 1978 [5] Benesty, J.; Morgan, D.R.; Cho, J.H.; A new class of doubletalk detectors based on cross-correlation, Speech and Audio Processing, IEEE Transactions on, vol.8, no.2, pp , Mar 2 [6] Minami, S.; Kawasaki, T.; A Double Talk Detection Method for an Echo Canceller, IEEE International Communications Conference (ICC 85), pp , 1985 [7] Brox, T; Bruhn, A; Papenberg, N.; Weickert, J. High Accuracy Optical Flow Estimation Based on a Theory for Warping, Proceedings 8th European Conference on Computer Vision, Springer LNCS 324, T. Pajdla and J. Matas (Eds.) (4), 25-36, Prague, Czech Republic, 24 [8] Viola, P.; Jones, M.; Robust real-time object detection International Journal of Computer Vision 57(2), , 24 [9] MacQueen, J.; Some methods for classification and analysis of multivariate observations, in Proc. 5th Berkeley Symp. Math. Stat. Probab., L. M. L. Cam and J. Neyman, Eds. Berkeley, CA: Univ. California Press, 1967 [1] Lucas, B.D.; Kanade, T.; An Interative Image Registration Technique with an Application to Stereo Vision, DARPA Image Understanding Workshop, pp , 1981 [11] Horn, B.K.P.; Schunck, B.G.; Determining Optical Flow, Artificial Intelligent, vol 9, pp , Normalized Output Samples Fig. 22. Misaligment produced by Video-DTD (red/thin) slightly better than NCR-DTD (blue/thicker). VI. CONCLUSION A novel Acoustic Echo Cancellation system was successfully developed, integrating audio and video information. For video-dtd analysis we verify that the relation peak above the mean is more indicated due to the rolling face problem. The performance of this new system is slightly better than a traditional audio-only one. Using a DTD based on NCR has indicated that the video information applied in a double talk detection could improve the performance of the Acoustic Echo Cancellation system. REFERENCES [1] Murata, N.; Kajikawa, Y.; A Double-Talk-Detector Integrating Sound and Image Information, Proceedings of IEICE SIP 18, , 29 [2] Tamura, S., Iwano, K.; Furui, S., Multi-Modal Speech Recognition Using Optical-Flow Analysis for Lip Images, Journal of VLSI Signal Processing 36, pp , Feb 24
Figure1. Acoustic feedback in packet based video conferencing system
Real-Time Howling Detection for Hands-Free Video Conferencing System Mi Suk Lee and Do Young Kim Future Internet Research Department ETRI, Daejeon, Korea {lms, dyk}@etri.re.kr Abstract: This paper presents
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
ADAPTIVE ALGORITHMS FOR ACOUSTIC ECHO CANCELLATION IN SPEECH PROCESSING
www.arpapress.com/volumes/vol7issue1/ijrras_7_1_05.pdf ADAPTIVE ALGORITHMS FOR ACOUSTIC ECHO CANCELLATION IN SPEECH PROCESSING 1,* Radhika Chinaboina, 1 D.S.Ramkiran, 2 Habibulla Khan, 1 M.Usha, 1 B.T.P.Madhav,
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
Building an Advanced Invariant Real-Time Human Tracking System
UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian
Network Sensing Network Monitoring and Diagnosis Technologies
Network Sensing Network Monitoring and Diagnosis Technologies Masanobu Morinaga Yuji Nomura Takeshi Otani Motoyuki Kawaba (Manuscript received April 7, 2009) Problems that occur in the information and
Tracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey [email protected] b Department of Computer
Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers
Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Sung-won ark and Jose Trevino Texas A&M University-Kingsville, EE/CS Department, MSC 92, Kingsville, TX 78363 TEL (36) 593-2638, FAX
Tracking of Small Unmanned Aerial Vehicles
Tracking of Small Unmanned Aerial Vehicles Steven Krukowski Adrien Perkins Aeronautics and Astronautics Stanford University Stanford, CA 94305 Email: [email protected] Aeronautics and Astronautics Stanford
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA N. Zarrinpanjeh a, F. Dadrassjavan b, H. Fattahi c * a Islamic Azad University of Qazvin - [email protected]
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS Aswin C Sankaranayanan, Qinfen Zheng, Rama Chellappa University of Maryland College Park, MD - 277 {aswch, qinfen, rama}@cfar.umd.edu Volkan Cevher, James
Template-based Eye and Mouth Detection for 3D Video Conferencing
Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING Ms.PALLAVI CHOUDEKAR Ajay Kumar Garg Engineering College, Department of electrical and electronics Ms.SAYANTI BANERJEE Ajay Kumar Garg Engineering
Low-resolution Character Recognition by Video-based Super-resolution
2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro
Cloud tracking with optical flow for short-term solar forecasting
Cloud tracking with optical flow for short-term solar forecasting Philip Wood-Bradley, José Zapata, John Pye Solar Thermal Group, Australian National University, Canberra, Australia Corresponding author:
EE289 Lab Fall 2009. LAB 4. Ambient Noise Reduction. 1 Introduction. 2 Simulation in Matlab Simulink
EE289 Lab Fall 2009 LAB 4. Ambient Noise Reduction 1 Introduction Noise canceling devices reduce unwanted ambient noise (acoustic noise) by means of active noise control. Among these devices are noise-canceling
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
Automatic Traffic Estimation Using Image Processing
Automatic Traffic Estimation Using Image Processing Pejman Niksaz Science &Research Branch, Azad University of Yazd, Iran [email protected] Abstract As we know the population of city and number of
Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones
Final Year Project Progress Report Frequency-Domain Adaptive Filtering Myles Friel 01510401 Supervisor: Dr.Edward Jones Abstract The Final Year Project is an important part of the final year of the Electronic
An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network
Proceedings of the 8th WSEAS Int. Conf. on ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING & DATA BASES (AIKED '9) ISSN: 179-519 435 ISBN: 978-96-474-51-2 An Energy-Based Vehicle Tracking System using Principal
Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids
Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain
Search keywords: Connect, Meeting, Collaboration, Voice over IP, VoIP, Acoustic Magic, audio, web conferencing, microphone, best practices
Title: Acoustic Magic Voice Tracker II array microphone improves operation with VoIP based Adobe Connect Meeting URL: www.acousticmagic.com By: Bob Feingold, President, Acoustic Magic Inc. Search keywords:
Navigation Aid And Label Reading With Voice Communication For Visually Impaired People
Navigation Aid And Label Reading With Voice Communication For Visually Impaired People A.Manikandan 1, R.Madhuranthi 2 1 M.Kumarasamy College of Engineering, [email protected],karur,india 2 M.Kumarasamy
Face Model Fitting on Low Resolution Images
Face Model Fitting on Low Resolution Images Xiaoming Liu Peter H. Tu Frederick W. Wheeler Visualization and Computer Vision Lab General Electric Global Research Center Niskayuna, NY, 1239, USA {liux,tu,wheeler}@research.ge.com
Adaptive Sampling Rate Correction for Acoustic Echo Control in Voice-Over-IP Matthias Pawig, Gerald Enzner, Member, IEEE, and Peter Vary, Fellow, IEEE
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 58, NO. 1, JANUARY 2010 189 Adaptive Sampling Rate Correction for Acoustic Echo Control in Voice-Over-IP Matthias Pawig, Gerald Enzner, Member, IEEE, and Peter
Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object
Mouse Control using a Web Camera based on Colour Detection
Mouse Control using a Web Camera based on Colour Detection Abhik Banerjee 1, Abhirup Ghosh 2, Koustuvmoni Bharadwaj 3, Hemanta Saikia 4 1, 2, 3, 4 Department of Electronics & Communication Engineering,
TRAFFIC MONITORING WITH AD-HOC MICROPHONE ARRAY
4 4th International Workshop on Acoustic Signal Enhancement (IWAENC) TRAFFIC MONITORING WITH AD-HOC MICROPHONE ARRAY Takuya Toyoda, Nobutaka Ono,3, Shigeki Miyabe, Takeshi Yamada, Shoji Makino University
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm
1 Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm Hani Mehrpouyan, Student Member, IEEE, Department of Electrical and Computer Engineering Queen s University, Kingston, Ontario,
Vision-Based Blind Spot Detection Using Optical Flow
Vision-Based Blind Spot Detection Using Optical Flow M.A. Sotelo 1, J. Barriga 1, D. Fernández 1, I. Parra 1, J.E. Naranjo 2, M. Marrón 1, S. Alvarez 1, and M. Gavilán 1 1 Department of Electronics, University
Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application
Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application Marta Marrón Romera, Miguel Angel Sotelo Vázquez, and Juan Carlos García García Electronics Department, University
HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT
International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika
Fall detection in the elderly by head tracking
Loughborough University Institutional Repository Fall detection in the elderly by head tracking This item was submitted to Loughborough University's Institutional Repository by the/an author. Citation:
A secure face tracking system
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture.
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture. Chirag Gupta,Sumod Mohan K [email protected], [email protected] Abstract In this project we propose a method to improve
Video-Conferencing System
Video-Conferencing System Evan Broder and C. Christoher Post Introductory Digital Systems Laboratory November 2, 2007 Abstract The goal of this project is to create a video/audio conferencing system. Video
SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA
SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA Nitesh Kumar Chaudhary 1 and Shraddha Srivastav 2 1 Department of Electronics & Communication Engineering, LNMIIT, Jaipur, India 2 Bharti School Of Telecommunication,
HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER
HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER Gholamreza Anbarjafari icv Group, IMS Lab, Institute of Technology, University of Tartu, Tartu 50411, Estonia [email protected]
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior Kenji Yamashiro, Daisuke Deguchi, Tomokazu Takahashi,2, Ichiro Ide, Hiroshi Murase, Kazunori Higuchi 3,
Vision based approach to human fall detection
Vision based approach to human fall detection Pooja Shukla, Arti Tiwari CSVTU University Chhattisgarh, [email protected] 9754102116 Abstract Day by the count of elderly people living alone at home
AUDIO POINTER V2 JOYSTICK & CAMERA IN MATLAB
AUDIO POINTER V2 JOYSTICK & CAMERA IN MATLAB F. Rund Department of Radioelectronics, Faculty of Electrical Engineering, Czech Technical University in Prague, Czech Republic Abstract A virtual sound source
An Automatic Optical Inspection System for the Diagnosis of Printed Circuits Based on Neural Networks
An Automatic Optical Inspection System for the Diagnosis of Printed Circuits Based on Neural Networks Ahmed Nabil Belbachir 1, Alessandra Fanni 2, Mario Lera 3 and Augusto Montisci 2 1 Vienna University
How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3
Recognizing Voice Over IP: A Robust Front-End for Speech Recognition on the World Wide Web. By C.Moreno, A. Antolin and F.Diaz-de-Maria. Summary By Maheshwar Jayaraman 1 1. Introduction Voice Over IP is
ROBUST VEHICLE TRACKING IN VIDEO IMAGES BEING TAKEN FROM A HELICOPTER
ROBUST VEHICLE TRACKING IN VIDEO IMAGES BEING TAKEN FROM A HELICOPTER Fatemeh Karimi Nejadasl, Ben G.H. Gorte, and Serge P. Hoogendoorn Institute of Earth Observation and Space System, Delft University
Speed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA [email protected], [email protected] Abstract. A speed
MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL
MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL G. Maria Priscilla 1 and C. P. Sumathi 2 1 S.N.R. Sons College (Autonomous), Coimbatore, India 2 SDNB Vaishnav College
Multimodal Biometric Recognition Security System
Multimodal Biometric Recognition Security System Anju.M.I, G.Sheeba, G.Sivakami, Monica.J, Savithri.M Department of ECE, New Prince Shri Bhavani College of Engg. & Tech., Chennai, India ABSTRACT: Security
A Method of Caption Detection in News Video
3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information.
Real time vehicle detection and tracking on multiple lanes
Real time vehicle detection and tracking on multiple lanes Kristian Kovačić Edouard Ivanjko Hrvoje Gold Department of Intelligent Transportation Systems Faculty of Transport and Traffic Sciences University
Effective Use of Android Sensors Based on Visualization of Sensor Information
, pp.299-308 http://dx.doi.org/10.14257/ijmue.2015.10.9.31 Effective Use of Android Sensors Based on Visualization of Sensor Information Young Jae Lee Faculty of Smartmedia, Jeonju University, 303 Cheonjam-ro,
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering
Tracking performance evaluation on PETS 2015 Challenge datasets
Tracking performance evaluation on PETS 2015 Challenge datasets Tahir Nawaz, Jonathan Boyle, Longzhen Li and James Ferryman Computational Vision Group, School of Systems Engineering University of Reading,
JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions
Edith Cowan University Research Online ECU Publications Pre. JPEG compression of monochrome D-barcode images using DCT coefficient distributions Keng Teong Tan Hong Kong Baptist University Douglas Chai
Poker Vision: Playing Cards and Chips Identification based on Image Processing
Poker Vision: Playing Cards and Chips Identification based on Image Processing Paulo Martins 1, Luís Paulo Reis 2, and Luís Teófilo 2 1 DEEC Electrical Engineering Department 2 LIACC Artificial Intelligence
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Establishing the Uniqueness of the Human Voice for Security Applications
Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.
Emotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
Signature Region of Interest using Auto cropping
ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 1 Signature Region of Interest using Auto cropping Bassam Al-Mahadeen 1, Mokhled S. AlTarawneh 2 and Islam H. AlTarawneh 2 1 Math. And Computer Department,
LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING
LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING RasPi Kaveri Ratanpara 1, Priyan Shah 2 1 Student, M.E Biomedical Engineering, Government Engineering college, Sector-28, Gandhinagar (Gujarat)-382028,
System Aware Cyber Security
System Aware Cyber Security Application of Dynamic System Models and State Estimation Technology to the Cyber Security of Physical Systems Barry M. Horowitz, Kate Pierce University of Virginia April, 2012
Canny Edge Detection
Canny Edge Detection 09gr820 March 23, 2009 1 Introduction The purpose of edge detection in general is to significantly reduce the amount of data in an image, while preserving the structural properties
Introduction to acoustic imaging
Introduction to acoustic imaging Contents 1 Propagation of acoustic waves 3 1.1 Wave types.......................................... 3 1.2 Mathematical formulation.................................. 4 1.3
Detecting and positioning overtaking vehicles using 1D optical flow
Detecting and positioning overtaking vehicles using 1D optical flow Daniel Hultqvist 1, Jacob Roll 1, Fredrik Svensson 1, Johan Dahlin 2, and Thomas B. Schön 3 Abstract We are concerned with the problem
Circle Object Recognition Based on Monocular Vision for Home Security Robot
Journal of Applied Science and Engineering, Vol. 16, No. 3, pp. 261 268 (2013) DOI: 10.6180/jase.2013.16.3.05 Circle Object Recognition Based on Monocular Vision for Home Security Robot Shih-An Li, Ching-Chang
An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks
2011 International Conference on Network and Electronics Engineering IPCSIT vol.11 (2011) (2011) IACSIT Press, Singapore An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks Reyhaneh
Introduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
Object Tracking System Using Approximate Median Filter, Kalman Filter and Dynamic Template Matching
I.J. Intelligent Systems and Applications, 2014, 05, 83-89 Published Online April 2014 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijisa.2014.05.09 Object Tracking System Using Approximate Median
Efficient on-line Signature Verification System
International Journal of Engineering & Technology IJET-IJENS Vol:10 No:04 42 Efficient on-line Signature Verification System Dr. S.A Daramola 1 and Prof. T.S Ibiyemi 2 1 Department of Electrical and Information
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM Amol Ambardekar, Mircea Nicolescu, and George Bebis Department of Computer Science and Engineering University
Normalisation of 3D Face Data
Normalisation of 3D Face Data Chris McCool, George Mamic, Clinton Fookes and Sridha Sridharan Image and Video Research Laboratory Queensland University of Technology, 2 George Street, Brisbane, Australia,
International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014
Efficient Attendance Management System Using Face Detection and Recognition Arun.A.V, Bhatath.S, Chethan.N, Manmohan.C.M, Hamsaveni M Department of Computer Science and Engineering, Vidya Vardhaka College
Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition
IWNEST PUBLISHER Journal of Industrial Engineering Research (ISSN: 2077-4559) Journal home page: http://www.iwnest.com/aace/ Adaptive sequence of Key Pose Detection for Human Action Recognition 1 T. Sindhu
Understanding CIC Compensation Filters
Understanding CIC Compensation Filters April 2007, ver. 1.0 Application Note 455 Introduction f The cascaded integrator-comb (CIC) filter is a class of hardware-efficient linear phase finite impulse response
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Forced Low latency Handoff in Mobile Cellular Data Networks
Forced Low latency Handoff in Mobile Cellular Data Networks N. Moayedian, Faramarz Hendessi Department of Electrical and Computer Engineering Isfahan University of Technology, Isfahan, IRAN [email protected]
ADVANCED APPLICATIONS OF ELECTRICAL ENGINEERING
Development of a Software Tool for Performance Evaluation of MIMO OFDM Alamouti using a didactical Approach as a Educational and Research support in Wireless Communications JOSE CORDOVA, REBECA ESTRADA
A Computer Vision System on a Chip: a case study from the automotive domain
A Computer Vision System on a Chip: a case study from the automotive domain Gideon P. Stein Elchanan Rushinek Gaby Hayun Amnon Shashua Mobileye Vision Technologies Ltd. Hebrew University Jerusalem, Israel
Tracking-Learning-Detection
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 6, NO., JANUARY 200 Tracking-Learning-Detection Zdenek Kalal, Krystian Mikolajczyk, and Jiri Matas, Abstract This paper investigates
Nowcasting of significant convection by application of cloud tracking algorithm to satellite and radar images
Nowcasting of significant convection by application of cloud tracking algorithm to satellite and radar images Ng Ka Ho, Hong Kong Observatory, Hong Kong Abstract Automated forecast of significant convection
Component Ordering in Independent Component Analysis Based on Data Power
Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals
Securing Electronic Medical Records Using Biometric Authentication
Securing Electronic Medical Records Using Biometric Authentication Stephen Krawczyk and Anil K. Jain Michigan State University, East Lansing MI 48823, USA {krawcz10,jain}@cse.msu.edu Abstract. Ensuring
Voice Authentication for ATM Security
Voice Authentication for ATM Security Rahul R. Sharma Department of Computer Engineering Fr. CRIT, Vashi Navi Mumbai, India [email protected] Abstract: Voice authentication system captures the
BARE PCB INSPECTION BY MEAN OF ECT TECHNIQUE WITH SPIN-VALVE GMR SENSOR
BARE PCB INSPECTION BY MEAN OF ECT TECHNIQUE WITH SPIN-VALVE GMR SENSOR K. Chomsuwan 1, S. Yamada 1, M. Iwahara 1, H. Wakiwaka 2, T. Taniguchi 3, and S. Shoji 4 1 Kanazawa University, Kanazawa, Japan;
Visual-based ID Verification by Signature Tracking
Visual-based ID Verification by Signature Tracking Mario E. Munich and Pietro Perona California Institute of Technology www.vision.caltech.edu/mariomu Outline Biometric ID Visual Signature Acquisition
METHODS OF INTEGRATING mvoip IN ADDITION TO A VoIP ENVIRONMENT
Review of the Air Force Academy No 1 (31) 2016 METHODS OF INTEGRATING mvoip IN ADDITION TO A VoIP ENVIRONMENT Paul MOZA, Marian ALEXANDRU Transilvania University, Brașov, Romania DOI: 10.19062/1842-9238.2016.14.1.16
Detection and Recognition of Mixed Traffic for Driver Assistance System
Detection and Recognition of Mixed Traffic for Driver Assistance System Pradnya Meshram 1, Prof. S.S. Wankhede 2 1 Scholar, Department of Electronics Engineering, G.H.Raisoni College of Engineering, Digdoh
