MUSICAL INSTRUMENT FAMILY CLASSIFICATION
|
|
|
- Arnold Grant
- 9 years ago
- Views:
Transcription
1 MUSICAL INSTRUMENT FAMILY CLASSIFICATION Ricardo A. Garcia Media Lab, Massachusetts Institute of Technology 0 Ames Street Room E5-40, Cambridge, MA 039 USA PH: FAX: media. mit. edu A method to classify sounds of musical instruments (single monophonic notes) is introduced. The classification is done using in parallel two sets of perceptual features extracted from the sounds. The models used are a mixture of gaussians, whose parameters where found by training over a database of target sound families. The feature extraction procedure, model training and model usage for classification are explained. An implementation and results of the method are shown and discussed. Introduction Humans are very good at classifying types of musical instruments, even under the most adverse conditions (i.e. noise, polyphonic sound, environmental perturbations). But training a computer to recognize between two different instruments of the same family is not easy task. The problem is that the concept of a family of sounds is difficult to explicitly teach to a computer. A method that uses a set of perceptually derived features to train models of families of sounds is introduced. The sound is analyzed and mapped into two perceptual feature spaces: the Spectral Contours and the Cepstral Coefficients. Approach The proposed method uses a parallel feature-space modeled with gaussian-mixtures to approximate the families of musical instruments to be classified. The models are trained using methods of estimationmaximization. Two feature spaces are used: spectral contour and cepstral coefficients. These are calculated in a frame-by-frame basis for each musical note in the training set. Normalization and dimensionality reduction are applied to make the final training set for each musical instrument type. An independent gaussian mixture model is trained for each feature space for each instrument family. These models are used in parallel to compute the probability of each unknown note of being classified as belonging to a particular instrument type in every feature-space.. Feature-space Digital audio signals are associated with high sampling rates. The amount of data that is produced per second is surprisingly high. Luckily, the characteristics of the sounds of musical instruments vary slowly in time, allowing a meaningful (and very reduced) set of data to be extracted from the original audio signal. In addition, it is important to take into account in some degree the human perceptual element when classifying musical instruments [ 4 ]. Features computed in a frame-by-frame basis of about 0 ms per frame and 30 percent overlap have shown to be good for analysis of musical sounds [ 5 ]... Spectral contour Each frame is transformed using a DFT to the discrete frequency domain. The basic spectral contour is defined as the smoothed energy spectrum. In this project, a modified spectral contour is applied. This modification is performed using a non-linear mapping of the frequency axis into the bark scale [ ]. The
2 bark scale is a perceptually derived scale composed by non-regular divisions in the frequency domain, each division is equivalent to one critical band. This mapping is done using [ ] * f tan f 7500 z = 3tan ( ) The spectral contour is calculated for every frame, in all critical bands. In Figure is possible to see the evolution in time vs. the critical bands of different musical instruments. (note that it is plotted the log of the magnitude to have a better understanding). (a) Figure Spectral contour of (a) piano and (b) saxophone (b).. Cepstral coefficients A technique originally developed to analyze speech can be applied to extract some information about the structure of the sound [ 7 ],[ 5 ] The real cepstrum (or simply cepstrum) is computed as the inverse Discrete Fourier Transform of the logarithm of the magnitude of the energy spectrum of the frame. This is expressed as: cepstrum= IFFT( log( FFT( frame( t) ))) ( ) For each frame, a set of the first cepstral coefficients is selected. These first coefficients are related to the distribution of the broadband energy in the frequency domain. If a sound is regarded as a time-invariant linear system that is exited by a quasi-periodic source, the first coefficients of the cepstrum will be related to the structure of the system, regardless of the frequency of the excitation [ 7 ]. This is very convenient for the task at hand, because the goal is to classify sounds regardless of their pitch. Also, the underlying assumption is that each family of instruments preserves some of the characteristics (cepstral coefficients) across the pitch range. Figure shows the cepstral coefficients evolution for the same instruments in the previous figure.
3 3 (a) Figure Cepstrum coefficients for (a) piano and (b) saxophone (b). Feature data manipulation Because of the high number of dimensions that result typically in the feature extraction, it is necessary to realize a dimensionality reduction to be able to manage the data in a more efficient way. Normalization helps to isolate the effects of different energy levels in different frames and sound samples... Normalization Many of the feature values are related to the energy of the spectrum (or the log of the energy), and at the same time it depends on the energy of the original sound (or sound level). This can drive similar sounds with different sound level to be classified as from different families. The approach taken is to make the magnitude of each frame to be equal to unity. frame frame= ( 3 ) frame Note that this will make some frames that fade out in time to loose this time dependant characteristic... Dimensionality reduction It is convenient to reduce the number of components to manipulate in each frame, because this reduces the time of computation of the algorithms, as well as it improves the performance of the selection when removing redundant (or noisy) information from within the data sets. The method used is Principal Component Analysis (PCA) [ 3 ]. y = Wx.3 Model of a musical instrument family: gaussian mixture Each musical instrument family is modeled by two independent gaussian mixtures [ ],[ 3 ],[ 6 ], one for each feature space used. Therefore, the representation of a single instrument is given by the linear combination of these gaussian mixtures: p M contour cepstrum ( x type) = A Pjcontour( j) pcontour( x j) + B Pjcepstrum( j) pcepstrum( x j) j= M j= ( 4 ) ( 5 )
4 4 Where each p ( x j) contour / cepstrum is a multivariate, full covariance gaussian distribution, with dimension equal to the dimension of each feature space. The weights A and B reflect some knowledge about the confidence in the measure of each feature space. (A+B=, A,B >=0);.3. Training: Expectation maximization An iterative procedure to estimate an optimal set of parameters for the models is used, namely Expectation Maximization EM [ ],[ 3 ], The algorithm selects a random initial guess for the parameters to optimize σ and computes the expected probability of each training point with relation to the mixture. ( p ( j), ) j, u j j Then, a new set of parameters is estimated and the procedure repeated until a good solution is found, or a limit in the number of iterations is reached..3. Classification procedure The classification of a new sound sample has the following steps: - Feature extraction: Extract the frames, and the Spectral Envelope and Cepstrum coefficients. - Normalization: each frame is normalized to unity magnitude. - Dimensionality reduction: for each feature set and each type of musical instrument, a different transform matrix is used to reduce the number of dimensions. - Calculate p(x type) for all the types of instruments, all the frames of the sound, all the feature spaces. - Select the highest p(x type) for each frame. - Classify the sound as belonging to the type that appears the most in all the classified frames/feature spaces. 3 Implementation The suggested algorithm was implemented and tested. The used resources and the results are shown. 3. Program and required resources Sound Database: McGill University Master Samples (MUMS library). CD : Solo Strings: solo violin, viola and cello. 3 notes CD : Woodwind and brass: Flutes, clarinets, bassoons, trumpets, and trombones. CD 3: Piano: Grand pianos. 64 notes Each set was randomly divided in 80% training and 0% testing data. 454 notes The programs were written in Matlab 6.0 and run in a Pentium III dual proc. 750 Mhz, 56 MB ram. They took in average 800 seconds to find the parameters for a model using 0 dimensions and 7 gaussians, using all the training data set. Because of the dependence in a good selection of initial conditions for a good estimate of the parameters using EM, each model was required to be estimated an average of 3 times, to avoid some problems with numeric precision (when a matrix of covariance shrinks to be zero). 4 Discussion During the realization of this project, some interesting functional aspects of EM and gaussian mixtures were noted: When having many dimensions (more than 8), and many gaussians (more than 6), it was common that the initial conditions of the parameter estimation seemed to make the system more sensitive to them. Many of the runs where stopped because of numerical problems with the evaluation after a bad choice of initial conditions.
5 5 A couple of restrictions have to be enforced, to not allow a gaussian to shrink in any given dimension too much. This will make it flat in this dimension and therefore worthless. The number of dimensions needed to express both feature spaces was around 8 for the Spectral Contour, and about 5 for the Cepstrum Coefficients. The number of gaussians was about 6 and 3 respectively. 5 References [ ] Duda, R. O., Hart, P. E., Pattern Classification and Scene Analysis, John Wiley & sons, New York, 973 [ ] Garcia, R. A., Digital Watermarking of Audio Signals using a Psychoacoustic Auditory Model and Spread Spectrum Theory, University of Miami, Master thesis, 999 [ 3 ] Gershenfeld, N., The Nature of Mathematical Modeling, Cambridge University Press, New York, 999 [ 4 ] Martin, K. D., Sound Source Recognition: A Theory and Computational Model, MIT Ph.D Dissertation, 999 [ 5 ] Oppeneim, A. V., Schafer, R. W., Buck, J. R., Discrete-Time Signal Processing, Prentice hall, New Jersey, Second Edition, 999 [ 6 ] Papoulis, A., Probability, Random Variables, and Stochastic Processes, Mc Graw Hill, Third Ed., 99 [ 7 ] Rabiner, L. R., Schafer, R. W., Digital Processing of Speech Signals, Prentice Hall, New Jersey, 978
L9: Cepstral analysis
L9: Cepstral analysis The cepstrum Homomorphic filtering The cepstrum and voicing/pitch detection Linear prediction cepstral coefficients Mel frequency cepstral coefficients This lecture is based on [Taylor,
Artificial Neural Network for Speech Recognition
Artificial Neural Network for Speech Recognition Austin Marshall March 3, 2005 2nd Annual Student Research Showcase Overview Presenting an Artificial Neural Network to recognize and classify speech Spoken
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
Speech Signal Processing: An Overview
Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech
BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION
BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION P. Vanroose Katholieke Universiteit Leuven, div. ESAT/PSI Kasteelpark Arenberg 10, B 3001 Heverlee, Belgium [email protected]
Advanced Signal Processing and Digital Noise Reduction
Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK WILEY HTEUBNER A Partnership between John Wiley & Sons and B. G. Teubner Publishers Chichester New
Ericsson T18s Voice Dialing Simulator
Ericsson T18s Voice Dialing Simulator Mauricio Aracena Kovacevic, Anna Dehlbom, Jakob Ekeberg, Guillaume Gariazzo, Eric Lästh and Vanessa Troncoso Dept. of Signals Sensors and Systems Royal Institute of
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Establishing the Uniqueness of the Human Voice for Security Applications
Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.
Separation and Classification of Harmonic Sounds for Singing Voice Detection
Separation and Classification of Harmonic Sounds for Singing Voice Detection Martín Rocamora and Alvaro Pardo Institute of Electrical Engineering - School of Engineering Universidad de la República, Uruguay
Recent advances in Digital Music Processing and Indexing
Recent advances in Digital Music Processing and Indexing Acoustics 08 warm-up TELECOM ParisTech Gaël RICHARD Telecom ParisTech (ENST) www.enst.fr/~grichard/ Content Introduction and Applications Components
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song
, pp.347-354 http://dx.doi.org/10.14257/ijmue.2014.9.8.32 A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song Myeongsu Kang and Jong-Myon Kim School of Electrical Engineering,
Quarterly Progress and Status Report. Measuring inharmonicity through pitch extraction
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Measuring inharmonicity through pitch extraction Galembo, A. and Askenfelt, A. journal: STL-QPSR volume: 35 number: 1 year: 1994
Developing an Isolated Word Recognition System in MATLAB
MATLAB Digest Developing an Isolated Word Recognition System in MATLAB By Daryl Ning Speech-recognition technology is embedded in voice-activated routing systems at customer call centres, voice dialling
Analysis/resynthesis with the short time Fourier transform
Analysis/resynthesis with the short time Fourier transform summer 2006 lecture on analysis, modeling and transformation of audio signals Axel Röbel Institute of communication science TU-Berlin IRCAM Analysis/Synthesis
Auto-Tuning Using Fourier Coefficients
Auto-Tuning Using Fourier Coefficients Math 56 Tom Whalen May 20, 2013 The Fourier transform is an integral part of signal processing of any kind. To be able to analyze an input signal as a superposition
Automatic Evaluation Software for Contact Centre Agents voice Handling Performance
International Journal of Scientific and Research Publications, Volume 5, Issue 1, January 2015 1 Automatic Evaluation Software for Contact Centre Agents voice Handling Performance K.K.A. Nipuni N. Perera,
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
Lab 1. The Fourier Transform
Lab 1. The Fourier Transform Introduction In the Communication Labs you will be given the opportunity to apply the theory learned in Communication Systems. Since this is your first time to work in the
CS 688 Pattern Recognition Lecture 4. Linear Models for Classification
CS 688 Pattern Recognition Lecture 4 Linear Models for Classification Probabilistic generative models Probabilistic discriminative models 1 Generative Approach ( x ) p C k p( C k ) Ck p ( ) ( x Ck ) p(
SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA
SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA Nitesh Kumar Chaudhary 1 and Shraddha Srivastav 2 1 Department of Electronics & Communication Engineering, LNMIIT, Jaipur, India 2 Bharti School Of Telecommunication,
203.4770: Introduction to Machine Learning Dr. Rita Osadchy
203.4770: Introduction to Machine Learning Dr. Rita Osadchy 1 Outline 1. About the Course 2. What is Machine Learning? 3. Types of problems and Situations 4. ML Example 2 About the course Course Homepage:
Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA
Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Measuring Line Edge Roughness: Fluctuations in Uncertainty
Tutor6.doc: Version 5/6/08 T h e L i t h o g r a p h y E x p e r t (August 008) Measuring Line Edge Roughness: Fluctuations in Uncertainty Line edge roughness () is the deviation of a feature edge (as
Probability and Random Variables. Generation of random variables (r.v.)
Probability and Random Variables Method for generating random variables with a specified probability distribution function. Gaussian And Markov Processes Characterization of Stationary Random Process Linearly
Hardware Implementation of Probabilistic State Machine for Word Recognition
IJECT Vo l. 4, Is s u e Sp l - 5, Ju l y - Se p t 2013 ISSN : 2230-7109 (Online) ISSN : 2230-9543 (Print) Hardware Implementation of Probabilistic State Machine for Word Recognition 1 Soorya Asokan, 2
School Class Monitoring System Based on Audio Signal Processing
C. R. Rashmi 1,,C.P.Shantala 2 andt.r.yashavanth 3 1 Department of CSE, PG Student, CIT, Gubbi, Tumkur, Karnataka, India. 2 Department of CSE, Vice Principal & HOD, CIT, Gubbi, Tumkur, Karnataka, India.
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Transcription of polyphonic signals using fast filter bank( Accepted version ) Author(s) Foo, Say Wei;
Automatic Transcription: An Enabling Technology for Music Analysis
Automatic Transcription: An Enabling Technology for Music Analysis Simon Dixon [email protected] Centre for Digital Music School of Electronic Engineering and Computer Science Queen Mary University
CS 591.03 Introduction to Data Mining Instructor: Abdullah Mueen
CS 591.03 Introduction to Data Mining Instructor: Abdullah Mueen LECTURE 3: DATA TRANSFORMATION AND DIMENSIONALITY REDUCTION Chapter 3: Data Preprocessing Data Preprocessing: An Overview Data Quality Major
High Quality Integrated Data Reconstruction for Medical Applications
High Quality Integrated Data Reconstruction for Medical Applications A.K.M Fazlul Haque Md. Hanif Ali M Adnan Kiber Department of Computer Science Department of Computer Science Department of Applied Physics,
MODELING DYNAMIC PATTERNS FOR EMOTIONAL CONTENT IN MUSIC
12th International Society for Music Information Retrieval Conference (ISMIR 2011) MODELING DYNAMIC PATTERNS FOR EMOTIONAL CONTENT IN MUSIC Yonatan Vaizman Edmond & Lily Safra Center for Brain Sciences,
Music Genre Classification
Music Genre Classification Michael Haggblade Yang Hong Kenny Kao 1 Introduction Music classification is an interesting problem with many applications, from Drinkify (a program that generates cocktails
CCNY. BME I5100: Biomedical Signal Processing. Linear Discrimination. Lucas C. Parra Biomedical Engineering Department City College of New York
BME I5100: Biomedical Signal Processing Linear Discrimination Lucas C. Parra Biomedical Engineering Department CCNY 1 Schedule Week 1: Introduction Linear, stationary, normal - the stuff biology is not
Admin stuff. 4 Image Pyramids. Spatial Domain. Projects. Fourier domain 2/26/2008. Fourier as a change of basis
Admin stuff 4 Image Pyramids Change of office hours on Wed 4 th April Mon 3 st March 9.3.3pm (right after class) Change of time/date t of last class Currently Mon 5 th May What about Thursday 8 th May?
Principal components analysis
CS229 Lecture notes Andrew Ng Part XI Principal components analysis In our discussion of factor analysis, we gave a way to model data x R n as approximately lying in some k-dimension subspace, where k
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering
Trinity College London
Trinity College London The Queensland Curriculum and Assessment Authority (QCAA) has recognised the following studies as contributing studies for the (QCE). The QCAA has no responsibility regarding implementation
Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids
Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain
A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques
A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques Vineela Behara,Y Ramesh Department of Computer Science and Engineering Aditya institute of Technology and
Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones
Final Year Project Progress Report Frequency-Domain Adaptive Filtering Myles Friel 01510401 Supervisor: Dr.Edward Jones Abstract The Final Year Project is an important part of the final year of the Electronic
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
The Algorithms of Speech Recognition, Programming and Simulating in MATLAB
FACULTY OF ENGINEERING AND SUSTAINABLE DEVELOPMENT. The Algorithms of Speech Recognition, Programming and Simulating in MATLAB Tingxiao Yang January 2012 Bachelor s Thesis in Electronics Bachelor s Program
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
ANALYZER BASICS WHAT IS AN FFT SPECTRUM ANALYZER? 2-1
WHAT IS AN FFT SPECTRUM ANALYZER? ANALYZER BASICS The SR760 FFT Spectrum Analyzer takes a time varying input signal, like you would see on an oscilloscope trace, and computes its frequency spectrum. Fourier's
Basic Music Theory for Junior Cert.
1 Reading Different Clefs Basic Music Theory for Junior Cert. The most commonly used clefs are the treble and bass. The ability to read both of these clefs proficiently is essential for Junior Cert. Music.
SOFTWARE FOR GENERATION OF SPECTRUM COMPATIBLE TIME HISTORY
3 th World Conference on Earthquake Engineering Vancouver, B.C., Canada August -6, 24 Paper No. 296 SOFTWARE FOR GENERATION OF SPECTRUM COMPATIBLE TIME HISTORY ASHOK KUMAR SUMMARY One of the important
MACHINE LEARNING IN HIGH ENERGY PHYSICS
MACHINE LEARNING IN HIGH ENERGY PHYSICS LECTURE #1 Alex Rogozhnikov, 2015 INTRO NOTES 4 days two lectures, two practice seminars every day this is introductory track to machine learning kaggle competition!
BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376
Course Director: Dr. Kayvan Najarian (DCM&B, [email protected]) Lectures: Labs: Mondays and Wednesdays 9:00 AM -10:30 AM Rm. 2065 Palmer Commons Bldg. Wednesdays 10:30 AM 11:30 AM (alternate weeks) Rm.
Short-time FFT, Multi-taper analysis & Filtering in SPM12
Short-time FFT, Multi-taper analysis & Filtering in SPM12 Computational Psychiatry Seminar, FS 2015 Daniel Renz, Translational Neuromodeling Unit, ETHZ & UZH 20.03.2015 Overview Refresher Short-time Fourier
Component Ordering in Independent Component Analysis Based on Data Power
Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals
B3. Short Time Fourier Transform (STFT)
B3. Short Time Fourier Transform (STFT) Objectives: Understand the concept of a time varying frequency spectrum and the spectrogram Understand the effect of different windows on the spectrogram; Understand
Dynamic Process Modeling. Process Dynamics and Control
Dynamic Process Modeling Process Dynamics and Control 1 Description of process dynamics Classes of models What do we need for control? Modeling for control Mechanical Systems Modeling Electrical circuits
A Digital Audio Watermark Embedding Algorithm
Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 3008, China [email protected],
The Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
MIMO CHANNEL CAPACITY
MIMO CHANNEL CAPACITY Ochi Laboratory Nguyen Dang Khoa (D1) 1 Contents Introduction Review of information theory Fixed MIMO channel Fading MIMO channel Summary and Conclusions 2 1. Introduction The use
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Similar benefits are also derived through modal testing of other space structures.
PAGE 1 OF 5 PREFERRED RELIABILITY PRACTICES MODAL TESTING: MEASURING DYNAMIC STRUCTURAL CHARACTERISTICS Practice: Modal testing is a structural testing practice that provides low levels of mechanical excitation
The Image Deblurring Problem
page 1 Chapter 1 The Image Deblurring Problem You cannot depend on your eyes when your imagination is out of focus. Mark Twain When we use a camera, we want the recorded image to be a faithful representation
PYKC Jan-7-10. Lecture 1 Slide 1
Aims and Objectives E 2.5 Signals & Linear Systems Peter Cheung Department of Electrical & Electronic Engineering Imperial College London! By the end of the course, you would have understood: Basic signal
Emotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
Frequently Asked Questions: Applied Music Lessons
Frequently Asked Questions: Applied Music Lessons Do I need to have experience playing an instrument to take lessons? No! We welcome aspiring musicians of all levels of experience. What instruments can
Non-negative Matrix Factorization (NMF) in Semi-supervised Learning Reducing Dimension and Maintaining Meaning
Non-negative Matrix Factorization (NMF) in Semi-supervised Learning Reducing Dimension and Maintaining Meaning SAMSI 10 May 2013 Outline Introduction to NMF Applications Motivations NMF as a middle step
Matlab GUI for WFB spectral analysis
Matlab GUI for WFB spectral analysis Jan Nováček Department of Radio Engineering K13137, CTU FEE Prague Abstract In the case of the sound signals analysis we usually use logarithmic scale on the frequency
SOLVING LINEAR SYSTEMS
SOLVING LINEAR SYSTEMS Linear systems Ax = b occur widely in applied mathematics They occur as direct formulations of real world problems; but more often, they occur as a part of the numerical analysis
Linear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
Lecture 14. Point Spread Function (PSF)
Lecture 14 Point Spread Function (PSF), Modulation Transfer Function (MTF), Signal-to-noise Ratio (SNR), Contrast-to-noise Ratio (CNR), and Receiver Operating Curves (ROC) Point Spread Function (PSF) Recollect
Capacity Limits of MIMO Channels
Tutorial and 4G Systems Capacity Limits of MIMO Channels Markku Juntti Contents 1. Introduction. Review of information theory 3. Fixed MIMO channels 4. Fading MIMO channels 5. Summary and Conclusions References
Machine Learning for Data Science (CS4786) Lecture 1
Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:
Christfried Webers. Canberra February June 2015
c Statistical Group and College of Engineering and Computer Science Canberra February June (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 829 c Part VIII Linear Classification 2 Logistic
TRADUZIONE IN INGLESE degli
TRADUZIONE IN INGLESE degli ORDINAMENTI DIDATTICI DEI CORSI DI STUDIO DEI CONSERVATORI DI MUSICA DEPARTMENT OF STRINGS SCHOOL OF HARP HARP Students completing required courses for the first level Academic
Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP
Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National
Lecture 9: Introduction to Pattern Analysis
Lecture 9: Introduction to Pattern Analysis g Features, patterns and classifiers g Components of a PR system g An example g Probability definitions g Bayes Theorem g Gaussian densities Features, patterns
Automated Stellar Classification for Large Surveys with EKF and RBF Neural Networks
Chin. J. Astron. Astrophys. Vol. 5 (2005), No. 2, 203 210 (http:/www.chjaa.org) Chinese Journal of Astronomy and Astrophysics Automated Stellar Classification for Large Surveys with EKF and RBF Neural
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
ADVANCED APPLICATIONS OF ELECTRICAL ENGINEERING
Development of a Software Tool for Performance Evaluation of MIMO OFDM Alamouti using a didactical Approach as a Educational and Research support in Wireless Communications JOSE CORDOVA, REBECA ESTRADA
Logistic Regression. Vibhav Gogate The University of Texas at Dallas. Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld.
Logistic Regression Vibhav Gogate The University of Texas at Dallas Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld. Generative vs. Discriminative Classifiers Want to Learn: h:x Y X features
Comp 14112 Fundamentals of Artificial Intelligence Lecture notes, 2015-16 Speech recognition
Comp 14112 Fundamentals of Artificial Intelligence Lecture notes, 2015-16 Speech recognition Tim Morris School of Computer Science, University of Manchester 1 Introduction to speech recognition 1.1 The
Hobbayne Primary School Music Policy Statement Updated October 2011
Hobbayne Primary School Music Policy Statement Updated October 2011 Statement of Intent This policy outlines the purpose, nature and management of Music taught in the school. The main aim of Music Teaching
Spectrum Level and Band Level
Spectrum Level and Band Level ntensity, ntensity Level, and ntensity Spectrum Level As a review, earlier we talked about the intensity of a sound wave. We related the intensity of a sound wave to the acoustic
RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA
RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA ABSTRACT Random vibration is becoming increasingly recognized as the most realistic method of simulating the dynamic environment of military
An introduction to OBJECTIVE ASSESSMENT OF IMAGE QUALITY. Harrison H. Barrett University of Arizona Tucson, AZ
An introduction to OBJECTIVE ASSESSMENT OF IMAGE QUALITY Harrison H. Barrett University of Arizona Tucson, AZ Outline! Approaches to image quality! Why not fidelity?! Basic premises of the task-based approach!
Adding Sinusoids of the Same Frequency. Additive Synthesis. Spectrum. Music 270a: Modulation
Adding Sinusoids of the Same Frequency Music 7a: Modulation Tamara Smyth, [email protected] Department of Music, University of California, San Diego (UCSD) February 9, 5 Recall, that adding sinusoids of
STUDY OF DAM-RESERVOIR DYNAMIC INTERACTION USING VIBRATION TESTS ON A PHYSICAL MODEL
STUDY OF DAM-RESERVOIR DYNAMIC INTERACTION USING VIBRATION TESTS ON A PHYSICAL MODEL Paulo Mendes, Instituto Superior de Engenharia de Lisboa, Portugal Sérgio Oliveira, Laboratório Nacional de Engenharia
Introduction to Machine Learning Using Python. Vikram Kamath
Introduction to Machine Learning Using Python Vikram Kamath Contents: 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Introduction/Definition Where and Why ML is used Types of Learning Supervised Learning Linear Regression
RADIO FREQUENCY INTERFERENCE AND CAPACITY REDUCTION IN DSL
RADIO FREQUENCY INTERFERENCE AND CAPACITY REDUCTION IN DSL Padmabala Venugopal, Michael J. Carter*, Scott A. Valcourt, InterOperability Laboratory, Technology Drive Suite, University of New Hampshire,
Supervised Feature Selection & Unsupervised Dimensionality Reduction
Supervised Feature Selection & Unsupervised Dimensionality Reduction Feature Subset Selection Supervised: class labels are given Select a subset of the problem features Why? Redundant features much or
