Using the cellular mobile telephone network to remotely monitor Parkinson s disease symptom severity

Size: px
Start display at page:

Download "Using the cellular mobile telephone network to remotely monitor Parkinson s disease symptom severity"

Transcription

1 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 1 Using the cellular mobile telephone network to remotely monitor Parkinson s disease symptom severity Athanasios Tsanas*, Max A. Little, Patrick E. McSharry, Senior Member, IEEE, Lorraine O. Ramig Abstract Telemonitoring of Parkinson s Disease (PD) has attracted interest because of the potential to make a lasting, positive impact on the life of patients and their carers. Purposeful-built devices have been developed that record various signals which can be associated with average PD symptom severity, as quantified on standard clinical metrics such as the Unified Parkinson s Disease Rating Scale (UPDRS). Speech signals are particularly promising in this regard, because they are easy to self-collect without the use of expensive, dedicated hardware. Previous studies have demonstrated replication of UPDRS to within less than 2 points of a clinical raters assessment of symptom severity, using high-quality speech signals collected using dedicated telemonitoring hardware. Here, we investigate the potential of using the standard voice-over- GSM (2G) or UMTS (3G) cellular mobile telephone networks for PD telemonitoring, networks that, together, have greater than 5 billion subscribers worldwide. We test the robustness of this approach using a simulated noisy mobile communication network over which speech signals are transmitted, and approximately 6000 recordings from 42 PD subjects. We show that UPDRS can be estimated to within about 3.5 points difference from the clinicians assessment, which is clinically useful given that the inter-rater variability for UPDRS can be as high as 4-5 UPDRS points. This provides compelling evidence that the existing voice telephone network is more than adequate for inexpensive, mass-scale PD symptom telemonitoring applications. Manuscript received February 15. A. Tsanas gratefully acknowledges the financial support of the Engineering and Physical Sciences Research Council (EPSRC), UK, and the financial support of Intel Corporation, US. M.A. Little acknowledges the financial support of the Wellcome Trust, grant number WT090651MF. P.E. McSharry acknowledges the financial support of the European Commission through the SafeWind project (ENK7-CT ). Asterisk indicates corresponding author. *A. Tsanas is with the Oxford Centre for Industrial and Applied Mathematics (OCIAM), Mathematical Institute, University of Oxford, Oxford, UK and with the Systems Analysis Modelling and Prediction (SAMP) group, Department of Engineering Science, University of Oxford, Oxford, UK. (phone: ; fax: ; tsanas@maths.ox.ac.uk, tsanasthanasis@gmail.com). M. A. Little is with the Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA, and with the Department of Physics, University of Oxford, Oxford, UK (maxl@mit.edu). P. E. McSharry is with the Smith School of Enterprise and the Environment, University of Oxford, Oxford, UK, and with the Oxford Centre for Industrial and Applied Mathematics, University of Oxford, Oxford, UK (patrick@mcsharry.net). L. O. Ramig is with the Speech, Language, and Hearing Science, University of Colorado, Boulder, Colorado, USA, and with the National Center for Voice and Speech, Denver, Colorado, USA (Lorraine.Ramig@colorado.edu). Index Terms Decision support tool, Parkinson s disease, mobile phones, nonlinear speech signal processing, telemedicine P I. INTRODUCTION ARKINSON S disease (PD) is a chronic neurodegenerative disorder characterized by the progressive deterioration of motor function as well as the emergence of considerable non-motor problems [1]. The PD incidence rate is approximately 20/100,000 [2] and the prevalence rate exceeds 100/100,000 [3]; moreover it is believed that an additional 20% of people with Parkinson s (PWP) might be undiagnosed [4]. Early PD stages are mainly characterized by three hallmark symptoms: bradykinesia (slow and reduced amplitude of movement), rigidity (resistance to passive movement), and tremor (while at rest) [5]. Speech disorders, which are of particular interest in this study, may be amongst the earliest PD onset indicators [6], and are reported in the vast majority of PWP [7]. Furthermore, strong empirical evidence has emerged linking speech performance degradation and PD symptom severity [6], [8], [9]. Disease onset was believed to be due to dopaminergic neuron reduction in the basal ganglia, but it is now recognized that PD is also characterised by the degeneration of numerous non-dopaminergic pathways [1]. Medication and surgical intervention can alleviate some of the symptoms and improve quality of life for most PWP [10], although there is no known cure currently. To optimize treatment, PWP are typically followed up by expert clinical staff at regular (three to six month) intervals. More frequent monitoring of PD progression would be of considerable benefit, for example, to optimize treatment regimes, but this is not possible given the available resources and the established assessment setting which requires the physical presence of PWP in the clinic. In current clinical practice, medical raters physically examine PWP and map symptom severity on appropriate clinical scales (metrics). The Unified Parkinson s Disease Rating Scale (UPDRS) [11] is the most widely used clinical metric for quantifying PD impairment [12], and attempts to quantify the full breadth of possible motor (muscle), nonmotor PD symptoms, and complications of dopamine replacement therapies. It consists of 42 items, some with subdivisions (that is, some items have two or more entries). We

2 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 2 shall refer to each individual UPDRS entry as a section in this study. The UPDRS is organized around four major components, where each component is composed of a number of sections. The four components are: (1) Mentation, behavior and mood (4 sections, sections 1-4); (2) Activities of daily living (13 sections, sections 5-17), assessing whether PWP can complete daily tasks unassisted; (3) Motor (27 sections, sections 18-44), addressing muscular control; and (4) Complication of therapy (11 sections, sections 45-55), which expresses dyskinesia and disability problems associated with PD treatment. The third component is commonly referred to as motor-updrs, and is highly correlated with the total UPDRS value [13]. For untreated PWP the fourth component is not used. The motor-updrs ranges between 0 and 108, where 0 denotes healthy state, and 108 severe disabilities. The total UPDRS value is computed by summing up the constituent sections in the components one to three (sections 1-44), and ranges between 0 and 176. In addition to UPDRS, the Hoehn and Yahr (H&Y) scale is a popular clinical metric which provides overall PD stage assessment; recent studies have shown that it is possible to map UPDRS onto H&Y [14]. Other metrics are occasionally used in clinical practice, but here we focus exclusively on UPDRS. Telemonitoring in healthcare is an emerging field which has the potential to revolutionize contemporary clinical practice. It relies on the use of modern equipment, some of which was not widely available just a decade ago (such as mobile phones and the Internet), in order to facilitate the flow of information between patient and medical expert. This flow enables fast, frequent, remote tracking of disease status, minimizing the need for frequent and often inconvenient physical visits to the clinic, and allowing rapid response to the changing circumstances of patients. In addition, telemonitoring alleviates the workload of medical personnel and the associated financial strains on national health systems. Telemonitoring of PD has attracted considerable interest lately [13], [15], [16], [17], [18]. However, all those studies rely on the use of expensive, specialized hardware purpose-built to record signals which are characteristic of PD symptoms. Recently, we demonstrated the considerable potential of speech in this application [13], [18], [19], [20], using high quality speech signals collected with Intel Corporation s At- Home Testing Device (AHTD) [15]. This device collects high quality speech signals sampled at 24 khz, following the established recommendation that a sampling frequency of at least 20 khz should be used to extract clinically useful information [21]. In this study, we investigate whether it is possible to accurately infer UPDRS using speech signals transmitted over the standard cellular mobile voice telephony network, simulating all the major steps involved in this process. The rationale for using the existing voice mobile phone network over specialized, purpose-built hardware is that (a) the existing voice network reaches nearly 75% of the global population, (b) economies of scale and global market competition has brought the price of access down so that it is affordable to a majority of the global population, (c) mobile telephony allows freedom of movement for PWP, eliminating the need to carry additional equipment when leaving home. Data-mining of speech signals obtained using the public telephone network for clinically useful information has recently shown promising results [22], [23]. Similarly, Saenz- Lechon et al. [24] investigated the effect of different data transmission rates in automatic voice pathology detection, and concluded that compressing signals (down to at most 64 kbps) does not prevent accurate detection of pathologies. We demonstrate that mobile phone technology could indeed be useful in telemonitoring average PD symptom severity further endorsing our previous findings that speech may offer a convenient framework [13], [18], [20]. II. DATA We use the voice data collected by Goetz et al. [15], described in detail in Tsanas et al. [13]. In summary, 52 subjects with idiopathic PD diagnosis up to five years from the time of the baseline clinical visit were recruited into a clinical trial to investigate the potential of the AHTD. All subjects gave written informed consent, and did not receive PD-related treatment for the six-month duration of the trial. They were asked to complete a range of tests weekly during a convenient, pre-specified time window (all tests can be completed in about minutes). Sustained vowel ahh phonations, where the subject is asked to sustain vowel phonation at a comfortable pitch for as long and as steadily as possible, were part of the test protocol. Here we focus exclusively on these sustained phonations. Subjects were diagnosed with PD if they had at least two of the three hallmark PD symptoms (bradykinesia, rigidity, tremor), without evidence of other forms of Parkinsonism. We did not apply any exclusion criteria related to specific PD symptoms (e.g. depression). We disregarded data from 10 recruits two that dropped out of the study early, and from eight additional PWP recruits that did not complete at least 20 valid study sessions during the trial period. Therefore, in this study we analyze data from 42 PWP. Previously, we demonstrated that partitioning the data by gender is important in this application [13], [20], and hence males and females are studied separately here as well. The 28 male subjects were 64.8±8.1 (mean ± standard deviation) years old, with a PD diagnosis 63.0±61.9 weeks since diagnosis at trial baseline. Their motor-updrs scores were: baseline 20.3±8.5, three months into the trial 21.9±8.7, six months into the trial 22.0±9.2, and total-updrs scores were: baseline 27.5±11.6, three months into the trial 30.4±11.8, and six months into the trial 31.0±12.4. The 14 female subjects were 63.6±11.6 years old, with a PD diagnosis 89.7±81.2 weeks since diagnosis at trial baseline. Their motor-updrs was: baseline 17.6±7.4, three months into the trial 21.2±10.5, six months into the trial 20.1±9.4, and their total-updrs was: baseline 24.2±9.1, three months into the trial 27.4±12.1, and six months into the trial 26.8±10.8. Six sustained vowel ahh phonations were recorded each time the PD subject took the test: four at comfortable level of pitch and loudness, and two at twice the comfortable loudness (elicited with the instruction twice as loud as the first time ). The signals were sampled at 24 khz at 16 bit

3 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 3 resolution. After initial processing to remove faulty phonations (e.g. patient coughing, failure to record phonation), we processed 4010 phonations for the male subjects, and 1865 phonations for the female subjects. Although the phonations were recorded weekly, the actual clinical assessments for motor-updrs and total-updrs were obtained at trial baseline, three months into the trial, and at six months into the trial. To obtain weekly UPDRS estimates to associate with the phonations we suggested using piecewise linear interpolation going exactly through the measured baseline, three-month and six-month UPDRS assessments [18], [13], [19], [20], [25]. This assertion builds on strong empirical evidence suggesting that average symptom progression in early PD stages (up to about five years) is almost linear in non-medicated patients as observed in clinical metrics [26], [27]. The PWP in the AHTD trial were in early PD stages (up to five years from disease diagnosis) and remained non-medicated for the duration of the trial, aspects which justify the use of piecewise linear interpolation when filling in missing data. The tacit assumption is that PD symptom severity did not fluctuate wildly within the intervals where the clinical scores were obtained. Finally, the interpolated UPDRS values are rounded to the nearest integer. For further details about the dataset and the AHTD data acquisition hardware, please refer to Tsanas et al. [13]. III. METHODS A. Simulation of the cellular mobile telephony network Creating a realistic simulation of the cellular voice telephony network requires the following steps: (a) encoding the AHTD speech signals into bit-streams for transmission, (b) simulate the transmitter, radio channel, and receiver, and (c) decode the transmitted bit-streams back into intelligible speech recordings. It is important to note that this application requires only one way (simplex) communication; PWP call into an automated voice messaging service and leave sustained vowel phonations. Predictions of symptom severity are extracted from these voice messages and clinical personnel suggest the appropriate course of action offline as a result of the estimated UPDRS. Moreover, the sustained vowel phonations need only be a few seconds long, that is, considerably shorter in duration than most telephone conversations. The schematic diagram of the communication system used in this study is presented in Fig. 1. Speech coding The original AHTD signals were sampled at 24 khz, 16 bits. Each speech signal needs to be coded appropriately for transmission over the communication network. 2G (GSM) and 3G (UMTS) cellular networks make use of the AMR- Narrowband codec. This is an Enhanced Full Rate (EFR) algebraic code-excited linear prediction (ACELP) algorithm, with 16-bit algebraic codebook and up to 4 non-zero (pitch) pulses per 160-sample frame. This compresses speech at 8 khz, 16 bits (64 kbps) into a 12.2 kbps bit-stream. It uses perceptually-weighting to mask the linear prediction residual encoding noise under the formant envelop. We refer to Chu [28] for further details about this speech codec. This AMR-Narrowband codec provides a high compression ratio (around 10:1) and very good intelligibility for spoken speech in non-noisy environments. Digital communication network: transmitter, channel, and receiver The main components of a digital communication system are the transmitter, the channel (physical medium connecting the transmitter and the receiver), and the receiver. The transmitter aims to assist the receiver to correctly recover the speech signal which may be distorted by the channel. Here, we very briefly explain this process and refer to Proakis for a comprehensive treatment [29]. We follow closely the studies of Tsanas [30] and Ampeliotis and Berberidis [31] for the practical implementation. After speech coding, the bit-stream is fed into the encoder. The transmission takes place in bursts, i.e. segments of the binary sequence. The GSM standard uses bits, which includes a 26 bit training sequence to estimate the channel. Here, to illustrate the proposed methodology we used 768 bits in each burst since the channel is assumed to be known to the receiver (very common assumption in the literature, e.g. [31]). The encoder inserts a controlled number of redundant bits to protect the binary data sequence from noise. We used a recursive systematic convolutional encoder with generator matrix ( ) [ (( ) ( ))] (this error correction method is used to relate bits adding additional redundant bits and hence maximize the probability of correct bit detection in the receiver), with rate R=1/2 (that is, for every bit of the actual signal, an additional redundant bit is introduced). The resulting bit sequence is fed into the interleaver which re-arranges (permutes) the coded bits, to decorrelate errors between adjacent symbols in the channel. Finally, the mapper takes blocks of the interleaved coded bits and transforms them into symbols suitable for transmission. Here, we used Gray-coding Quaternary Phase Shift Keying (QPSK) which uses two bits per symbol. The signal to noise ratio for the signal transmission was set to 10 db. The symbols per burst are transmitted over a channel which introduces additive white Gaussian noise and inter-symbol interference (ISI) which distort the transmitted signal. We assume that the channel is frequency selective and constant during each burst transmission, and also that it is known to the receiver (in practice the channel would need to be estimated using a training sequence embedded in the transmitted signal, which would be known a priori to the receiver). The channel is approximated by an equivalent discrete time linear filter. We used the standard Proakis C channel [29] which severely distorts the transmitted signal. The main components of a traditional receiver are the equalizer, de-mapper, de-interleaver, and decoder. The aim of the receiver is to accurately recover the original bit-stream. The equalizer exploits the structure of the transmitted symbol constellation to provide a symbol estimate, redressing the

4 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 4 detrimental effects of the channel distortion. In this study, we use an iterative turbo equalizer, which involves a feedback loop from decoder to equalizer to provide prior information about the received bits [32]. We used Tuchler s Minimum Mean Square Error (MMSE) equalizer as part of the turbo equalization scheme [33]. The de-mapper decomposes the received symbols into bits, the de-interleaver restores the initial order of bits (reversing the bit permutation that took place in the interleaver), and the decoder makes use of the added redundancy of the convolutional encoder in order to make an accurate estimation of the transmitted bits. We used a standard Maximum A Posteriori (MAP) decoder to decipher the transmitted sequence. Turbo-equalizers are characterized by iterating soft information (i.e. bit probabilities) between the equalizer and the decoder; we used five iterations before the final decision on the recovered bits. Finally, the output bits from the decoder are fed into the speech decoder, which reconstructs the digital speech signal, undoing the effect of the speech coding. B. Methodology to analyze the speech signals recovered at the receiver We follow three steps to process the recovered phonations and extract clinically useful information: (a) feature extraction, where we apply speech signal processing algorithms to characterize the phonations and extract characteristic patterns (features), (b) feature selection, where a parsimonious (small, information-rich) subset of the originally computed features is selected in order to provide maximally useful information for predicting UPDRS, and (c) feature mapping, where a standard supervised learning algorithm is used to determine a functional form associating the selected feature subset with the clinical outcome (UPDRS). The rationale behind this methodology is that characteristic acoustic patterns in PWP s voice are indicative of UPDRS. Although confounding factors may affect vocal performance (such as the subject s emotional state, or some pathological condition not related to PD), it is unlikely these contaminate more than a handful of the approximately 6000 recordings used here. Feature extraction We apply the dysphonia measures rigorously defined in Tsanas et al. [13] to the speech signals recovered at the receiver. We refer to that paper for detailed description of the concepts and rationale behind each algorithm. Here, we briefly describe the most important families of dysphonia measures used in this and other studies. Some of the most widely used dysphonia measures are jitter and shimmer [21], [34]. They seek to capture the physiological observation that the vocal fold vibration pattern is nearly periodic in healthy voices, whilst it is disturbed in pathological voices [34]. Jitter characterizes deviations in fundamental frequency (F0), whereas shimmer characterizes deviations in amplitude. There is no unique definition of those dysphonia measures, and we investigated many jitter and shimmer variants [13] which are algorithmic variations of the same Fig. 1. Schematic diagram of the digital communication process. The original signal, following speech coding, is inserted into the transmitter as a k. ISI stands for Intersymbol Interference. underlying concept. Quantifying vocal fold departure from near periodicity has inspired the development of the Recurrence Period Density Entropy (RPDE) [35], the Pitch Period Entropy (PPE) [36], the Glottis Quotient (GQ) [13], and F0-related measures [13]. GQ can be seen as an improved jitter-like family of measures, but working directly with vocal fold cycles instead of pre-specified segments (e.g. 10 ms) of the speech signal. RPDE expresses the uncertainty in vocal fold cycle duration. PPE quantifies the impaired control of F0 in sustained phonations, taking into account normal vibrato. The F0-related measures include statistical summaries of F0 distributions, and F0 differences compared to average age- and gender-matched healthy controls in the population. The second group of dysphonia measures characterize signal to noise ratio (SNR)-like quantities. The physiological motivation for this group is that incomplete vocal fold closure leads to the creation of aerodynamic vortices which result in increased acoustic noise. Harmonic to Noise Ratio (HNR) [34], Detrended Fluctuation Analysis (DFA) [35], Glottal to Noise Excitation (GNE) [37], Vocal Fold Excitation Ratio (VFER) [13], and Empirical Mode Decomposition Excitation Ratio (EMD-ER) [13] are typical examples. GNE and VFER analyze the frequency ranges of the signal in bands of 500 Hz. Empirically, we found that frequencies below 2.5 khz can be treated as signal, and everything above 2.5 khz can be treated as noise [13] to define SNR measures using energy, nonlinear energy (Teager-Kaiser energy operator) and entropy concepts. EMD-ER is similarly motivated: the Hilbert-Huang transform [38] decomposes the original signal into its constituent components in decreasing order of contributing

5 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 5 frequency. Then, the top (high frequency) components are taken to constitute noise, and the lower frequency components to constitute signal, to obtain SNR-like measures. Lastly, Mel Frequency Cepstral Coefficients (MFCC) have been traditionally used in speaker recognition applications, but also appear promising in biomedical speech signal processing contexts [13], [39], [40]. MFCCs detect subtle changes in the motion of the articulators (tongue, lips) which are known to be affected in PD [7]. Overall, we applied 132 dysphonia measures to the speech database, each dysphonia measure producing a single real value per voice sample, resulting in a design matrix of size for male PWP and a matrix of size for female PWP. There were no missing entries in either design matrix. Feature selection The use of a large number of features (132 in this study) makes it extremely difficult to discern meaningful patterns in the data, and may often be detrimental in the process of mapping the features onto the clinical outcome UPDRS. This problem is known as the curse of dimensionality, and arises because adequate population of the feature space requires that the number of voice samples increases exponentially with the number of features [41]. Contemporary algorithms that can map features onto outcomes may be very robust to the inclusion of potentially noisy or irrelevant features, and their predictive power may or may not be severely affected; however, a smaller feature set always facilitates insight into the problem by allowing interpretation of the most predictive features [42], [43]. An exhaustive search through all possible feature subset combinations is computationally impractical; feature selection (FS) algorithms are a principled approach to selecting a smaller (lower dimensional) feature subset. We refer to Guyon et al. [43] for a detailed overview of feature selection. Here, we compared four FS algorithms: (1) Least Absolute Shrinkage and Selection Operator (LASSO) [44], (2) Minimum Redundancy Maximum Relevance (mrmr) [45], (3) RELIEF [46], and (4) feature importance in Random Forests (RF) [47]. LASSO introduces the L1-norm penalty into classical linear regression, penalizing the absolute value of the coefficients; sweeping through the regularization parameter leads some regression coefficients to shrink to zero, which effectively indicates that the associated features are irrelevant and can be removed. The LASSO has oracle properties (correctly identifying all true predictive features) in sparse settings with low correlations amongst features, but if those conditions are violated, then non-predictive features may be selected [48]. The mrmr approach relies on a heuristic criterion to compromise between relevance (association strength of features with the outcome) and redundancy (association strength between pairs of features). It is a greedy algorithm (incrementally adding one feature at a time), and takes into account only pairwise redundancies neglecting complementarity (joint association of features towards predicting the response). RELIEF is a margin maximization algorithm [49], selecting features that contribute to the separation of samples from different classes. Inherently, RELIEF uses complementarity as part of the FS process, but does not account for feature redundancy. SIMBA is an extension of RELIEF, with a similar conceptual basis. Its advantage over RELIEF is that it reweights the samples as part of an updating algorithm which may lead to discarding redundant features. We defer discussion of the feature importance in RF to the next subsection. All these feature selection algorithms have demonstrated encouraging results in statistical machine learning contexts over a wide range of applications. We apply the methodology for FS selection that was previously described in Tsanas et al. [40], which is provided here again for completeness. The feature subsets were selected using a cross-validation (CV) scheme, where at each CV iteration only the training data was used. We used 10 CV iterations, where at each iteration the features ( ) for each FS algorithm are selected in descending order. Theoretically, this feature order should be identical for all 10 CV iterations, but often, in practice it is not. Therefore, we apply the following strategy to select the features that appeared most often under each of the FS algorithms. The ultimate aim is to determine one subset for each FS algorithm. For each FS algorithm we build an empty set which will contain the indices of the features selected. Feature indices are incrementally included in, one index at each step. For each step ( is a scalar with value ) we find the indices corresponding to the features selected in the search steps for the 10 CV iterations. The index which appears most frequently amongst the elements that is also not already included in, is now included as the th element in. Ties are resolved by including the lowest index number. We repeat this process for each of the FS algorithms. Contrary to the other FS algorithms studied here, LASSO may remove features which were selected in prior steps in its incremental search. Thus, prior to the voting scheme described above, we repeated the 10-fold CV process independently for each th step in LASSO, in order to obtain the best features. The final feature subset selected for each FS algorithm was used in the subsequent statistical mapping phase. Feature mapping In the preceding steps we have computed 132 characteristic patterns from the sustained vowel phonations, and subsequently applied feature selection techniques to obtain subsets of those features. Here, we aim to determine the functional relationship ( ), which maps the dysphonia measures ( ), where M is the number of features, to the outcome (response) y (motor-updrs and total-updrs in this study). That is, we want to obtain a classifier that will use the dysphonia measures to accurately predict UPDRS. There is a large literature on supervised classification, and we refer to Hastie et al. [42], and Bishop [41] for a broad overview of this area. Here, we experimented with two powerful classifiers: Random Forests (RF), and Support Vector Machines (SVM). RF is an intuitively appealing ensemble classifier, where a large number of decision trees (each of those decision trees is an individual base classifier with a prediction function ) vote for the most frequent class. RF has two free parameters: (a)

6 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 6 the number of decision trees (which should be fairly large; we used 500 decision trees), and (b) the number of features over which to search to construct each branch of each tree. Breiman [47] has shown that RF does not overfit with an increasing number of decision trees included in the ensemble, and also that it is fairly insensitive to the choice of the other free parameter. Following the suggestion of Breiman [47], we used the square root of the number of input features over which to search to construct each branch of each decision tree, and also experimented with doubling and halving this value. Another attractive aspect of RF is determining feature importance as an inherent part of search strategy for building the individual decision trees. Feature importance can be thought of as a weight associated with each feature, which is related to the use of that particular feature to split the data into separate regions, at the nodes of each decision tree. Hence, the feature importance concept accounts for relevance, redundancy, and complementarity, and has shown promising results in detecting parsimonious datasets [50]. We refer to Breiman [47] for further details about the specifics of RF and feature importance. SVMs were originally proposed for binary classification problems, and were subsequently extended to multi-class classification problems. In their basic form for binary classification, they aim to construct an optimal separating hyper-plane in the feature space, maximizing a geometric margin between samples from the two classes. In practice, typical data cannot be linearly separated; hence SVMs transform the data into a higher dimensional space using the kernel trick, and construct the separating hyperplane in that space [42]. Contrary to RF, SVMs are known to be particularly sensitive to the values of their free parameters [42], which vary depending on the choice of kernel. In this study, we used the LIBSVM implementation [51] and followed the suggestions of its developers: we linearly scaled each of the input features to lie in the range [-1, 1], and used a Gaussian radial basis function kernel (which has two free parameters requiring optimization). The determination of the optimal values of the kernel parameter γ and the penalty parameter C was decided using a grid search of possible values. We selected the pair ( ) that gave the lowest CV error metric (see the following section for details). Specifically, we searched over the grid ( ), where [ ], and [ ]. Once the optimal parameters and were determined, we trained and tested the SVM using these parameters. Cross-validation and model generalization As in previous studies [13], [18], [20] we used 10-fold CV to test the generalization performance of the RF and SVMs. Conceptually, CV provides a good estimate of the accuracy with which UPDRS may be predicted on a new dataset using the classifier described above, assuming the new dataset has similar statistical characteristics to the data used to train and test the classifier. Specifically, we split the initial dataset comprising (4010 for males and 1865 for females) phonations into a training (in sample) subset of (3609 and 1679) phonations and a testing (out of sample) subset of (401 and 186) phonations. For statistical confidence, the process was repeated a total of 100 times, randomly permuting the data each time before splitting into training and testing subsets. As in previous studies [13], [18], [19], [20], we used mean absolute error (MAE) to assess the model performance: (1) where is the predicted UPDRS and is the actual UPDRS for the i th entry in the training or testing subset, is the number of phonations in the training or testing subset, and Q contains the indices of that set. Errors over the 100 CV realizations were averaged. IV. RESULTS We computed correlation coefficients and normalized mutual information [13] to quantify the strength of statistical association of the features with the outcome variable, and then to compare this association strength to previous findings [13] (results not shown due to space constraints). We have found that, as expected, the features in the present study had lower association strength with motor-updrs and total-updrs than in earlier studies that used full bandwidth speech [13]. We report the accuracy with which UPDRS can be predicted in Table I for males, and Table II for females. For each FS algorithm, the final number of features is determined using the one standard error rule [42]: adhering to the principle of parsimony, we fix to be the number of features where MAE is up to one standard deviation larger than the globally lowest MAE obtained with the feature subsets from that FS algorithm. The MAE for motor-updrs is 2.91 ± 0.23 for males and 2.38 ± 0.23 for females, whilst the MAE for total-updrs is 3.43 ± 0.27 for males and 2.91 ± 0.27 for females. By comparison, in a recent study in this application where highquality, high-bandwidth, uncompressed speech signals from the AHTD were used instead, the MAE reported for total- UPDRS was [20]: 1.49 ± 0.14 for males, and 2.14 ± 0.25 for females. Overall, these results suggest that there is indeed clinically important information in the higher frequency ranges (that is discarded in the GSM codec), and this suggestion concurs with that of clinical speech specialists [21]. However, it seems that clinically useful information can still be extracted from much lower-quality, low-bandwidth, compressed speech signals. V. DISCUSSION We have extended previous findings which demonstrated that speech signals may be promising in telemonitoring for PD, by investigating the robustness of the GSM mobile telephone network for this application. We found strong evidence that the existing GSM network, which to-date reaches more than 5 billion subscribers, enables clinically accurate UPDRS estimation. In Tsanas et al. [20], where the high-quality signals

7 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 7 TABLE I SELECTED FEATURES AND CLASSIFICATION ERRORS FOR MALES LASSO mrmr RELIEF RF-imp 12 th MFCC 4 th MFCC IMF NSR,SEO DFA 8 th MFCC 8 th MFCC DFA 4 th MFCC 1 st delta MFCC VFER NSR,SEO 5 th MFCC 5 th MFCC 4 th MFCC 10 th MFCC 8 th MFCC 2 nd MFCC VFER SNR,TKEO HNR std 4 th MFCC IMF SNR,SEO 7 th MFCC 12 th MFCC 9 th MFCC VFER NSR,entropy 5 th MFCC IMF NSR,TKEO 3 rd MFCC Log energy GNE NSR,TKEO 0 th MFCC VFER NSR,entropy 10 th MFCC VFER SNR,entropy 9 th MFCC 2 nd MFCC 1 st MFCC Shimmer TKEOprc 75 VFER std 10 th MFCC 0 th MFCC 9 th delta MFCC 3 rd MFCC 12 th MFCC 9 th MFCC GNEstd VFER SNR,SEO 7 th MFCC Jitter F0,TKEO 11 th MFCC 5 th MFCC Log energy 3 rd MFCC IMF SNR,entropy Shimmer TKEO,prc 95 6 th MFCC 8 th MFCC Jitter pitchtkeo GNEstd 0 th MFCC 12 th MFCC HNR mean 6 th MFCC 11 th MFCC F0 Sun-F0 exp 3 rd delta MFCC 11 th MFCC 1 st MFCC F0 mix-f0 exp 3.43± ± ± ± ± ± ± ±0.27 The last row presents the Mean Absolute Error (MAE) when the selected features from each algorithm are fed into the Random Forests (RF) classification algorithm. The results are given in the form mean ± standard deviation and are out of sample computed using10-fold cross validation with 100 repetitions. The first entry in the last row is the motor-updrs MAE, and the second entry is the total-updrs MAE. obtained from Intel s AHTD were used, we reported that UPDRS could be estimated to within 1.5 UPDRS points for males and 2.2 UPDRS points for females; here we demonstrated that UPDRS can be estimated to within approximately 4.1 UPDRS points for males, and 3.8 UPDRS points for females. We argue that this loss in accuracy of UPDRS, which is due to bandwidth restriction and/or channel transmission error is acceptable in practice because most PWP who could benefit from remote symptom tracking, are unlikely to have access to expensive, dedicated hardware such as the AHTD. We emphasize that the accuracy with which UPDRS is estimated even in this scenario of restricted quality speech, is very close to the inter-rater variability (difference in UPDRS score between two expert clinicians), which is about 4-5 points [52]. Therefore, speech over GSM remains clinically useful here, and could be used as a decision support tool to aid clinicians in remote, non-invasive PD symptom TABLE II SELECTED FEATURES AND CLASSIFICATION ERRORS FOR FEMALES LASSO mrmr RELIEF RF-imp Log energy Log energy IMF NSR,SEO Log energy HNR mean 7 th MFCC 3 rd MFCC 0 th MFCC 7 th MFCC 9 th MFCC Log energy 3 rd MFCC PPE HNR mean 0 th MFCC HNR mean 5 th MFCC 3 rd MFCC 1 st MFCC 5 th MFCC HNR std Delta log energy Delta log energy 8 th MFCC 9 th MFCC Std F0 Praat DFA 1 st MFCC 4 th MFCC Shimmer PQ11 Jitter F0,FM NHR mean VFER NSR,enttropy 5 th MFCC Jitter pitchfm VFER SNR,SEO 10 th MFCC 2 nd delta MFCC 2 nd MFCC VFER SNR,TKEO GNEmean 0 th MFCC 4 th MFCC 10 th MFCC OQprc5-95 IMF SNR,entropy 12 th MFCC 11 th MFCC GNE NSR,TKEO Std F0 Rapt 5 th MFCC 8th MFCC 1 st delta MFCC 11 th MFCC HNR mean Jitter F0,TKEO 11 th MFCC 2.92± ± th delta MFCC 2.69± ±0.27 PPE 2.43± ± th MFCC 2.38± ±0.27 The last row presents the Mean Absolute Error (MAE) when the selected features from each algorithm are fed into the Random Forests (RF) classification algorithm. The results are given in the form mean ± standard deviation and are out of sample computed using10-fold cross validation with 100 repetitions. The first entry in the last row is the motor-updrs MAE, and the second entry is the total-updrs MAE. severity tracking. The automatic assessment of voice pathologies using signals transmitted over the public telephone network was recently shown to be promising in related applications [22], [23]. Concurring with previous findings, we have found that a parsimonious speech feature subset actually improves the outof-sample MAE, and is also more amenable to interpretation [13], [18], [20]. Random Forests outperformed SVMs consistently and significantly (p<0.001), although we cannot provide a clear theoretical justification for this finding. As indicated in previous studies [40], more detailed empirical and theoretical analysis is required to understand which classification algorithm is likely to lead to more accurate prediction for similar datasets. The non-classical dysphonia measures (mainly VFER and MFCCs) are consistently selected as the most predictive features by the feature selection algorithms. One very interesting new finding in this study is that UPDRS estimation in males deteriorates considerably more compared to UPDRS

8 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 8 estimation in females as a result of the lower quality speech signals. This may be related to the bandwidth restriction, but may also be a consequence of the finite bit allocation available to reproducing the pitch period with pitch pulses in the ACELP codec. It could also be due to the increased noise that is masked by the formants in the perceptually-weighted linear prediction: this noise may not be heard, but may, nonetheless, be important in PD. The results of this study confirm the established view in the clinical speech community suggesting that speech signals of at least 20 khz should be preferred in clinically applications because there is useful information in the higher frequencies of the spectrum [21]. Nevertheless, the performance degradation as a result of the use of the lower-quality GSM coding and communication framework is unlikely to be prohibitive for clinically useful UPDRS prediction. We conjecture that this may also be the case for other voice pathologies. We hypothesize that the speech community may have, hitherto, been overly pessimistic in the need for very high quality speech signals [21] in clinical speech science. We stress that the results reported in this study were obtained in a simulated digital communications framework involving the GSM standard. Additional tests in real-world contexts using actual mobile phones would be required to validate the robustness of the presented methodology. For example, we have not simulated the effect of drop-outs due to cell handoff, or switching between 2G/3G, or quality reduction due to the user not placing the phone close to their mouth. For this reason, our channel is chosen to be extremely noisy which introduces quite severe speech signal quality degradation. Additional factors that are hard to control, such as the frequency response of the mobile phone s microphone, might need to be taken into account in a real application. Telemonitoring in healthcare has received considerable attention lately, but global adoption is always constrained by the prohibitive costs associated with specialized telemonitoring hardware. The exploration of highly costeffective solutions, such as exploitation of existing cellular or PSTN telephone networks may be a critical step towards more widespread diffusion of this promising technology. We envisage the results of this study being a first step towards practical, affordable, and accurate telemonitoring of PD for the population at large. ACKNOWLEDGMENT We want to thank James McNames, Lucia M. Blasucci, Eric Dishman, Rodger Elble, Christopher G. Goetz, Andy S. Grove, Mark Hallett, Peter H. Kraus, Ken Kubota, John Nutt, Terence Sanger, Kapil D. Sethi, Ejaz A. Shamim, Helen Bronte-Stewart, Jennifer Spielman, Barr C. Taylor, David Wolff, and Allan D. Wu, who were responsible for the design and construction of the AHTD device and organizing the trials in which the data used in this study was collected. REFERENCES [1] C.W. Olanow, M.B. Stern, K. Sethi, The scientific and clinical basis for the treatment of Parkinson disease, Neurology, 72(21 Suppl 4):s1-s136, 2009 [2] M. Rajput, A. Rajput, A.H. Rajput. Epidemiology (chapter 2). In Handbook of Parkinson s disease, edited by R. Pahwa and K. E. Lyons, 4th edition, Informa Healthcare, USA, 2007 [3] A. S. von Campenhausen, B. Bornschein, R. Wick, K. Bötzel, C. Sampaio, W. Poewe, W. Oertel, U. Siebert, K. Berger, and R. Dodel, Prevalence and incidence of Parkinson's disease in Europe, European Neuropsychopharmacology, Vol. 15, pp , 2005 [4] A. Schrag, Y. Ben-Schlomo, N. Quinn. How valid is the clinical diagnosis of Parkinson s disease in the community? Journal of Neurology, Neurosurgery Psychiatry, Vol. 73, pp , 2002 [5] A.J. Lees, J. Hardy, T. Revesz, Parkinson's disease, Lancet, Vol. 13, pp , 2009 [6] B. Harel, M. Cannizzaro and P.J. Snyder. Variability in fundamental frequency during speech in prodromal and incipient Parkinson s disease: A longitudinal case study, Brain and Cognition 56, 24 29, 2004 [7] A. Ho, R. Iansek, C. Marigliani, J. Bradshaw, S. Gates. Speech impairment in a large sample of patients with Parkinson s disease. Behavioral Neurology 11, , 1998 [8] R.J. Holmes, J.M. Oates, D.J. Phyland, A.J. Hughes. Voice characteristics in the progression of Parkinson s disease. Int J Lang Comm Dis 35, , 2000 [9] S. Skodda, H. Rinsche, U. Schlegel. Progression of dysprosody in Parkinson s disease over time A longitudinal study. Movement Disorders 24 (5), , 2009 [10] N. Singh, V. Pillay, Y. E. Choonara. Advances in the treatment of Parkinson s disease, Progress in Neurobiology, Vol. 81, pp , 2007 [11] S. Fahn, R.L. Elton, Members of the UPDRS Development Committee. Unified Parkinson s disease rating scale. In: Fahn S, Marsden CD, Goldstein M, et al, eds. Recent developments in Parkinson s disease. Vol II. New Jersey: Macmillan Healthcare Information, 1987: [12] C. Ramaker, J. Marinus, A.M. Stiggelbout, B.J. van Hilten, Systematic evaluation of rating scales for impairment and disability in Parkinson s disease. Movement Disorders, Vol. 17, pp , 2002 [13] A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig, Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson s disease symptom severity, Journal of the Royal Society Interface, Vol. 8, , 2011 [14] A. Tsanas, M.A. Little, P.E. McSharry, B.K. Scanlon, S. Papapetropoulos, Statistical analysis and mapping of the Unified Parkinson s Disease Rating Scale to Hoehn and Yahr staging, Parkinsonism and Related Disorders, (in press) [15] C.G. Goetz. et al. Testing objective measures of motor impairment in early Parkinson s disease: Feasibility study of an at-home testing device. Movement Disorders 24 (4), , 2009 [16] S. Patel, K. Lorincz, R. Hughes, N. Huggins, J. Growdon, D. Standaert, M. Akay, J. Dy, M. Welsh, and P. Bonato, Monitoring Motor Fluctuations in Patients With Parkinson s Disease Using Wearable Sensors, IEEE Transactions on Information technology in biomedicine, Vol. 13 (6), pp , 2009 [17] S. Papapetropoulos, H. L. Katzen, B.K. Scanlon, A. Guevara, C. Singer, B.E. Levin, Objective Quantification of Neuromotor Symptoms in Parkinson's Disease: Implementation of a Portable, Computerized Measurement Tool, Parkinson s disease, , 2010 [18] A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig. Accurate telemonitoring of Parkinson s disease progression using non-invasive speech tests, IEEE Trans. Biomedical Engineering 57, , 2010 [19] A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig. Enhanced classical dysphonia measures and sparse regression for telemonitoring of Parkinson s disease progression, IEEE Signal Processing Society, International Conference on Acoustics, Speech and Signal Processing (ICASSP 10), Dallas, Texas, US, pp , 2010 [20] A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig, Robust parsimonious selection of dysphonia measures for telemonitoring of Parkinson's disease symptom severity, 7th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), pp , Florence, Italy, August 2011 [21] I.R. Titze. Principles of Voice Production. National Center for Voice and Speech, Iowa City, US, 2nd ed., 2000

9 > REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 9 [22] R.J. Moran, R.B. Reilly, P. de Chazal, P.D. Lacy, Telephony-Based Voice Pathology Assessment Using Automated Speech Analysis, IEEE Transactions on Biomedical Engineering, Vol. 53, pp , 2006 [23] T. Haderlein, E. Nöth, A. Batliner, U. Eysholdt, F.Rosanowski, Automatic intelligibility assessment of pathologic speech over the telephone, Logopedics Phoniatrics Vocology, Vol. 36, pp , 2011 [24] N. Saenz-Lechon, V. Osma-Ruiz, J.I. Godino-Llorente, M. Blanco- Velasco, F. Cruz-Roldan, J.D. Arias-Londono, Effects of Audio Compression in Automatic Detection of Voice Pathologies, IEEE Transactions on Biomedical Engineering, Vol. 55, pp , 2008 [25] A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig, New nonlinear markers and insights into speech signal degradation for effective tracking of Parkinson s disease symptom severity, International Symposium on Nonlinear Theory and its Applications (NOLTA), pp , Krakow, Poland, 5-8 September 2010 [26] M.W.M. Schüpbach, J-C. Corvol, V. Czernecki, M.B. Djebara, J-L. Golmard, Y. Agid and A. Hartmann: The segmental progression of early untreated Parkinson disease: a novel approach to clinical rating, J. Neurol. Neurosurg. Psychiatry, Vol. 81, pp , 2010 [27] W. Maetzler, I. Liepelt, D. Berg, Progression of Parkinson's disease in the clinical phase: potential markers, Lancet Neurology, Vol. 8, pp , 2009 [28] W.C. Chu, Speech Coding Algorithms: Foundation and Evolution of Standardized Coders, Wiley-Blackwell, 2003 [29] J. Proakis, Digital Communications, 4th ed. New York: McGraw-Hill, 2000 [30] A. Tsanas, Advanced equalisation schemes: trends of the future, M.Sc. thesis, Department of Electrical, Electronic, and Computer Engineering, Newcastle University, 2008 [31] D. Ampeliotis and K. Berberidis, Low Complexity Turbo Equalization for High Data Rate Wireless Communications, EURASIP Journal on Wireless Communications and Networking, pp. 1-12, 2006 [32] L. Hanzo, T.H. Liew and B.L. Yeap: Turbo Coding, Turbo Equalisation and Space-Time Coding, John Wiley & Sons, 2002 [33] M. Tuchler, R. Koetter and A. Singer, Turbo equalization: Principles and new results, IEEE Transactions on Communications, vol. 50, pp , 2002 [34] R.J. Baken and R.F. Orlikoff. Clinical measurement of speech and voice, San Diego: Singular Thomson Learning, 2nd ed., 2000 [35] M.A. Little, P.E. McSharry, S.J. Roberts, D. Costello, I.M. Moroz. Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection. Biomedical Engineering Online 6:23, 2007 [36] M.A. Little, P.E. McSharry, E.J. Hunter, J. Spielman, L.O. Ramig. Suitability of dysphonia measurements for telemonitoring of Parkinson s disease, IEEE Trans. Biomedical Engineering 56(4), , 2009 [37] D. Michaelis, T. Gramss, H.W. Strube, Glottal-to-Noise Excitation Ratio a New Measure for Describing Pathological Voices, Acta Acustica, Vol. 83, pp , 1997 [38] N.E. Huang, Z. Shen, S.R. Long, M.C. Wu, H.H. Shih, Q. Zheng, N.C. Yen, C.C. Tung, H.H. Liu, The empirical mode decomposition and the Hilbert spectrum for non-linear and non-stationary time series analysis, Proc. Royal Soc. London A, 454, , 1998 [39] J.I. Godino-Llorente, P. Gomez-Vilda, M. Blanco-Velasco, Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters, IEEE Transactions on Biomedical Engineering, Vol. 53, , 2006 [40] A. Tsanas, M.A. Little, P.E. McSharry, J. Spielman, L.O. Ramig, Novel speech signal processing algorithms for high-accuracy classification of Parkinson s disease, IEEE Transactions on Biomedical Engineering (in press) [41] C.M. Bishop, Pattern recognition and machine learning, Springer, 2007 [42] T. Hastie, R. Tibshirani, J. Friedman. The elements of statistical learning: data mining, inference, and prediction. Springer, 2nd ed., 2009 [43] I. Guyon, S. Gunn, M. Nikravesh, L. A. Zadeh (Eds.), Feature Extraction: Foundations and Applications, Springer, 2006 [44] R. Tibshirani. Regression Shrinkage and Selection via the Lasso. J. R. Statist. Soc. B 58, , 1996 [45] H. Peng, F. Long, C. Ding, Feature selection based on mutual information: criteria of max-dependency, max-relevance, and minredundancy, IEEE Transactions on Pattern Analysis and Machine Intelligence, 27: , 2005 [46] K. Kira and L.A. Rendell, A practical approach to feature selection, Proceedings of the Ninth International Conference on Machine Learning, , 1992 [47] L. Breiman, Random Forests, Machine Learning, 45, [48] N. Meinshausen and B. Yu, Lasso-type recovery of sparse representations for high-dimensional data, Annals of Statistics, Vol. 37(1), pp , 2009 [49] R. Gilad-Bachrach, A. Navot, and N. Tishby, Margin based feature selection - theory and algorithms, 21st International Conference on Machine learning (ICML), pp , 2004 [50] E. Tuv, A. Borisov, G. Runger and K. Torkkola. Feature Selection with Ensembles, Artificial Variables, and Redundancy Elimination. Journal of Machine Learning Research, 10: , 2009 [51] C.C. Chang and C-J. Lin, LIBSVM: a library for support vector machines, ACM Transactions on Intelligent Systems and Technology, Vol. 2, pp. 1-27, Software available at [52] B. Post, M.P. Merkus, R.M.A. de Bie, R.J. de Haan, J.D. Speelman, Unified Parkinson s Disease Rating Scale Motor Examination: Are Ratings of Nurses, Residents in Neurology, and Movement Disorders Specialists Interchangeable?, Movement Disorders, Vol. 20, pp , 2005

A methodology for the analysis of medical data

A methodology for the analysis of medical data Please cite this book chapter as: A. Tsanas, M.A. Little, P.E. McSharry, A methodology for the analysis of medical data, Chapter 7 in Handbook of Systems and Complexity in Health, Springer, 2013 A methodology

More information

Emotion Detection from Speech

Emotion Detection from Speech Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction

More information

Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests

Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests 1 Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests Authors: Athanasios Tsanas 1,2, Max A. Little 1,2, Patrick E. McSharry 1,2, Lorraine Ramig 3,4 Affiliations: 1

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Regularized Logistic Regression for Mind Reading with Parallel Validation

Regularized Logistic Regression for Mind Reading with Parallel Validation Regularized Logistic Regression for Mind Reading with Parallel Validation Heikki Huttunen, Jukka-Pekka Kauppi, Jussi Tohka Tampere University of Technology Department of Signal Processing Tampere, Finland

More information

Predict the Popularity of YouTube Videos Using Early View Data

Predict the Popularity of YouTube Videos Using Early View Data 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Voice services over Adaptive Multi-user Orthogonal Sub channels An Insight

Voice services over Adaptive Multi-user Orthogonal Sub channels An Insight TEC Voice services over Adaptive Multi-user Orthogonal Sub channels An Insight HP 4/15/2013 A powerful software upgrade leverages quaternary modulation and MIMO techniques to improve network efficiency

More information

How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3

How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3 Recognizing Voice Over IP: A Robust Front-End for Speech Recognition on the World Wide Web. By C.Moreno, A. Antolin and F.Diaz-de-Maria. Summary By Maheshwar Jayaraman 1 1. Introduction Voice Over IP is

More information

Data Mining Practical Machine Learning Tools and Techniques

Data Mining Practical Machine Learning Tools and Techniques Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea

More information

Social Media Mining. Data Mining Essentials

Social Media Mining. Data Mining Essentials Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers

More information

Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT)

Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT) Page 1 Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT) ECC RECOMMENDATION (06)01 Bandwidth measurements using FFT techniques

More information

AN ANALYSIS OF DELAY OF SMALL IP PACKETS IN CELLULAR DATA NETWORKS

AN ANALYSIS OF DELAY OF SMALL IP PACKETS IN CELLULAR DATA NETWORKS AN ANALYSIS OF DELAY OF SMALL IP PACKETS IN CELLULAR DATA NETWORKS Hubert GRAJA, Philip PERRY and John MURPHY Performance Engineering Laboratory, School of Electronic Engineering, Dublin City University,

More information

D-optimal plans in observational studies

D-optimal plans in observational studies D-optimal plans in observational studies Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

Ericsson T18s Voice Dialing Simulator

Ericsson T18s Voice Dialing Simulator Ericsson T18s Voice Dialing Simulator Mauricio Aracena Kovacevic, Anna Dehlbom, Jakob Ekeberg, Guillaume Gariazzo, Eric Lästh and Vanessa Troncoso Dept. of Signals Sensors and Systems Royal Institute of

More information

Revision of Lecture Eighteen

Revision of Lecture Eighteen Revision of Lecture Eighteen Previous lecture has discussed equalisation using Viterbi algorithm: Note similarity with channel decoding using maximum likelihood sequence estimation principle It also discusses

More information

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering

More information

PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUMBER OF REFERENCE SYMBOLS

PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUMBER OF REFERENCE SYMBOLS PHASE ESTIMATION ALGORITHM FOR FREQUENCY HOPPED BINARY PSK AND DPSK WAVEFORMS WITH SMALL NUM OF REFERENCE SYMBOLS Benjamin R. Wiederholt The MITRE Corporation Bedford, MA and Mario A. Blanco The MITRE

More information

Parkinson s prevalence in the United Kingdom (2009)

Parkinson s prevalence in the United Kingdom (2009) Parkinson s prevalence in the United Kingdom (2009) Contents Summary 3 Background 5 Methods 6 Study population 6 Data analysis 7 Results 7 Parkinson's prevalence 7 Parkinson's prevalence among males and

More information

Image Compression through DCT and Huffman Coding Technique

Image Compression through DCT and Huffman Coding Technique International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul

More information

AN1200.04. Application Note: FCC Regulations for ISM Band Devices: 902-928 MHz. FCC Regulations for ISM Band Devices: 902-928 MHz

AN1200.04. Application Note: FCC Regulations for ISM Band Devices: 902-928 MHz. FCC Regulations for ISM Band Devices: 902-928 MHz AN1200.04 Application Note: FCC Regulations for ISM Band Devices: Copyright Semtech 2006 1 of 15 www.semtech.com 1 Table of Contents 1 Table of Contents...2 1.1 Index of Figures...2 1.2 Index of Tables...2

More information

Supervised Feature Selection & Unsupervised Dimensionality Reduction

Supervised Feature Selection & Unsupervised Dimensionality Reduction Supervised Feature Selection & Unsupervised Dimensionality Reduction Feature Subset Selection Supervised: class labels are given Select a subset of the problem features Why? Redundant features much or

More information

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS 1. Bandwidth: The bandwidth of a communication link, or in general any system, was loosely defined as the width of

More information

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski trajkovski@nyus.edu.mk

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski trajkovski@nyus.edu.mk Introduction to Machine Learning and Data Mining Prof. Dr. Igor Trakovski trakovski@nyus.edu.mk Neural Networks 2 Neural Networks Analogy to biological neural systems, the most robust learning systems

More information

PCM Encoding and Decoding:

PCM Encoding and Decoding: PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth

More information

Evolution from Voiceband to Broadband Internet Access

Evolution from Voiceband to Broadband Internet Access Evolution from Voiceband to Broadband Internet Access Murtaza Ali DSPS R&D Center Texas Instruments Abstract With the growth of Internet, demand for high bit rate Internet access is growing. Even though

More information

Classification of Bad Accounts in Credit Card Industry

Classification of Bad Accounts in Credit Card Industry Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition

More information

Appendix C GSM System and Modulation Description

Appendix C GSM System and Modulation Description C1 Appendix C GSM System and Modulation Description C1. Parameters included in the modelling In the modelling the number of mobiles and their positioning with respect to the wired device needs to be taken

More information

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning. Lecture Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott

More information

Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation

Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation Sunil Karthick.M PG Scholar Department of ECE Kongu Engineering College Perundurau-638052 Venkatachalam.S Assistant Professor

More information

Non-Data Aided Carrier Offset Compensation for SDR Implementation

Non-Data Aided Carrier Offset Compensation for SDR Implementation Non-Data Aided Carrier Offset Compensation for SDR Implementation Anders Riis Jensen 1, Niels Terp Kjeldgaard Jørgensen 1 Kim Laugesen 1, Yannick Le Moullec 1,2 1 Department of Electronic Systems, 2 Center

More information

An Algorithm for Automatic Base Station Placement in Cellular Network Deployment

An Algorithm for Automatic Base Station Placement in Cellular Network Deployment An Algorithm for Automatic Base Station Placement in Cellular Network Deployment István Törős and Péter Fazekas High Speed Networks Laboratory Dept. of Telecommunications, Budapest University of Technology

More information

Lezione 6 Communications Blockset

Lezione 6 Communications Blockset Corso di Tecniche CAD per le Telecomunicazioni A.A. 2007-2008 Lezione 6 Communications Blockset Ing. Marco GALEAZZI 1 What Is Communications Blockset? Communications Blockset extends Simulink with a comprehensive

More information

Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm

Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm 1 Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm Hani Mehrpouyan, Student Member, IEEE, Department of Electrical and Computer Engineering Queen s University, Kingston, Ontario,

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

The Effect of Network Cabling on Bit Error Rate Performance. By Paul Kish NORDX/CDT

The Effect of Network Cabling on Bit Error Rate Performance. By Paul Kish NORDX/CDT The Effect of Network Cabling on Bit Error Rate Performance By Paul Kish NORDX/CDT Table of Contents Introduction... 2 Probability of Causing Errors... 3 Noise Sources Contributing to Errors... 4 Bit Error

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

The Periodic Moving Average Filter for Removing Motion Artifacts from PPG Signals

The Periodic Moving Average Filter for Removing Motion Artifacts from PPG Signals International Journal The of Periodic Control, Moving Automation, Average and Filter Systems, for Removing vol. 5, no. Motion 6, pp. Artifacts 71-76, from December PPG s 27 71 The Periodic Moving Average

More information

Less naive Bayes spam detection

Less naive Bayes spam detection Less naive Bayes spam detection Hongming Yang Eindhoven University of Technology Dept. EE, Rm PT 3.27, P.O.Box 53, 5600MB Eindhoven The Netherlands. E-mail:h.m.yang@tue.nl also CoSiNe Connectivity Systems

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical

More information

Speech and Network Marketing Model - A Review

Speech and Network Marketing Model - A Review Jastrzȩbia Góra, 16 th 20 th September 2013 APPLYING DATA MINING CLASSIFICATION TECHNIQUES TO SPEAKER IDENTIFICATION Kinga Sałapa 1,, Agata Trawińska 2 and Irena Roterman-Konieczna 1, 1 Department of Bioinformatics

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

MODULATION Systems (part 1)

MODULATION Systems (part 1) Technologies and Services on Digital Broadcasting (8) MODULATION Systems (part ) "Technologies and Services of Digital Broadcasting" (in Japanese, ISBN4-339-62-2) is published by CORONA publishing co.,

More information

HD Radio FM Transmission System Specifications Rev. F August 24, 2011

HD Radio FM Transmission System Specifications Rev. F August 24, 2011 HD Radio FM Transmission System Specifications Rev. F August 24, 2011 SY_SSS_1026s TRADEMARKS HD Radio and the HD, HD Radio, and Arc logos are proprietary trademarks of ibiquity Digital Corporation. ibiquity,

More information

QAM Demodulation. Performance Conclusion. o o o o o. (Nyquist shaping, Clock & Carrier Recovery, AGC, Adaptive Equaliser) o o. Wireless Communications

QAM Demodulation. Performance Conclusion. o o o o o. (Nyquist shaping, Clock & Carrier Recovery, AGC, Adaptive Equaliser) o o. Wireless Communications 0 QAM Demodulation o o o o o Application area What is QAM? What are QAM Demodulation Functions? General block diagram of QAM demodulator Explanation of the main function (Nyquist shaping, Clock & Carrier

More information

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification Department of Telecommunication Engineering Hijjawi Faculty for Engineering Technology Yarmouk University Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification Prepared by Orobh

More information

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

2695 P a g e. IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India

2695 P a g e. IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India Integrity Preservation and Privacy Protection for Digital Medical Images M.Krishna Rani Dr.S.Bhargavi IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India Abstract- In medical treatments, the integrity

More information

8. Cellular Systems. 1. Bell System Technical Journal, Vol. 58, no. 1, Jan 1979. 2. R. Steele, Mobile Communications, Pentech House, 1992.

8. Cellular Systems. 1. Bell System Technical Journal, Vol. 58, no. 1, Jan 1979. 2. R. Steele, Mobile Communications, Pentech House, 1992. 8. Cellular Systems References 1. Bell System Technical Journal, Vol. 58, no. 1, Jan 1979. 2. R. Steele, Mobile Communications, Pentech House, 1992. 3. G. Calhoun, Digital Cellular Radio, Artech House,

More information

IN current film media, the increase in areal density has

IN current film media, the increase in areal density has IEEE TRANSACTIONS ON MAGNETICS, VOL. 44, NO. 1, JANUARY 2008 193 A New Read Channel Model for Patterned Media Storage Seyhan Karakulak, Paul H. Siegel, Fellow, IEEE, Jack K. Wolf, Life Fellow, IEEE, and

More information

How To Use Neural Networks In Data Mining

How To Use Neural Networks In Data Mining International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and

More information

Using artificial intelligence for data reduction in mechanical engineering

Using artificial intelligence for data reduction in mechanical engineering Using artificial intelligence for data reduction in mechanical engineering L. Mdlazi 1, C.J. Stander 1, P.S. Heyns 1, T. Marwala 2 1 Dynamic Systems Group Department of Mechanical and Aeronautical Engineering,

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin

More information

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Sung-won ark and Jose Trevino Texas A&M University-Kingsville, EE/CS Department, MSC 92, Kingsville, TX 78363 TEL (36) 593-2638, FAX

More information

Log-Likelihood Ratio-based Relay Selection Algorithm in Wireless Network

Log-Likelihood Ratio-based Relay Selection Algorithm in Wireless Network Recent Advances in Electrical Engineering and Electronic Devices Log-Likelihood Ratio-based Relay Selection Algorithm in Wireless Network Ahmed El-Mahdy and Ahmed Walid Faculty of Information Engineering

More information

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

Lecture 10: Regression Trees

Lecture 10: Regression Trees Lecture 10: Regression Trees 36-350: Data Mining October 11, 2006 Reading: Textbook, sections 5.2 and 10.5. The next three lectures are going to be about a particular kind of nonlinear predictive model,

More information

Coding and decoding with convolutional codes. The Viterbi Algor

Coding and decoding with convolutional codes. The Viterbi Algor Coding and decoding with convolutional codes. The Viterbi Algorithm. 8 Block codes: main ideas Principles st point of view: infinite length block code nd point of view: convolutions Some examples Repetition

More information

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

By choosing to view this document, you agree to all provisions of the copyright laws protecting it. This material is posted here with permission of the IEEE Such permission of the IEEE does not in any way imply IEEE endorsement of any of Helsinki University of Technology's products or services Internal

More information

PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS

PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS The 21 st International Congress on Sound and Vibration 13-17 July, 2014, Beijing/China PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS Dan Wang, Nanjie Yan and Jianxin Peng*

More information

Bootstrapping Big Data

Bootstrapping Big Data Bootstrapping Big Data Ariel Kleiner Ameet Talwalkar Purnamrita Sarkar Michael I. Jordan Computer Science Division University of California, Berkeley {akleiner, ameet, psarkar, jordan}@eecs.berkeley.edu

More information

Data, Measurements, Features

Data, Measurements, Features Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are

More information

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional

More information

Using telehealth to deliver speech treatment for Parkinson s into the home: Outcomes & satisfaction

Using telehealth to deliver speech treatment for Parkinson s into the home: Outcomes & satisfaction Using telehealth to deliver speech treatment for Parkinson s into the home: Outcomes & satisfaction Deborah Theodoros PhD Anne Hill PhD Trevor Russell PhD Telerehabilitation Research Unit Parkinson s Australia

More information

The Artificial Prediction Market

The Artificial Prediction Market The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University

More information

Perceived Speech Quality Prediction for Voice over IP-based Networks

Perceived Speech Quality Prediction for Voice over IP-based Networks Perceived Speech Quality Prediction for Voice over IP-based Networks Lingfen Sun and Emmanuel C. Ifeachor Department of Communication and Electronic Engineering, University of Plymouth, Plymouth PL 8AA,

More information

SURVIVABILITY OF COMPLEX SYSTEM SUPPORT VECTOR MACHINE BASED APPROACH

SURVIVABILITY OF COMPLEX SYSTEM SUPPORT VECTOR MACHINE BASED APPROACH 1 SURVIVABILITY OF COMPLEX SYSTEM SUPPORT VECTOR MACHINE BASED APPROACH Y, HONG, N. GAUTAM, S. R. T. KUMARA, A. SURANA, H. GUPTA, S. LEE, V. NARAYANAN, H. THADAKAMALLA The Dept. of Industrial Engineering,

More information

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Mobile Phone APP Software Browsing Behavior using Clustering Analysis Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis

More information

Solutions to Exam in Speech Signal Processing EN2300

Solutions to Exam in Speech Signal Processing EN2300 Solutions to Exam in Speech Signal Processing EN23 Date: Thursday, Dec 2, 8: 3: Place: Allowed: Grades: Language: Solutions: Q34, Q36 Beta Math Handbook (or corresponding), calculator with empty memory.

More information

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically. Sampling Theorem We will show that a band limited signal can be reconstructed exactly from its discrete time samples. Recall: That a time sampled signal is like taking a snap shot or picture of signal

More information

Spot me if you can: Uncovering spoken phrases in encrypted VoIP conversations

Spot me if you can: Uncovering spoken phrases in encrypted VoIP conversations Spot me if you can: Uncovering spoken phrases in encrypted VoIP conversations C. Wright, L. Ballard, S. Coull, F. Monrose, G. Masson Talk held by Goran Doychev Selected Topics in Information Security and

More information

Cross-Validation. Synonyms Rotation estimation

Cross-Validation. Synonyms Rotation estimation Comp. by: BVijayalakshmiGalleys0000875816 Date:6/11/08 Time:19:52:53 Stage:First Proof C PAYAM REFAEILZADEH, LEI TANG, HUAN LIU Arizona State University Synonyms Rotation estimation Definition is a statistical

More information

How To Understand The Theory Of Time Division Duplexing

How To Understand The Theory Of Time Division Duplexing Multiple Access Techniques Dr. Francis LAU Dr. Francis CM Lau, Associate Professor, EIE, PolyU Content Introduction Frequency Division Multiple Access Time Division Multiple Access Code Division Multiple

More information

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic

More information

Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics

Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics Unlocking Value from Patanjali V, Lead Data Scientist, Anand B, Director Analytics Consulting, EXECUTIVE SUMMARY Today a lot of unstructured data is being generated in the form of text, images, videos

More information

Advanced Signal Processing and Digital Noise Reduction

Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK WILEY HTEUBNER A Partnership between John Wiley & Sons and B. G. Teubner Publishers Chichester New

More information

Information, Entropy, and Coding

Information, Entropy, and Coding Chapter 8 Information, Entropy, and Coding 8. The Need for Data Compression To motivate the material in this chapter, we first consider various data sources and some estimates for the amount of data associated

More information

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya (LKaya@ieee.org) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper

More information

INTER CARRIER INTERFERENCE CANCELLATION IN HIGH SPEED OFDM SYSTEM Y. Naveena *1, K. Upendra Chowdary 2

INTER CARRIER INTERFERENCE CANCELLATION IN HIGH SPEED OFDM SYSTEM Y. Naveena *1, K. Upendra Chowdary 2 ISSN 2277-2685 IJESR/June 2014/ Vol-4/Issue-6/333-337 Y. Naveena et al./ International Journal of Engineering & Science Research INTER CARRIER INTERFERENCE CANCELLATION IN HIGH SPEED OFDM SYSTEM Y. Naveena

More information

THE SVM APPROACH FOR BOX JENKINS MODELS

THE SVM APPROACH FOR BOX JENKINS MODELS REVSTAT Statistical Journal Volume 7, Number 1, April 2009, 23 36 THE SVM APPROACH FOR BOX JENKINS MODELS Authors: Saeid Amiri Dep. of Energy and Technology, Swedish Univ. of Agriculture Sciences, P.O.Box

More information

CDMA TECHNOLOGY. Brief Working of CDMA

CDMA TECHNOLOGY. Brief Working of CDMA CDMA TECHNOLOGY History of CDMA The Cellular Challenge The world's first cellular networks were introduced in the early 1980s, using analog radio transmission technologies such as AMPS (Advanced Mobile

More information

MASTER'S THESIS. Improved Power Control for GSM/EDGE

MASTER'S THESIS. Improved Power Control for GSM/EDGE MASTER'S THESIS 2005:238 CIV Improved Power Control for GSM/EDGE Fredrik Hägglund Luleå University of Technology MSc Programmes in Engineering Department of Computer Science and Electrical Engineering

More information

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA Tuija Niemi-Laitinen*, Juhani Saastamoinen**, Tomi Kinnunen**, Pasi Fränti** *Crime Laboratory, NBI, Finland **Dept. of Computer

More information

II. RELATED WORK. Sentiment Mining

II. RELATED WORK. Sentiment Mining Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract

More information

Time series analysis of data from stress ECG

Time series analysis of data from stress ECG Communications to SIMAI Congress, ISSN 827-905, Vol. 3 (2009) DOI: 0.685/CSC09XXX Time series analysis of data from stress ECG Camillo Cammarota Dipartimento di Matematica La Sapienza Università di Roma,

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

Implementation of Digital Signal Processing: Some Background on GFSK Modulation

Implementation of Digital Signal Processing: Some Background on GFSK Modulation Implementation of Digital Signal Processing: Some Background on GFSK Modulation Sabih H. Gerez University of Twente, Department of Electrical Engineering s.h.gerez@utwente.nl Version 4 (February 7, 2013)

More information

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence Artificial Neural Networks and Support Vector Machines CS 486/686: Introduction to Artificial Intelligence 1 Outline What is a Neural Network? - Perceptron learners - Multi-layer networks What is a Support

More information

Winning the Kaggle Algorithmic Trading Challenge with the Composition of Many Models and Feature Engineering

Winning the Kaggle Algorithmic Trading Challenge with the Composition of Many Models and Feature Engineering IEICE Transactions on Information and Systems, vol.e96-d, no.3, pp.742-745, 2013. 1 Winning the Kaggle Algorithmic Trading Challenge with the Composition of Many Models and Feature Engineering Ildefons

More information

Meta-learning. Synonyms. Definition. Characteristics

Meta-learning. Synonyms. Definition. Characteristics Meta-learning Włodzisław Duch, Department of Informatics, Nicolaus Copernicus University, Poland, School of Computer Engineering, Nanyang Technological University, Singapore wduch@is.umk.pl (or search

More information

Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc

Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc (International Journal of Computer Science & Management Studies) Vol. 17, Issue 01 Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc Dr. Khalid Hamid Bilal Khartoum, Sudan dr.khalidbilal@hotmail.com

More information

Automatic Evaluation Software for Contact Centre Agents voice Handling Performance

Automatic Evaluation Software for Contact Centre Agents voice Handling Performance International Journal of Scientific and Research Publications, Volume 5, Issue 1, January 2015 1 Automatic Evaluation Software for Contact Centre Agents voice Handling Performance K.K.A. Nipuni N. Perera,

More information

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

MUSICAL INSTRUMENT FAMILY CLASSIFICATION MUSICAL INSTRUMENT FAMILY CLASSIFICATION Ricardo A. Garcia Media Lab, Massachusetts Institute of Technology 0 Ames Street Room E5-40, Cambridge, MA 039 USA PH: 67-53-0 FAX: 67-58-664 e-mail: rago @ media.

More information

Component Ordering in Independent Component Analysis Based on Data Power

Component Ordering in Independent Component Analysis Based on Data Power Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals

More information