Abant Izzet Baysal University



Similar documents
Speech Signal Processing: An Overview

FOURIER TRANSFORM BASED SIMPLE CHORD ANALYSIS. UIUC Physics 193 POM

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

WAVELET ANALYSES OF SOME ATMOSPHERIC PARAMETERS AT BLACK SEA REGION Zafer ASLAN, Zehra Nevin ÇAĞLAR and D.Nail YENİÇERİ

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Establishing the Uniqueness of the Human Voice for Security Applications

WINDAQ Data Acquisition and Playback Software

Emotion Detection from Speech

Hideo Okawara s Mixed Signal Lecture Series. DSP-Based Testing Fundamentals 46 Per-pin Signal Generator

Power Quality Analysis and Evaluation at Water Pumping Station Supplied by Medium Voltage

SR2000 FREQUENCY MONITOR

School Class Monitoring System Based on Audio Signal Processing

Electronic Systems Engineering Department, Turkish Naval Academy, Naval Sciences and Engineering Institute, Tuzla, Istanbul 1 moun@dho.edu.

SIGNAL AND SYSTEM LEVEL SIMULATIONS ON WIDEBAND INTERCEPT RECEIVERS

San José State University Department of Electrical Engineering EE 112, Linear Systems, Spring 2010

TOTAL FACTOR PRODUCTIVITY AND ECONOMIC GROWTH

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

ANALYZER BASICS WHAT IS AN FFT SPECTRUM ANALYZER? 2-1


Thirukkural - A Text-to-Speech Synthesis System

Voice Dialer Speech Recognition Dialing IC

NEW SECOND GENERATION CURRENT CONVEYOR-BASED CURRENT-MODE FIRST ORDER ALL-PASS FILTER AND QUADRATURE OSCILLATOR

Lecture 1-6: Noise and Filters

Waveforms and the Speed of Sound

Sampling and Interpolation. Yao Wang Polytechnic University, Brooklyn, NY11201

Simulation and Analysis of PWM Inverter Fed Induction Motor Drive

B3. Short Time Fourier Transform (STFT)

Lecture 1-10: Spectrograms

Lecture 4: Jan 12, 2005

Voltage. Oscillator. Voltage. Oscillator

Detection of Leak Holes in Underground Drinking Water Pipelines using Acoustic and Proximity Sensing Systems

PURE TONE AUDIOMETER

Automatic Evaluation Software for Contact Centre Agents voice Handling Performance

Analog Representations of Sound

Program curriculum for graduate studies in Speech and Music Communication

PeakVue Analysis for Antifriction Bearing Fault Detection

Figure1. Acoustic feedback in packet based video conferencing system

LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING

INTRODUCTION TO COMMUNICATION SYSTEMS AND TRANSMISSION MEDIA

SPEECH Biswajeet Sarangi, B.Sc.(Audiology & speech Language pathology)

FAST Fourier Transform (FFT) and Digital Filtering Using LabVIEW

RF Measurements Using a Modular Digitizer

JOURNAL OF TELECOMMUNICATIONS, VOLUME 2, ISSUE 2, MAY Simulink based VoIP Analysis. Hardeep Singh Dalhio, Jasvir Singh, M.

Artificial Neural Network for Speech Recognition

Example #1: Controller for Frequency Modulated Spectroscopy

Aircraft cabin noise synthesis for noise subjective analysis

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Taking the Mystery out of the Infamous Formula, "SNR = 6.02N dB," and Why You Should Care. by Walt Kester

L9: Cepstral analysis

SIGNAL PROCESSING FOR EFFECTIVE VIBRATION ANALYSIS

Introduction to Digital Audio

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Audio Editing. Syllabus

PCM Encoding and Decoding:

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

SGN-1158 Introduction to Signal Processing Test. Solutions

A DESIGN OF DSPIC BASED SIGNAL MONITORING AND PROCESSING SYSTEM

A Network Analyzer For Active Components

Basics of Digital Recording

Analysis/resynthesis with the short time Fourier transform

THE COLOUR SIGNALING PROCESS IN TURKISH SIGN LANGUAGE

Timing Errors and Jitter

: Instructor

Tutorial about the VQR (Voice Quality Restoration) technology

CHAPTER 6 Frequency Response, Bode Plots, and Resonance

WAVELET ANALYSIS BASED ULTRASONIC NONDESTRUCTIVE TESTING OF POLYMER BONDED EXPLOSIVE

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers

Agilent Creating Multi-tone Signals With the N7509A Waveform Generation Toolbox. Application Note

Analog-to-Digital Voice Encoding

Matlab GUI for WFB spectral analysis

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

Teaching Fourier Analysis and Wave Physics with the Bass Guitar

The Calculation of G rms

VOICE RECOGNITION KIT USING HM2007. Speech Recognition System. Features. Specification. Applications

Sound absorption and acoustic surface impedance

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.)

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

Feasibility Study of Implementation of Cell Phone Controlled, Password Protected Door Locking System

Appendices. Appendix A: List of Publications

Faculty of Technical Education Electrical Department ABSTRACT

Voice Authentication for ATM Security

CS Introduction to Data Mining Instructor: Abdullah Mueen

Synchronization of sampling in distributed signal processing systems

A secure face tracking system

Experiment # (4) AM Demodulator

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Lecture 8 ELE 301: Signals and Systems

A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song

The Algorithms of Speech Recognition, Programming and Simulating in MATLAB

Current mode all-pass filter using a single CDBA and its application

- T-Carrier Technologies -

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

Transcription:

TÜRKÇE SESLERİN İSTATİSTİKSEL ANALİZİ Pakize ERDOGMUS (1) Ali ÖZTÜRK (2) Abant Izzet Baysal University Technical Education Faculty Electronic and Comp. Education Dept. Asit. Prof. Abant İzzet Baysal Üniversitesi Technical Education Faculty Electrical Education Dept. Lecturer ÖZET Ses işleme, sesin zaman ve frekans özelliklerini temel alır. Ses gerçekte analog bir işaret iken, sayısal işarete çevrilerek bilgisayara aktarılan ses bir sayı dizisi haline gelir. Bu çalışmada, sayısal hale getirilmiş seslerin(çalışmada harfler kullanılmıştır.) zaman ve frekans domenindeki yapıları Matlab ortamında istatistiksel olarak incelenmiş ve sonuçları sunulmuştur. Anahtar Kelimeler: Sayısal İşaret İşleme; Ses işleme; Ses Tanıma; Ses Sentezi; İstatistiksel Analiz THE STATISTICAL ANALYSIS OF TURKISH LETTERS ABSTRACT Speech processing is based on time and frequency properties of signal. Originally, speech is an analog signal, but it is converted to digital signal for processing in computer. And it is an array for computer. In this study, it has been studied Turkish letters time and frequency domain analysis as statistically in Matlab and Excel. And the interesting results has been submitted. Key Words: Digital Signal Processing, Speech Processing; Speech Recognition; Speech Synthesis; Statistical Analysis 1. INTRODUCTION Sound waves are variations in air pressure. And two key features of sound wave are its amplitude which is a measure of air pressure at a point in time and frequency which is the rate at which this amplitude varies.[1] Speech is also a sound wave produced by human via vocal fold, velum, tongue and e.t.c. Speech is an effortless and highly efficent form of communication that doesn t require any special equipment. Speech Processing has the fallowing sub titles. [2] Coding: To store a speech using as few bits as possible while retaining high quality Synthesis: To convert a text string into a speech waveform Recognition: To convert a speech waveform into text Identity Verification Enhancement Speech is an analog signal that can be converted to digital signal easily. One of the speech feature is its amplitude stated before. This can be obtained easily from the recorded digital speech signal. The other speech feature,frequency are usually obtained via Fourier transforms (FTs), Short time fourier transforms (STFTs) or Linear Predictive Coding techniques. In this study we obtained this feature via FFT. The most popular titles in speech processing are speech recognition and speech synthesis because of the fact that human-man interaction is very important especially for both disabilities and children. Even thought many speech/voice processing tasks, like speech and

word recognition, reached satisfactory performance levels on specific applications, and although a variety of commercial products were launched in the last decade, many problems remain an open research area, and absolute solutions were not found out, yet.[3] The main obstacles slowing down these studies are the difference speaking style and grammer. 2 SPEECH PROCESSING In this study, firstly 18 Turkish speech letters were recorded by microphone with Windows Sound Recorder. The sampling rate was used to as 22.050 khz with 8-bit for all recorded letters. Table 1. Turkish Letters Used in This Study Consanants B C D F G H P S T Z Vowels A E I İ O Ö U Ü Then it can be possible to filter and denoise these letters signals. But when it was applied some signals, it has been seen that it is redundant for this study. But it is quite easy to denoise signals with Matlab Wavelet Toolbox. It could be possible to record letters spoken by different peoples. As it has been known that signal can change according to the person. Child, female and male voices may have a bit different features. But since this study is not a pure speech recognition study, letters has been spoken by a person. Secondly, it has been applied FFT in order to get frequency features of these signals. Wovels and constonants time representation has been given in Figure 1 and Figure 2 respectively. Figure 1.Turkish Vowels Time Representation(a,e,ı,i,o,ö,u,,ü)

Figure 2. Turkish Constanants Time Representation(b,c,d,f,g,h,p,s,t,z) And when it was observed some of these letters time presentations are quite similar. So it has been taken FFT in Matlab and some of these letter pairs frequency representation is as follows. It has been taken FFT for ı,o,e,i vowels and s,z,c,f constanants. Figure 3. s-z Frequency Spectrum

Figure 4. c-f Frequency Spectrum Figure 5. i-o Frequency Spectrum

3 STATISTICAL ANALYSIS Recorded signals have been read with Matlab as a number array with Wavread Command. It has been calculated mean, standart deviation and modes of these arrays in time domain. It has been shown in Table 2. These statistics can be obtained from the Matlab figure window easily. But it has been calculated by Excel. Table 2. The statistics of Letter signals Letters Mean Absolute Mean Std. Deviation A -0,00395 0,012367 0,020343 E -0,00369 0,009688 0,018192 I -0,0039 0,011522 0,02428 i -0,00382 0,009856 0,016072 O -0,00396 0,074739 0,165849 ö -0,0039 0,011594 0,024669 U -0,00407 0,0129 0,028433 ü -0,00387 0,011119 0,023834 B -0,00413 0,005926 0,009666 C -0,00385 0,007506 0,013891 D -0,00394 0,006431 0,010865 F -0,00413 0,007601 0,013222 G -0,00389 0,037777 0,081246 H -0,00406 0,019653 0,040311 P -0,00384 0,015176 0,03035 S -0,00389 0,030842 0,054286 T -0,00393 0,045256 0,083597 Z -0,00353 0,021545 0,034016 As it has been seen in the table the highest standart deviations of the letters are belong to g and t. And the lowest standart deviation is belong to i. Regression between I and O vowels' frequency values o vowel 0,12 0,1 0,08 0,06 0,04 0,02 0 y = 0,6229x 3-1,0733x 2 + 0,5694x + 0,003 R 2 = 0,6196 0 0,2 0,4 0,6 0,8 1 I vowel Figure 6. The Regression between the i-o vowels

Regression between c-f constanant f constanant 2 1,5 1 0,5 0-0,5 0 0,5 1 1,5 2-0,5 c constanant y = 0,9987x + 6E-05 R 2 = 0,9975 Seri 1 Doğrusal (Seri 1) Figure 7. The Regression between the c-f letter. 4 RESULTS In this study, it has been aimed to find some useful information using for speech recognition of the lettters. And as it has been seen both in tables and figures there are a lot of useful information to use. Especially statistics can be used for recognition parts paremeters. And as it has been in the figures some letters are very similar. The correlation coefficent is quite meaningful for these letters. So the process for these letters recognition will be much more hard. These study will continue as Turkish letter recognition. 5 REFERENCES [1] CALLAN R., Artificial Intelligence, Palgrave Mcmillan, 2003. [2] http://www.ee.ic.ac.uk/hp/staff/dmb/courses/speech/overview.pdf [3] Avcı, E., Akpolat, Z. H., Speech Recognition Using A Wavelet Packet Adaptive Network Based Fuzzy Inference System, Expert System with Applications, Vol 31, 2006.