Some Commonly Used Speech Feature Extraction Algorithms

Alim, Sabur Ajibola; Rashid, Nahrul Khair Alang

Authors: Sabur Ajibola Alim, Nahrul Khair Alang Rashid
Publication date: 12 December 2018
Publisher: IntechOpen

Abstract

Speech is a complex, naturally acquired human motor ability. In adults, it is characterized by the production of about 14 different sounds per second via the harmonized actions of roughly 100 muscles. Speaker recognition is the capability of software or hardware to receive a speech signal, identify the speaker present in it, and recognize that speaker afterwards. Feature extraction is accomplished by changing the speech waveform to a form of parametric representation at a relatively minimized data rate for subsequent processing and analysis. Acceptable classification therefore depends on high-quality features. Mel Frequency Cepstral Coefficients (MFCC), Linear Prediction Coefficients (LPC), Linear Prediction Cepstral Coefficients (LPCC), Line Spectral Frequencies (LSF), Discrete Wavelet Transform (DWT) and Perceptual Linear Prediction (PLP) are the speech feature extraction techniques discussed in this chapter. These methods have been tested in a wide variety of applications, giving them a high level of reliability and acceptability. Researchers have made several modifications to these techniques to make them less susceptible to noise, more robust, and less time-consuming. In conclusion, none of the methods is superior to the others; the area of application determines which method to select.
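
As an illustration of what "a parametric representation at a relatively minimized data rate" can look like, the following is a minimal sketch of LPC feature extraction, one of the techniques the abstract names. It is not taken from the chapter. The function names, the frame length (240 samples), the hop (80 samples), the Hamming window and the prediction order (10) are all assumptions chosen for illustration, and the autocorrelation method with the Levinson-Durbin recursion is only one common way to estimate LPC coefficients.

```python
import math


def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Estimate LPC coefficients with the Levinson-Durbin recursion.

    Returns (a, err) with a[0] == 1.0, where the predictor is
    s[n] ~ -sum(a[k] * s[n - k] for k in 1..order) and err is the
    remaining prediction error energy.
    """
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            # Silent or numerically degenerate frame: stop early.
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def hamming(size):
    """Hamming window of the given length."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (size - 1))
            for n in range(size)]


def extract_lpc_features(signal, size=240, hop=80, order=10):
    """Split a signal into overlapping frames and return one LPC vector per frame.

    The default size, hop and order are illustrative assumptions only.
    """
    window = hamming(size)
    features = []
    for start in range(0, len(signal) - size + 1, hop):
        frame = signal[start:start + size]
        windowed = [s * w for s, w in zip(frame, window)]
        a, _ = lpc(windowed, order)
        features.append(a[1:])  # drop the leading 1.0
    return features


# Usage: a synthetic two-tone signal stands in for real speech samples.
signal = [math.sin(0.3 * n) + 0.5 * math.sin(1.1 * n) for n in range(1000)]
features = extract_lpc_features(signal)
print(len(features), len(features[0]))
```

With these assumed settings, each hop of 80 new samples is summarized by 10 coefficients, which is the kind of data-rate reduction the abstract describes. A practical front end would normally add steps such as pre-emphasis and would be tuned to the application.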

oai:intechopen.com:63970

Last time updated on 07/05/2020

This paper was published in IntechOpen.
