Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs

Doungpaisan, Pafan; Mingkhwan, Anirach

Repository landing page

oai:ojs.www.iaescore.com:article/8707

Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs

Authors: Pafan Doungpaisan
Anirach Mingkhwan
Publication date: 2017
Publisher: Institute of Advanced Engineering and Science
Doi

Abstract

Search engine is the popular term for an information retrieval (IR) system. Typically, search engine can be based on full-text indexing. Changing the presentation from the text data to multimedia data types make an information retrieval process more complex such as a retrieval of image or sounds in large databases. This paper introduces the use of language and text independent speech as input queries in a large sound database by using Speaker identification algorithm. The method consists of 2 main processing first steps, we separate vocal and non-vocal identification after that vocal be used to speaker identification for audio query by speaker voice. For the speaker identification and audio query by process, we estimate the similarity of the example signal and the samples in the queried database by calculating the Euclidian distance between the Mel frequency cepstral coefficients (MFCC) and Energy spectrum of acoustic features. The simulations show that the good performance with a sustainable computational cost and obtained the average accuracy rate more than 90%

Similar works

Full text

IAES journal

oai:ojs.www.iaescore.com:artic...

Last time updated on 07/06/2018

This paper was published in IAES journal.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.