Feature analysis for discriminative confidence estimation in spoken term detection

Tejedor, Javier; Toledano, Doroteo T.; Wang, Dong; King, Simon; Colas, Jose

Repository landing page

oai:pure.ed.ac.uk:publications/821ed562-0d7a-47a8-b3e5-a93e0670cebe

Feature analysis for discriminative confidence estimation in spoken term detection

Authors: Javier Tejedor
Doroteo T. Toledano
Dong Wang
Simon King
Jose Colas
Publication date: 1 September 2014
Publisher
Doi

Abstract

Discriminative confidence based on multi-layer perceptrons (MLPs) and multiple features has shown significant advantage compared to the widely used lattice-based confidence in spoken term detection (STD). Although the MLP-based framework can handle any features derived from a multitude of sources, choosing all possible features may lead to over complex models and hence less generality. In this paper, we design an extensive set of features and analyze their contribution to STD individually and as a group. The main goal is to choose a small set of features that are sufficiently informative while keeping the model simple and generalizable. We employ two established models to conduct the analysis: one is linear regression which targets for the most relevant features and the other is logistic linear regression which targets for the most discriminative features. We find the most informative features are comprised of those derived from diverse sources (ASR decoding, duration and lexical properties) and the two models deliver highly consistent feature ranks. STD experiments on both English and Spanish data demonstrate significant performance gains with the proposed feature sets.<br/

Similar works

Full text

Open in the Core reader

Download PDF

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 09/08/2016

This paper was published in Edinburgh Research Explorer.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.