Using dysphonic voice to characterize speaker's biometry

Gómez Vilda, Pedro ORCID: https://orcid.org/0000-0003-3283-378X, San Segundo, Eugenia, Mazaira Fernández, Luis Miguel, Álvarez Marquina, Agustín ORCID: https://orcid.org/0000-0002-6106-6711 and Rodellar Biarge, M. Victoria ORCID: https://orcid.org/0000-0001-9384-3290 (2014). Using dysphonic voice to characterize speaker's biometry. "Language and Law / Linguagem e Direito", v. 1 (n. 2); pp. 42-66. ISSN 2183-3745.

Description

Title:	Using dysphonic voice to characterize speaker's biometry
Author(s):	Gómez Vilda, Pedro (https://orcid.org/0000-0003-3283-378X); San Segundo, Eugenia; Mazaira Fernández, Luis Miguel; Álvarez Marquina, Agustín (https://orcid.org/0000-0002-6106-6711); Rodellar Biarge, M. Victoria (https://orcid.org/0000-0001-9384-3290)
Document Type:	Article
Journal/Publication Title:	Language and Law / Linguagem e Direito
Date:	2014
ISSN:	2183-3745
Volume:	1
Subjects:	Philology
Informal Keywords:	Phonation; Speaker Recognition; Voice Production; Speech Processing
School:	E.T.S. de Ingenieros Informáticos (UPM)
Department:	Arquitectura y Tecnología de Sistemas Informáticos
Creative Commons Licenses:	Attribution - NoDerivatives - NonCommercial

Abstract

Phonation distortion leaves relevant marks in a speaker's biometric profile. Dysphonic voice production may therefore be used for biometric speaker characterization. In the present paper, phonation features derived from glottal source (GS) parameterization, after vocal tract inversion, are proposed for dysphonic voice characterization in Speaker Verification tasks. The GS-derived parameters are matched in a forensic evaluation framework that defines a distance-based metric specification. The phonation segments used in the study are derived from fillers, long vowels, and other phonation segments produced in spontaneous telephone conversations. Phonated segments from a telephone database of 100 male native Spanish speakers are combined in a 10-fold cross-validation task to produce the set of quality measurements outlined in the paper. Shimmer, the mucosal wave correlate, the vocal fold cover biomechanical parameter unbalance, and a subset of the GS cepstral profile produce accuracy rates as high as 99.57% over a wide threshold interval (62.08-75.04%). An Equal Error Rate of 0.64% can be achieved. The proposed metric framework is shown to behave more fairly than classical likelihood ratios in supporting the hypothesis of the defense vs. that of the prosecution, thus offering a more reliable evaluation score. Possible applications are Speaker Verification and Dysphonic Voice Grading.
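
To make the Equal Error Rate mentioned in the abstract concrete, the following is a minimal sketch of how an EER can be estimated from distance scores. It is not the paper's implementation: the function names, the accept-when-distance-is-at-or-below-threshold convention, and the toy scores are all assumptions chosen for illustration.

```python
from typing import Sequence, Tuple


def error_rates(genuine: Sequence[float], impostor: Sequence[float],
                threshold: float) -> Tuple[float, float]:
    """Return (FRR, FAR) for one threshold.

    Assumed convention: a trial is accepted when its distance is at or
    below the threshold, so smaller distances mean "same speaker".
    """
    # False rejection: a same-speaker trial whose distance exceeds the threshold.
    frr = sum(d > threshold for d in genuine) / len(genuine)
    # False acceptance: a different-speaker trial at or below the threshold.
    far = sum(d <= threshold for d in impostor) / len(impostor)
    return frr, far


def equal_error_rate(genuine: Sequence[float],
                     impostor: Sequence[float]) -> Tuple[float, float]:
    """Sweep every observed distance as a candidate threshold and return
    (eer, threshold) at the point where FAR and FRR are closest."""
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        frr, far = error_rates(genuine, impostor, t)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # Report the midpoint of FAR and FRR as the EER estimate.
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


if __name__ == "__main__":
    # Hypothetical toy distances, not data from the paper.
    same_speaker = [0.1, 0.2, 0.3, 0.6]
    different_speaker = [0.5, 0.7, 0.8, 0.9]
    eer, thr = equal_error_rate(same_speaker, different_speaker)
    print(f"EER = {eer:.2%} at threshold {thr}")
```

With real data the two score lists would come from held-out trials, for instance from each fold of a cross-validation run, and the sweep would be repeated per fold before averaging.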

Associated projects

Type:	Government of Spain
Code:	TEC2012-38630-C04-01
Acronym:	Unspecified
Leader:	Universidad Politécnica de Madrid
Title:	EVALUACION MULTIMODAL DE TRASTORNOS NEUROLOGICOS MEDIANTE LA CARACTERIZACION DE LA VOZ, DINAMICA DE LOS PLIEGUES VOCALES Y SECUENCIAS SACADICAS

Type:	Government of Spain
Code:	TEC2012-38630-C04-04
Acronym:	Unspecified
Leader:	Universidad Politécnica de Madrid
Title:	DETECCION DEL TRASTORNO NEUROLOGICO POR MEDIO DE CORRELATOS DE LA FONACION OBTENIDOS POR MODELADO INVERSO A PARTIR DE LA FUENTE GLOTICA

More information

Record ID:	40910
DC Identifier:	https://oa.upm.es/40910/
OAI Identifier:	oai:oa.upm.es:40910
Official URL:	https://ojs.letras.up.pt/index.php/LLLD/article/vi...
Deposited by:	Memoria Investigacion
Deposited on:	26 Oct 2016 11:09
Last Modified:	05 Jun 2019 17:21
