DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation

Houidhek, Amal; Colotte, Vincent; Mnasri, Zied; Jouvet, Denis

Repository landing page

oai:HAL:hal-01904512v1

DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation

Authors: Amal Houidhek
Vincent Colotte
Zied Mnasri
Denis Jouvet
Publication date: 15 October 2018
Publisher: HAL CCSD

Abstract

International audienceThis paper investigates the use of deep neural networks (DNN) for Arabic speech synthesis. In parametric speech synthesis, whether HMM-based or DNN-based, each speech segment is described with a set of contextual features. These contextual features correspond to linguistic, phonetic and prosodic information that may affect the pronunciation of the segments. Gemination and vowel quantity (short vowel vs. long vowel) are two particular and important phenomena in Arabic language. Hence, it is worth investigating if those phenomena must be handled by using specific speech units, or if their specification in the contextual features is enough. Consequently four modelling approaches are evaluated by considering geminated consonants (respectively long vowels) either as fully-fledged phoneme units or as the same phoneme as their simple (respectively short) counterparts. Although no significant difference has been observed in previous studies relying on HMM-based modelling, this paper examines these modelling variants in the framework of DNN-based speech synthesis. Listening tests are conducted to evaluate the four modelling approaches, and to assess the performance of DNN-based Arabic speech synthesis with respect to previous HMM-based approach

Similar works

Full text

Open in the Core reader

Download PDF

INRIA a CCSD electronic archive server

oai:HAL:hal-01904512v1

Last time updated on 12/01/2019

This paper was published in INRIA a CCSD electronic archive server.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.