A Log Domain Pulse Model for Parametric Speech Synthesis

Degottex, Gilles; Lanchantin, Pierre; Gales, Mark

A Log Domain Pulse Model for Parametric Speech Synthesis

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/275115

Repository DOI

https://doi.org/10.17863/CAM.21315

Files

Accepted version (2.23 MB)

Type

Article

Authors

Degottex, Gilles

Lanchantin, Pierre

Gales, Mark

https://orcid.org/0000-0002-5311-8219

Abstract

Most of the degradation in current Statistical Parametric Speech Synthesis (SPSS) results from the form of the vocoder. One of the main causes of degradation is the reconstruction of the noise. In this article, a new signal model is proposed that leads to a simple synthesizer, without the need for ad-hoc tuning of model parameters. The model is not based on the traditional additive linear source-filter model, it adopts a combination of speech components that are additive in the log domain. Also, the same representation for voiced and unvoiced segments is used, rather than relying on binary voicing decisions. This avoids voicing error discontinuities that can occur in many current vocoders. A simple binary mask is used to denote the presence of noise in the time-frequency domain, which is less sensitive to classification errors. Four experiments have been carried out to evaluate this new model. The first experiment examines the noise reconstruction issue. Three listening tests have also been carried out that demonstrate the advantages of this model: comparison with the STRAIGHT vocoder; the direct prediction of the binary noise mask by using a mixed output configuration; and partial improvements of creakiness using a mask correction mechanism.

Keywords

speech, speech processing, speech synthesis, text-to-speech, parametric speech synthesis, acoustic model, voice, pulse model

Journal Title

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Journal ISSN

2329-9290
2329-9304

Volume Title

26

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publisher DOI

https://doi.org/10.1109/TASLP.2017.2761546

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

European Commission Horizon 2020 (H2020) Marie Sk?odowska-Curie actions (655764)

European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie; 10.13039/501100000266-EPSRC

Collections

Scholarly Works - Engineering