A Speaker De-Identification System Based on Sound Processing

Costandache, Mihai-Andrei; Iftene, Adrian; Gifu, Daniela

Repository landing page

oai:aisel.aisnet.org:isd2014-1348

A Speaker De-Identification System Based on Sound Processing

Authors: Mihai-Andrei Costandache
Adrian Iftene
Daniela Gifu
Publication date: 9 August 2021
Publisher: AIS Electronic Library (AISeL)

Abstract

In the context of products employing speech recognition, where the speech signal is sent from the device to centralized servers that process data, or simply products that involve data storage on servers, privacy for audio data is an important issue, just as it is for other types of data. Ignoring privacy has consequences for both, speakers (information leaks) and server administrators (legal issues). In this paper, we propose a speaker de-identification solution based on sound processing, altering voice characteristics, along with an API. Our solution consisting of pitch shift and noise mix (the latter is an optional augmentation method) has a great speaker de-identification performance, without an important loss in terms of word intelligibility. It is worth mentioning that sometimes the recordings may not be easy to understand in the initial (i.e., not de-identified) form, due to the speaker’s pronunciation, talking speed, and other related factors

text

Similar works

Full text

Open in the Core reader

Download PDF

AIS Electronic Library (AISeL)

oai:aisel.aisnet.org:isd2014-1...

Last time updated on 16/11/2021

This paper was published in AIS Electronic Library (AISeL).

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.