AVSE Challenge: Audio-Visual Speech Enhancement Challenge

Aldana Blanco, Andrea Lorena; Valentini Botinhao, Cassia; Klejch, Ondrej; Gogate, Mandar; Dashtipour, Kia; Hussain, Amir; Bell, Peter

Repository landing page

oai:pure.ed.ac.uk:publications/9aa045cc-739e-445b-8d69-203e1b4f2eef

AVSE Challenge: Audio-Visual Speech Enhancement Challenge

Authors: Andrea Lorena Aldana Blanco
Cassia Valentini Botinhao
Ondrej Klejch
Mandar Gogate
Kia Dashtipour
Amir Hussain
Peter Bell
Publication date: 27 January 2023
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Doi

Abstract

Audio-visual speech enhancement is the task of improving the quality of a speech signal when video of the speaker is available. It opens-up the opportunity of improving speech intelligibility in adverse listening scenarios that are currently too challenging for audio-only speech enhancement models. The Audio-Visual Speech Enhancement (AVSE) challenge aims to set the first benchmark in this area. We provide participants with datasets and scripts to test their audio-visual speech enhancement models under a common framework for both training and evaluation. The data is derived from real-world videos, and comprises noisy mixes, in which audio from target speaker is mixed with either a competing speaker or a noise signal. The submitted systems are evaluated by conducting AV intelligibility tests involving human participants. We expect this challenge to be a platform for advancing the field of audio-visual speech-enhancement and to provide further insight about the scope and limitations of current AV speech enhancement approaches

Similar works

Full text

Open in the Core reader

Download PDF

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 02/03/2023

This paper was published in Edinburgh Research Explorer.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.