An Audio-Visual System for Object-Based Audio: From Recording to Listening

Coleman, Philip; Franck, A; Francombe, Jon; Liu, Qingju; de Campos, Teofilo; Hughes, R; Menzies, D; Simon Galvez,, M; Tang, Y; Woodcock, J; Jackson, Philip; Melchior, F; Pike, C; Fazi, F; Cox, T; Hilton, Adrian

Repository landing page

An Audio-Visual System for Object-Based Audio: From Recording to Listening

Authors: Philip Coleman
A Franck
Jon Francombe
Qingju Liu
Teofilo de Campos
R Hughes
D Menzies
Simon Galvez,M
Y Tang
J Woodcock
Philip Jackson
F Melchior
C Pike
F Fazi
T Cox
Adrian Hilton
Publication date: 5 March 2019
Publisher: IEEE
Doi

Abstract

Object-based audio is an emerging representation for audio content, where content is represented in a reproductionformat- agnostic way and thus produced once for consumption on many different kinds of devices. This affords new opportunities for immersive, personalized, and interactive listening experiences. This article introduces an end-to-end object-based spatial audio pipeline, from sound recording to listening. A high-level system architecture is proposed, which includes novel audiovisual interfaces to support object-based capture and listenertracked rendering, and incorporates a proposed component for objectification, i.e., recording content directly into an object-based form. Text-based and extensible metadata enable communication between the system components. An open architecture for object rendering is also proposed. The system’s capabilities are evaluated in two parts. First, listener-tracked reproduction of metadata automatically estimated from two moving talkers is evaluated using an objective binaural localization model. Second, object-based scene capture with audio extracted using blind source separation (to remix between two talkers) and beamforming (to remix a recording of a jazz group), is evaluated with perceptually-motivated objective and subjective experiments. These experiments demonstrate that the novel components of the system add capabilities beyond the state of the art. Finally, we discuss challenges and future perspectives for object-based audio workflows

Similar works

Full text

Open in the Core reader

Download PDF

University of Surrey

oai:alma.44SUR_INST:1113872634...

Last time updated on 01/08/2022

This paper was published in University of Surrey.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.