Human Detection and Segmentation via Multi-view Consensus

Katircioglu, Isinsu; Rhodin, Helge; Spörri, Jörg; Salzmann, Mathieu; Fua, Pascal

Repository landing page

oai:infoscience.epfl.ch:287925

Human Detection and Segmentation via Multi-view Consensus

Authors: Isinsu Katircioglu
Helge Rhodin
Jörg Spörri
Mathieu Salzmann
Pascal Fua
Publication date: 18 August 2021
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'

Abstract

Self-supervised detection and segmentation of foreground objects aims for accuracy without annotated training data. However, existing approaches predominantly rely on restrictive assumptions on appearance and motion. For scenes with dynamic activities and camera motion, we propose a multi-camera framework in which geometric constraints are embedded in the form of multi-view consistency during training via coarse 3D localization in a voxel grid and fine-grained offset regression. In this manner, we learn a joint distribution of proposals over multiple views. At inference time, our method operates on single RGB images. We outperform state-of-the-art techniques both on images that visually depart from those of standard benchmarks and on those of the classical Human3.6M dataset

Text

Similar works

Full text

Infoscience - École polytechnique fédérale de Lausanne

oai:infoscience.epfl.ch:287925

Last time updated on 28/09/2021

This paper was published in Infoscience - École polytechnique fédérale de Lausanne.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.