VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION

Baptista, Renato; Ghorbel, Enjie; Papadopoulos, Konstantinos; Demisse, Girum; Aouada, Djamila; Ottersten, Björn

Repository landing page

oai:orbilu.uni.lu:10993/39033

VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION

Authors: Renato Baptista
Enjie Ghorbel
Konstantinos Papadopoulos
Girum Demisse
Djamila Aouada
Björn Ottersten
Publication date: 1 May 2019
Publisher

Abstract

peer reviewedIn this paper, we propose a novel view-invariant action recognition method using a single monocular RGB camera. View-invariance remains a very challenging topic in 2D action recognition due to the lack of 3D information in RGB images. Most successful approaches make use of the concept of knowledge transfer by projecting 3D synthetic data to multiple viewpoints. Instead of relying on knowledge transfer, we propose to augment the RGB data by a third dimension by means of 3D skeleton estimation from 2D images using a CNN-based pose estimator. In order to ensure view-invariance, a pre-processing for alignment is applied followed by data expansion as a way for denoising. Finally, a Long-Short Term Memory (LSTM) architecture is used to model the temporal dependency between skeletons. The proposed network is trained to directly recognize actions from aligned 3D skeletons. The experiments performed on the challenging Northwestern-UCLA dataset show the superiority of our approach as compared to state-of-the-art ones

Similar works

Full text

Open in the Core reader

Download PDF

Open Repository and Bibliography - Luxembourg

oai:orbilu.uni.lu:10993/39033

Last time updated on 19/03/2019

This paper was published in Open Repository and Bibliography - Luxembourg.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.