Actor-Transformers for Group Activity Recognition

Gavrilyuk, K.; Sanford, R.; Javan, M.; Snoek, C.G.M.

Repository landing page

oai:dare.uva.nl:publications/60c57d59-e6e6-45d8-836f-948d67d2ff1e

Actor-Transformers for Group Activity Recognition

Authors: K. Gavrilyuk
R. Sanford
M. Javan
C.G.M. Snoek
Publication date: 1 January 2020
Publisher: IEEE Computer Society
Doi

Abstract

This paper strives to recognize individual actions and group activities from videos. While existing solutions for this challenging problem explicitly model spatial and temporal relationships based on location of individual actors, we propose an actor-transformer model able to learn and selectively extract information relevant for group activity recognition. We feed the transformer with rich actor-specific static and dynamic representations expressed by features from a 2D pose network and 3D CNN, respectively. We empirically study different ways to combine these representations and show their complementary benefits. Experiments show what is important to transform and how it should be transformed. What is more, actor-transformers achieve state-of-the-art results on two publicly available benchmarks for group activity recognition, outperforming the previous best published results by a considerable margin

contributionToPeriodical

Similar works

Full text

UvA-DARE

oai:dare.uva.nl:publications/6...

Last time updated on 09/05/2023

This paper was published in UvA-DARE.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.