Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition

Lifang Wu, Xianglong Lang, Ye Xiang, Changwen Chen, Zun Li, Zhuming Wang

Research output: Journal article publicationJournal articleAcademic researchpeer-review

10 Citations (Scopus)

Abstract

Group activity recognition aims to recognize behaviors characterized by multiple individuals within a scene. Existing schemes rely on individual relation inference and usually take the individuals as tokens. Essentially they select the most relevant region of the group activity from the entire image while filtering out irrelevant background noises. However, these schemes require individual bounding box labeling in both training and testing stages. Since individuals have usually been presented at one scale, multi-scale individuals cannot be combined in an effective way. In this paper, we present a novel end-to-end hierarchical relation inference framework based on active spatial positions for group activity recognition. This framework is designed to locate active spatial positions and use them as visual tokens to infer the relations for token embeddings. It requires individual bounding box labeling only in the training stage while automatically eliminating the background after locating active spatial positions from the entire scene. The hierarchical relations can be naturally inferred based on the visual tokens at different scales, contributing to further performance improvement. Experimental results demonstrate that the proposed framework is competitive against existing schemes that require more laboring and computation to generate labels in both the training and testing stage.

Original languageEnglish
Pages (from-to)2839-2851
Number of pages13
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume33
Issue number6
DOIs
Publication statusPublished - 1 Jun 2023

Keywords

  • active spatial positions
  • Group activity recognition
  • hierarchical relation inference

ASJC Scopus subject areas

  • Media Technology
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition'. Together they form a unique fingerprint.

Cite this