Indexed spatio-temporal appearance models for query-driven video action recognition

Haomian Zheng, Zhu Li, Aggelos K. Katsaggelos, Jia You

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review


Video action and event recognition is an important problem in video analysis research with many important applications, such as surveillance and video search. In this work, we deal with the appearance complexity in video action recognition by applying an indexing structure and partition in appearance space. The task requires spatio-temporal appearance modeling that can capture the discriminative information among different action classes. Traditional approaches are based on a global appearance model, which is not robust to local variations in background. In this work, we develop a query driven dynamic appearance modeling method and use a localized subspace to obtain a distance metric for appearance discrimination. Multiple localized models are constructed and utilized to measure the similarity between the trajectories and the sub-space metric is adaptive during the learning process. The processing is implemented based on an indexing scheme, which is very fast in computation. Simulation results demonstrate the effectiveness of the solution.
Original languageEnglish
Title of host publicationElectronic Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, ICME 2011
Publication statusPublished - 7 Nov 2011
Event2011 12th IEEE International Conference on Multimedia and Expo, ICME 2011 - Barcelona, Spain
Duration: 11 Jul 201115 Jul 2011


Conference2011 12th IEEE International Conference on Multimedia and Expo, ICME 2011


  • Localize Modeling
  • Query-Driven
  • Space Indexing
  • Spatio-temporal Modeling
  • Video Action Recognition

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications

Cite this