MoSIFT: Recognizing Human Actions in Surveillance Videos

Chen, Ming-Yu; Hauptmann, Alexander

doi:10.1184/R1/6607523.v1

file.pdf (1.67 MB)

MoSIFT: Recognizing Human Actions in Surveillance Videos

journal contribution

posted on 1995-08-01, 00:00 authored by Ming-Yu Chen, Alexander Hauptmann

The goal of this paper is to build robust human action recognition for real world surveillance videos. Local spatio-temporal features around interest points provide compact but descriptive representations for video analysis and motion recognition. Current approaches tend to extend spatial descriptions by adding a temporal component for the appearance descriptor, which only implicitly captures motion information. We propose an algorithm called MoSIFT, which detects interest points and encodes not only their local appearance but also explicitly models local motion. The idea is to detect distinctive local features through local appearance and motion. We construct MoSIFT feature descriptors in the spirit of the well-known SIFT descriptors to be robust to small deformations through grid aggregation. We also introduce a bigram model to construct a correlation between local features to capture the more global structure of actions. The method advances the state of the art result on the KTH dataset to an accuracy of 95.8%. We also applied our approach to 100 hours of surveillance data as part of the TRECVID Event Detection task with very promising results on recognizing human actions in the real world surveillance videos. 2

History

Publisher Statement

Copyright 1995 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited

Date

1995-08-01

Usage metrics

Keywords

computer sciences

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

MoSIFT: Recognizing Human Actions in Surveillance Videos

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports