Carnegie Mellon University
Browse
- No file added yet -

Multi-label Discriminative Weakly-Supervised Human Activity Recognition and Localization

Download (5.38 MB)
journal contribution
posted on 2014-11-01, 00:00 authored by Ehsan Adeli Mosabbeb, Ricardo da Silveira Cabral, Fernando de la Torre, Mahmood Fathy

Activity recognition in video has become increasingly important due to its many applications ranging from in-home elder care, surveillance, human computer interaction to automatic sports commentary. To date, most approaches to video rely on fully supervised settings that require time consuming and error prone manual labeling. Moreover, existing supervised approaches are typically tailored for classification, not detection problems (the spatial and temporal support of the action has to be detected). Recently, weakly-supervised learning (WSL) approaches were able to learn discriminative classifiers while localizing the action in space and/or time using weak labels. However, existing approaches for WSL provide coarse localization in terms of spatial regions or spatio-temporal volumes. Moreover, it is unclear how to extend current approaches to the multilabel case that is common in practical applications. This paper proposes a matrix completion approach to the problem of WSL for multi-label learning for video. Our approach localizes non-rectangular spatio-temporal discriminative regions that are inferred by clustering regions of common texture and motion features. We illustrate how our approach improves existing WSL and supervised learning techniques in three standard databases: Hollywood, UCF sports, and MSR-II.

History

Publisher Statement

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-16814-2_16

Date

2014-11-01

Usage metrics

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC