An evaluation of bags-of-words and spatio-temporal shapes for action recognition

Teofilo de Campos, Mark Barnard, Krystian Mikolajczyk, Josef Kittler, Fei Yan, William Christmas, David Windridge

    Research output: Contribution to conference, Paper, peer-review

    Abstract

    Bags-of-visual-Words (BoW) and Spatio-Temporal Shapes (STS) are two very popular approaches for action recognition from video. The former (BoW) is an unstructured global representation of videos which is built using a large set of local features. The latter (STS) uses a single feature located on a region of interest (where the actor is) in the video. Despite the popularity of these methods, no comparison between them has been done. Also, given that BoW and STS differ intrinsically in terms of context inclusion and globality/locality of operation, an appropriate evaluation framework has to be designed carefully. This paper compares these two approaches using four different datasets with varying degrees of space-time specificity of the actions and varying relevance of the contextual background. We use the same local feature extraction method and the same classifier for both approaches. In addition to BoW and STS, we also evaluate novel variations of BoW constrained in time or space. We observe that the STS approach leads to better results in all datasets whose background is of little relevance to action classification.
    Original language: English
    Publication status: Published - Jan 2011
    Event: 2011 IEEE Workshop on Applications of Computer Vision (WACV) - Kona, Hawaii
    Duration: 5 Jan 2011 to 7 Jan 2011

    Bibliographical note

    Note: Published in: 2011 IEEE Workshop on Applications of Computer Vision (WACV). Piscataway, NJ : Institute of Electrical and Electronics Engineers. ISSN 1550-5790 ISBN 9781424494965

    Organising Body: IEEE

    Keywords

    • Computer science and informatics
