Spatiotemporal pyramid representation for recognition of facial expressions and hand gestures

Zhipeng Zhao, Ahmed Elgammal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

This paper presents a spatiotemporal pyramid representation for recognizing facial expressions and hand gestures. This approach works by partitioning video sequence into increasingly fine subdivisions in the space and time domains and modeling the distribution of the local motion features inside each subdivision such that the set of motion features are mapped into spatial and temporal multi-resolution histograms. This spatiotemporal pyramid is built by weighting the histograms from the different layers of the subdivisions. The proposed approach is an extension of the orderless "bag-of-words" model by approximately capturing geometric and temporal arrangements of the local motion features. The experiments on facial expression and hand gesture data sets have demonstrated the significantly improved performance over state of art results on human activity recognition tasks by using our representation.

Original languageEnglish (US)
Title of host publication2008 8th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2008
DOIs
StatePublished - 2008
Event2008 8th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2008 - Amsterdam, Netherlands
Duration: Sep 17 2008Sep 19 2008

Publication series

Name2008 8th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2008

Other

Other2008 8th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2008
CountryNetherlands
CityAmsterdam
Period9/17/089/19/08

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Spatiotemporal pyramid representation for recognition of facial expressions and hand gestures'. Together they form a unique fingerprint.

Cite this