Seeing is worse than believing: Reading people's minds better than computer-vision methods recognize actions

Andrei Barbu, Daniel P. Barrett, Wei Chen, Narayanaswamy Siddharth, Caiming Xiong, Jason J. Corso, Christiane D. Fellbaum, Catherine Hanson, Stephen José Hanson, Sébastien Hélie, Evguenia Malaia, Barak A. Pearlmutter, Jeffrey Mark Siskind, Thomas Michael Talavage, Ronnie B. Wilbur

Research output: Contribution to journalConference articlepeer-review

6 Scopus citations

Abstract

We had human subjects perform a one-out-of-six class action recognition task from video stimuli while undergoing functional magnetic resonance imaging (fMRI). Support-vector machines (SVMs) were trained on the recovered brain scans to classify actions observed during imaging, yielding average classification accuracy of 69.73% when tested on scans from the same subject and of 34.80% when tested on scans from different subjects. An apples-to-apples comparison was performed with all publicly available software that implements state-of-the-art action recognition on the same video corpus with the same cross-validation regimen and same partitioning into training and test sets, yielding classification accuracies between 31.25% and 52.34%. This indicates that one can read people's minds better than state-of-the-art computer-vision methods can perform action recognition.

Original languageEnglish (US)
Pages (from-to)612-627
Number of pages16
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8693 LNCS
Issue numberPART 5
DOIs
StatePublished - 2014
Event13th European Conference on Computer Vision, ECCV 2014 - Zurich, Switzerland
Duration: Sep 6 2014Sep 12 2014

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Keywords

  • action recognition
  • fMRI

Fingerprint

Dive into the research topics of 'Seeing is worse than believing: Reading people's minds better than computer-vision methods recognize actions'. Together they form a unique fingerprint.

Cite this