Efficient Action Recognition with MoFREAK

被引：9

作者：

Whiten, Chris ^{[1
]}

Laganiere, Robert ^{[1
]}

Bilodeau, Guillaume-Alexandre ^{[2
]}

机构：

[1] Univ Ottawa, VIVA Lab, Ottawa, ON, Canada

[2] Ecole Polytech, LITIV Lab, Montreal, PQ, Canada

来源：

2013 INTERNATIONAL CONFERENCE ON COMPUTER AND ROBOT VISION (CRV) | 2013年

关键词：

action recognition; spatiotemporal feature description; local binary descriptor;

D O I：

10.1109/CRV.2013.30

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent work shows that local binary feature descriptors are effective for increasing the efficiency of object recognition, while retaining comparable performance to other state of the art descriptors. An extension of these approaches to action recognition in videos would facilitate huge gains in efficiency, due to the computational advantage of computing a bag-of-words representation with the Hamming distance rather than the Euclidean distance. We present a new local spatiotemporal descriptor for action recognition that encodes both the appearance and motion in a scene with a short binary string. The first bytes of the descriptor encode the appearance and some implicit motion, through an extension of the recently proposed FREAK descriptor. The remaining bytes strengthen the motion model by building a binary string through local motion patterns. We demonstrate that by exploiting the binary makeup of this descriptor, it is possible to greatly reduce the running time of action recognition while retaining competitive performance with the state of the art.

引用

页码：319 / 325

页数：7