Fine-Grained Activity Recognition for Assembly Videos

被引:10
|
作者
Jones, Jonathan D. [1 ]
Cortesa, Cathryn [2 ]
Shelton, Amy [3 ]
Landau, Barbara [2 ]
Khudanpur, Sanjeev [1 ]
Hager, Gregory D. [4 ]
机构
[1] Johns Hopkins Univ, Dept Elect Engn, Baltimore, MD 21211 USA
[2] Johns Hopkins Univ, Dept Cognit Sci, Baltimore, MD 21211 USA
[3] Johns Hopkins Univ, Sch Educ, Baltimore, MD 21211 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21211 USA
来源
基金
美国国家科学基金会;
关键词
Probabilistic Inference; sensor fusion; recognition; assembly; multi-modal perception for HRI;
D O I
10.1109/LRA.2021.3064149
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter we address the task of recognizing assembly actions as a structure (e.g. a piece of furniture or a toy block tower) is built up from a set of primitive objects. Recognizing the full range of assembly actions requires perception at a level of spatial detail that has not been attempted in the action recognition literature to date. We extend the fine-grained activity recognition setting to address the task of assembly action recognition in its full generality by unifying assembly actions and kinematic structures within a single framework. We use this framework to develop a general method for recognizing assembly actions from observation sequences, along with observation features that take advantage of a spatial assembly's special structure. Finally, we evaluate our method empirically on two application-driven data sources: 1) An IKEA furniture-assembly dataset, and 2) A block-building dataset. On the first, our system recognizes assembly actions with an average framewise accuracy of 70% and an average normalized edit distance of 10%. On the second, which requires fine-grained geometric reasoning to distinguish between assemblies, our system attains an average normalized edit distance of 23%-a relative improvement of 69% over prior work.
引用
收藏
页码:3728 / 3735
页数:8
相关论文
共 50 条
  • [31] Fine-grained recognition of plants from images
    Milan Šulc
    Jiří Matas
    Plant Methods, 13
  • [32] Fine-Grained Grounding for Multimodal Speech Recognition
    Srinivasan, Tejas
    Sanabria, Ramon
    Metze, Florian
    Elliott, Desmond
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2667 - 2677
  • [33] Semantic bilinear pooling for fine-grained recognition
    School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
    Proc. Int. Conf. Pattern Recognit., (3660-3666):
  • [34] Semantic Bilinear Pooling for Fine-Grained Recognition
    Li, Xinjie
    Yang, Chun
    Chen, Song-Lu
    Zhu, Chao
    Yin, Xu-Cheng
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3660 - 3666
  • [35] Learning Features and Parts for Fine-Grained Recognition
    Krause, Jonathan
    Gebru, Timnit
    Deng, Jia
    Li, Li-Jia
    Li Fei-Fei
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 26 - 33
  • [36] Annotation modification for fine-grained visual recognition
    Luo, Changzhi
    Meng, Zhijun
    Feng, Jiashi
    Ni, Bingbing
    Wang, Meng
    NEUROCOMPUTING, 2018, 274 : 58 - 65
  • [37] Leveraging the Wisdom of the Crowd for Fine-Grained Recognition
    Deng, Jia
    Krause, Jonathan
    Stark, Michael
    Fei-Fei, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (04) : 666 - 676
  • [38] Selective Pooling Vector for Fine-grained Recognition
    Chen, Guang
    Yang, Jianchao
    Jin, Hailin
    Shechtman, Eli
    Brandt, Jonathan
    Han, Tony X.
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 860 - 867
  • [39] Fine-grained recognition of plants from images
    Sulc, Milan
    Matas, Jiri
    PLANT METHODS, 2017, 13
  • [40] Nonparametric Part Transfer for Fine-grained Recognition
    Goering, Christoph
    Rodner, Erik
    Freytag, Alexander
    Denzler, Joachim
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2489 - 2496