Fine-Grained Activity Recognition for Assembly Videos

被引:10
|
作者
Jones, Jonathan D. [1 ]
Cortesa, Cathryn [2 ]
Shelton, Amy [3 ]
Landau, Barbara [2 ]
Khudanpur, Sanjeev [1 ]
Hager, Gregory D. [4 ]
机构
[1] Johns Hopkins Univ, Dept Elect Engn, Baltimore, MD 21211 USA
[2] Johns Hopkins Univ, Dept Cognit Sci, Baltimore, MD 21211 USA
[3] Johns Hopkins Univ, Sch Educ, Baltimore, MD 21211 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21211 USA
来源
基金
美国国家科学基金会;
关键词
Probabilistic Inference; sensor fusion; recognition; assembly; multi-modal perception for HRI;
D O I
10.1109/LRA.2021.3064149
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter we address the task of recognizing assembly actions as a structure (e.g. a piece of furniture or a toy block tower) is built up from a set of primitive objects. Recognizing the full range of assembly actions requires perception at a level of spatial detail that has not been attempted in the action recognition literature to date. We extend the fine-grained activity recognition setting to address the task of assembly action recognition in its full generality by unifying assembly actions and kinematic structures within a single framework. We use this framework to develop a general method for recognizing assembly actions from observation sequences, along with observation features that take advantage of a spatial assembly's special structure. Finally, we evaluate our method empirically on two application-driven data sources: 1) An IKEA furniture-assembly dataset, and 2) A block-building dataset. On the first, our system recognizes assembly actions with an average framewise accuracy of 70% and an average normalized edit distance of 10%. On the second, which requires fine-grained geometric reasoning to distinguish between assemblies, our system attains an average normalized edit distance of 23%-a relative improvement of 69% over prior work.
引用
收藏
页码:3728 / 3735
页数:8
相关论文
共 50 条
  • [41] Learning to locate for fine-grained image recognition
    Chen, Jiamin
    Hu, Jianguo
    Li, Shiren
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 206
  • [42] Fine-Grained Argument Unit Recognition and Classification
    Trautmann, Dietrich
    Daxenberger, Johannes
    Stab, Christian
    Schuetze, Hinrich
    Gurevych, Iryna
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9048 - 9056
  • [43] Dynamic Perception Framework for Fine-Grained Recognition
    Ding, Yao
    Han, Zhenjun
    Zhou, Yanzhao
    Zhu, Yi
    Chen, Jie
    Ye, Qixiang
    Jiao, Jianbin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1353 - 1365
  • [44] Fine-grained Named Entity Recognition for Turkish
    Khudoyberdieva, Lola
    Diri, Banu
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [45] Fine-Grained Recognition without Part Annotations
    Krause, Jonathan
    Jin, Hailin
    Yang, Jianchao
    Li Fei-Fei
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5546 - 5555
  • [46] Discovering Localized Attributes for Fine-grained Recognition
    Duan, Kun
    Parikh, Devi
    Crandall, David
    Grauman, Kristen
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3474 - 3481
  • [47] Fine-Grained Object Recognition with Gnostic Fields
    Kanan, Christopher
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 23 - 30
  • [48] Picking Neural Activations for Fine-Grained Recognition
    Zhang, Xiaopeng
    Xiong, Hongkai
    Zhou, Wengang
    Lin, Weiyao
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (12) : 2736 - 2750
  • [49] TransFG: A Transformer Architecture for Fine-Grained Recognition
    He, Ju
    Chen, Jie-Neng
    Liu, Shuai
    Kortylewski, Adam
    Yang, Cheng
    Bai, Yutong
    Wang, Changhu
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 852 - 860
  • [50] SIRNet: Fine-Grained Surgical Interaction Recognition
    Li, Ling
    Li, Xiaojian
    Ding, Shuai
    Fang, Zhao
    Xu, Mengya
    Ren, Hongliang
    Yang, Shanlin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 4212 - 4219