Fine-Grained Activity Recognition for Assembly Videos

被引:10
|
作者
Jones, Jonathan D. [1 ]
Cortesa, Cathryn [2 ]
Shelton, Amy [3 ]
Landau, Barbara [2 ]
Khudanpur, Sanjeev [1 ]
Hager, Gregory D. [4 ]
机构
[1] Johns Hopkins Univ, Dept Elect Engn, Baltimore, MD 21211 USA
[2] Johns Hopkins Univ, Dept Cognit Sci, Baltimore, MD 21211 USA
[3] Johns Hopkins Univ, Sch Educ, Baltimore, MD 21211 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21211 USA
来源
基金
美国国家科学基金会;
关键词
Probabilistic Inference; sensor fusion; recognition; assembly; multi-modal perception for HRI;
D O I
10.1109/LRA.2021.3064149
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter we address the task of recognizing assembly actions as a structure (e.g. a piece of furniture or a toy block tower) is built up from a set of primitive objects. Recognizing the full range of assembly actions requires perception at a level of spatial detail that has not been attempted in the action recognition literature to date. We extend the fine-grained activity recognition setting to address the task of assembly action recognition in its full generality by unifying assembly actions and kinematic structures within a single framework. We use this framework to develop a general method for recognizing assembly actions from observation sequences, along with observation features that take advantage of a spatial assembly's special structure. Finally, we evaluate our method empirically on two application-driven data sources: 1) An IKEA furniture-assembly dataset, and 2) A block-building dataset. On the first, our system recognizes assembly actions with an average framewise accuracy of 70% and an average normalized edit distance of 10%. On the second, which requires fine-grained geometric reasoning to distinguish between assemblies, our system attains an average normalized edit distance of 23%-a relative improvement of 69% over prior work.
引用
收藏
页码:3728 / 3735
页数:8
相关论文
共 50 条
  • [21] Fine-grained Walking Activity Recognition via Driving Recorder Dataset
    Kataoka, Hirokatsu
    Aoki, Yoshimitsu
    Satoh, Yutaka
    Oikawa, Shoko
    Matsui, Yasuhiro
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, : 620 - 625
  • [22] A Transformer-based Late-Fusion Mechanism for Fine-Grained Object Recognition in Videos
    Koch, Jannik
    Wolf, Stefan
    Beyerer, Juergen
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 100 - 109
  • [23] Fine-Grained Kitchen Activity Recognition using RGB-D
    Lei, Jinna
    Ren, Xiaofeng
    Fox, Dieter
    UBICOMP'12: PROCEEDINGS OF THE 2012 ACM INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING, 2012, : 208 - 211
  • [24] Fine-Grained Similarity Measurement between Educational Videos and Exercises
    Wang, Xin
    Huang, Wei
    Liu, Qi
    Yin, Yu
    Huang, Zhenya
    Wu, Le
    Ma, Jianhui
    Wang, Xue
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 331 - 339
  • [25] Fine-Grained Facial Expression Recognition in the Wild
    Liang, Liqian
    Lang, Congyan
    Li, Yidong
    Feng, Songhe
    Zhao, Jian
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 482 - 494
  • [26] Fine-Grained Named Entity Recognition for Sinhala
    Azeez, Rameela
    Ranathunga, Surangika
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 295 - 300
  • [27] PROGRESSIVE TRAINING ENABLED FINE-GRAINED RECOGNITION
    Kang, Bin
    Wu, Fan
    Li, Xin
    Zhou, Quan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 876 - 880
  • [28] TaiChi: A Fine-Grained Action Recognition Dataset
    Sun, Shan
    Wang, Feng
    Liang, Qi
    He, Liang
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 434 - 438
  • [29] Fine-grained Categorization of Fish Motion Patterns in Underwater Videos
    Amer, Mohamed
    Bilgazyev, Emil
    Todorovic, Sinisa
    Shah, Shishir
    Kakadiaris, Ioannis
    Ciannelli, Lorenzo
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [30] FenceNet: Fine-grained Footwork Recognition in Fencing
    Zhu, Kevin
    Wong, Alexander
    McPhee, John
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3588 - 3597