Fine-Grained Activity Recognition for Assembly Videos

被引:10
|
作者
Jones, Jonathan D. [1 ]
Cortesa, Cathryn [2 ]
Shelton, Amy [3 ]
Landau, Barbara [2 ]
Khudanpur, Sanjeev [1 ]
Hager, Gregory D. [4 ]
机构
[1] Johns Hopkins Univ, Dept Elect Engn, Baltimore, MD 21211 USA
[2] Johns Hopkins Univ, Dept Cognit Sci, Baltimore, MD 21211 USA
[3] Johns Hopkins Univ, Sch Educ, Baltimore, MD 21211 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21211 USA
来源
基金
美国国家科学基金会;
关键词
Probabilistic Inference; sensor fusion; recognition; assembly; multi-modal perception for HRI;
D O I
10.1109/LRA.2021.3064149
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter we address the task of recognizing assembly actions as a structure (e.g. a piece of furniture or a toy block tower) is built up from a set of primitive objects. Recognizing the full range of assembly actions requires perception at a level of spatial detail that has not been attempted in the action recognition literature to date. We extend the fine-grained activity recognition setting to address the task of assembly action recognition in its full generality by unifying assembly actions and kinematic structures within a single framework. We use this framework to develop a general method for recognizing assembly actions from observation sequences, along with observation features that take advantage of a spatial assembly's special structure. Finally, we evaluate our method empirically on two application-driven data sources: 1) An IKEA furniture-assembly dataset, and 2) A block-building dataset. On the first, our system recognizes assembly actions with an average framewise accuracy of 70% and an average normalized edit distance of 10%. On the second, which requires fine-grained geometric reasoning to distinguish between assemblies, our system attains an average normalized edit distance of 23%-a relative improvement of 69% over prior work.
引用
收藏
页码:3728 / 3735
页数:8
相关论文
共 50 条
  • [1] Fine-grained Activity Recognition in Baseball Videos
    Piergiovanni, A. J.
    Ryoo, Michael S.
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1821 - 1829
  • [2] Local Depth Patterns for Fine-Grained Activity Recognition in Depth Videos
    Awwad, Sari
    Piccardi, Massimo
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2016, : 214 - 219
  • [3] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [4] Hand Detection and Tracking in Videos for Fine-Grained Action Recognition
    Do, Nga H.
    Yanai, Keiji
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I, 2015, 9008 : 19 - 34
  • [5] Fine-Grained Human Activity Recognition - A new paradigm
    Pandurangan, Shalini
    Papandrea, Michela
    Gelsomini, Mirko
    7TH INTERNATIONAL WORKSHOP ON SENSOR-BASED ACTIVITY RECOGNITION AND ARTIFICIAL INTELLIGENCE, IWOAR 2022, 2022,
  • [6] Fine-grained Multimodal Entity Linking for Videos
    Zhao H.-Q.
    Wang X.-W.
    Li J.-L.
    Li Z.-X.
    Xiao Y.-H.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (03): : 1140 - 1153
  • [7] Fine-grained activity recognition by aggregating abstract object usage
    Patterson, DJ
    Fox, D
    Kautz, H
    Philipose, M
    NINTH IEEE INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, PROCEEDINGS, 2005, : 44 - 51
  • [8] Multimodal Wearable Sensing for Fine-Grained Activity Recognition in Healthcare
    De, Debraj
    Bharti, Pratool
    Das, Sajal K.
    Chellappan, Sriram
    IEEE INTERNET COMPUTING, 2015, 19 (05) : 26 - 35
  • [9] Fine-Grained Activity Recognition with Holistic and Pose Based Features
    Pishchulin, Leonid
    Andriluka, Mykhaylo
    Schiele, Bernt
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 678 - 689
  • [10] Fine-grained Vehicle Recognition Using Hierarchical Fine-Tuning Strategy for Urban Surveillance Videos
    Zhang, Qiang
    Zhuo, Li
    Hu, Xiaochen
    Zhan, Jing
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 233 - 236