Max Margin AND/OR Graph learning for parsing the human body

被引:0
|
作者
Zhu, Long [1 ]
Chen, Yuanhao [2 ]
Lu, Yifei [3 ]
Lin, Chenxi [4 ]
Yuille, Alan [5 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90024 USA
[2] Univ Sci & Technol China, Beijing, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai 200030, Peoples R China
[4] Microsoft Res Asia, Shanghai, Peoples R China
[5] Univ Calif Los Angeles, Dept Stat Psychol & Comp Sci, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel structure learning method, Max Margin AND/OR Graph (MM-AOG), for parsing the human body into parts and recovering their poses. Our method represents the human body and its parts by an AND/OR graph, which is a multi-level mixture of Markov Random Fields (MRFs). Max-margin learning, which is a generalization of the training algorithm for support vector machines (SVMs), is used to learn the parameters of the AND/OR graph model discriminatively. There are four advantages from this combination of AND/OR graphs and max-margin learning. Firstly, the AND/OR graph allows us to handle enormous articulated poses with a compact graphical model. Secondly, max-margin learning has more discriminative power than the traditional maximum likelihood approach. Thirdly, the parameters of the AND/OR graph model are optimized globally. In particular, the weights of the appearance model for individual nodes and the relative importance of spatial relationships between nodes are learnt simultaneously. Finally, the kernel trick can be used to handle high dimensional features and to enable complex similarity measure of shapes. We perform comparison experiments on the baseball datasets, showing significant improvements over state of the art methods.
引用
收藏
页码:3458 / +
页数:3
相关论文
共 50 条
  • [1] Max-Margin Incremental CCG Parsing
    Stanojevic, Milos
    Steedman, Mark
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4111 - 4122
  • [2] Optimizing for Measure of Performance in Max-Margin Parsing
    Bauer, Alexander
    Nakajima, Shinichi
    Goernitz, Nico
    Mueller, Klaus-Robert
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2680 - 2684
  • [3] Graphonomy: Universal Human Parsing via Graph Transfer Learning
    Gong, Ke
    Gao, Yiming
    Liang, Xiaodan
    Shen, Xiaohui
    Wang, Meng
    Lin, Liang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7442 - 7451
  • [4] Max Margin Learning of Hierarchical Configural Deformable Templates (HCDTs) for Efficient Object Parsing and Pose Estimation
    Long (Leo) Zhu
    Yuanhao Chen
    Chenxi Lin
    Alan Yuille
    International Journal of Computer Vision, 2011, 93 : 1 - 21
  • [5] Max Margin Learning of Hierarchical Configural Deformable Templates (HCDTs) for Efficient Object Parsing and Pose Estimation
    Zhu, Long
    Chen, Yuanhao
    Lin, Chenxi
    Yuille, Alan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 93 (01) : 1 - 21
  • [6] Multilabel learning based adaptive graph convolutional network for human parsing
    Hao, Huaqing
    Liu, Weibin
    Xing, Weiwei
    Zhang, Shunli
    PATTERN RECOGNITION, 2022, 127
  • [7] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423
  • [8] Max-Margin Contrastive Learning
    Shah, Anshul
    Sra, Suvrit
    Chellappa, Rama
    Cherian, Anoop
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8220 - 8230
  • [9] Learning compositional structures for semantic graph parsing
    Groschwitz, Jonas
    Fowlie, Meaghan
    Koller, Alexander
    SPNLP 2021: THE 5TH WORKSHOP ON STRUCTURED PREDICTION FOR NLP, 2021, : 22 - 32
  • [10] Learning to transfer focus of graph neural network for scene graph parsing
    Jiang, Junjie
    He, Zaixing
    Zhang, Shuyou
    Zhao, Xinyue
    Tan, Jianrong
    PATTERN RECOGNITION, 2021, 112