Max Margin AND/OR Graph learning for parsing the human body

Citations: 0

Authors:

Zhu, Long [1]

Chen, Yuanhao [2]

Lu, Yifei [3]

Lin, Chenxi [4]

Yuille, Alan [5]

Affiliations:

[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90024 USA

[2] Univ Sci & Technol China, Beijing, Peoples R China

[3] Shanghai Jiao Tong Univ, Shanghai 200030, Peoples R China

[4] Microsoft Res Asia, Shanghai, Peoples R China

[5] Univ Calif Los Angeles, Dept Stat Psychol & Comp Sci, Los Angeles, CA 90024 USA

Source:

2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12 | 2008

Funding:

U.S. National Science Foundation

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a novel structure learning method, Max Margin AND/OR Graph (MM-AOG), for parsing the human body into parts and recovering their poses. Our method represents the human body and its parts by an AND/OR graph, which is a multi-level mixture of Markov Random Fields (MRFs). Max-margin learning, which is a generalization of the training algorithm for support vector machines (SVMs), is used to learn the parameters of the AND/OR graph model discriminatively. This combination of AND/OR graphs and max-margin learning has four advantages. First, the AND/OR graph allows us to handle an enormous number of articulated poses with a compact graphical model. Second, max-margin learning has more discriminative power than the traditional maximum likelihood approach. Third, the parameters of the AND/OR graph model are optimized globally. In particular, the weights of the appearance model for individual nodes and the relative importance of spatial relationships between nodes are learnt simultaneously. Finally, the kernel trick can be used to handle high-dimensional features and to enable complex similarity measures between shapes. We perform comparison experiments on the baseball datasets, showing significant improvements over state-of-the-art methods.

Pages: 3458 / +

Page count: 3

50 items in total

[1] Max-Margin Incremental CCG Parsing
Stanojevic, Milos
Steedman, Mark
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4111 - 4122
[2] Optimizing for Measure of Performance in Max-Margin Parsing
Bauer, Alexander
Nakajima, Shinichi
Goernitz, Nico
Mueller, Klaus-Robert
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2680 - 2684
[3] Graphonomy: Universal Human Parsing via Graph Transfer Learning
Gong, Ke
Gao, Yiming
Liang, Xiaodan
Shen, Xiaohui
Wang, Meng
Lin, Liang
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7442 - 7451
[4] Max Margin Learning of Hierarchical Configural Deformable Templates (HCDTs) for Efficient Object Parsing and Pose Estimation
Long (Leo) Zhu
Yuanhao Chen
Chenxi Lin
Alan Yuille
International Journal of Computer Vision, 2011, 93 : 1 - 21
[5] Max Margin Learning of Hierarchical Configural Deformable Templates (HCDTs) for Efficient Object Parsing and Pose Estimation
Zhu, Long
Chen, Yuanhao
Lin, Chenxi
Yuille, Alan
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 93 (01) : 1 - 21
[6] Multilabel learning based adaptive graph convolutional network for human parsing
Hao, Huaqing
Liu, Weibin
Xing, Weiwei
Zhang, Shunli
PATTERN RECOGNITION, 2022, 127
[7] Learning Human-Object Interactions by Graph Parsing Neural Networks
Qi, Siyuan
Wang, Wenguan
Jia, Baoxiong
Shen, Jianbing
Zhu, Song-Chun
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423
[8] Max-Margin Contrastive Learning
Shah, Anshul
Sra, Suvrit
Chellappa, Rama
Cherian, Anoop
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8220 - 8230
[9] Learning compositional structures for semantic graph parsing
Groschwitz, Jonas
Fowlie, Meaghan
Koller, Alexander
SPNLP 2021: THE 5TH WORKSHOP ON STRUCTURED PREDICTION FOR NLP, 2021, : 22 - 32
[10] Learning to transfer focus of graph neural network for scene graph parsing
Jiang, Junjie
He, Zaixing
Zhang, Shuyou
Zhao, Xinyue
Tan, Jianrong
PATTERN RECOGNITION, 2021, 112