Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes

被引:2
|
作者
Zhao, Haoyu [1 ,2 ,3 ]
Wang, Qi [4 ]
Zhan, Guowei [1 ]
Min, Weidong [1 ,5 ,6 ]
Zou, Yi [1 ]
Cui, Shimiao [1 ]
机构
[1] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Shanghai Collaborat Innovat Ctr Intelligent Visual, Shanghai 200433, Peoples R China
[4] Nanchang Univ, Sch Software, Nanchang, Peoples R China
[5] Nanchang Univ, Inst Metaverse, Nanchang 330031, Peoples R China
[6] Jiangxi Key Lab Smart City, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Training; Task analysis; Labeling; Computer science; Kernel; Feature extraction; Crowd counting; multi-head parallel network; need only one more point; perspective-adaptive;
D O I
10.1109/TMM.2022.3230337
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, solving the crowd counting problem under occlusion and complex perspective is a hot but difficult topic. Existing methods mainly constructed counters in parallel perspective, but when facing complex perspective, such as the influences of height difference and heavy occlusions, they fail to get good accuracy. To alleviate these problems, this work proposes a novel and interesting framework NOOMP (Need Only One More Point) for perspective adaptation crowd counting task in complex nature scenes. Firstly, this work considers that the common scenes in our daily life usually have the height difference, which brings complex perspective to crowd counting. So, a new labeled method, Absolute-geometry Gaussian Generation is proposed, which only needs one more point for each person in image and gets better accuracy. Secondly, the NOOMP framework consists of meta-learning structure and uses the few-shot way to train the counting model, which can implement the perspective adaptation effective and solve the problem of high label cost. Thirdly, for fitting the characteristic of few-shot learning, this work proposes a new Multi-head Parallel Network (MPNet) for NOOMP. The feature of crowd is extracted by MPNet, which is a hybrid structure composed of shallow network and deep network. This network can save the features of shallow network and the deeper network effectively, which makes MPNet performs well in NOOMP. In addition, this work collects a new dataset, named Multiple Height Differences in Mall (MHDM) for NOOMP, which contains images of different views and height differences from shopping malls and supermarkets. Experiments based on MHDM and other benchmarks show that the NOOMP has good performances in model accuracy and works well for solving perspective change problem.
引用
收藏
页码:1414 / 1426
页数:13
相关论文
共 5 条
  • [1] Adversarial Learning for Multiscale Crowd Counting Under Complex Scenes
    Zhou, Yuan
    Yang, Jianxing
    Li, Hongru
    Cao, Tao
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (11) : 5423 - 5432
  • [2] Depth Information Guided Crowd Counting for complex crowd scenes
    Xu, Mingliang
    Ge, Zhaoyang
    Jiang, Xiaoheng
    Cui, Gaoge
    Lv, Pei
    Zhou, Bing
    Xu, Changsheng
    PATTERN RECOGNITION LETTERS, 2019, 125 : 563 - 569
  • [3] Crowd counting in complex scenes based on an attention aware CNN network
    Li, Zhaoxin
    Lu, Shuhua
    Lan, Lingqiang
    Liu, Qiyuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [4] Conditional Marked Point Process-based Crowd Counting in Sparsely and Moderately Crowded Scenes
    Yoon, Yongsang
    Gwak, Jeonghwan
    Song, Jong-In
    Jeon, Moongu
    2016 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2016, : 215 - 220
  • [5] PiPCS: Perspective Independent Point Cloud Simplifier for Complex 3D Indoor Scenes
    Ebrahimi, Ali
    Czarnuch, Stephen
    IEEE ACCESS, 2024, 12 : 126983 - 127006