Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes

被引：2

作者：

Zhao, Haoyu ^{[1
,2
,3
]}

Wang, Qi ^{[4
]}

Zhan, Guowei ^{[1
]}

Min, Weidong ^{[1
,5
,6
]}

Zou, Yi ^{[1
]}

Cui, Shimiao ^{[1
]}

机构：

[1] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China

[3] Shanghai Collaborat Innovat Ctr Intelligent Visual, Shanghai 200433, Peoples R China

[4] Nanchang Univ, Sch Software, Nanchang, Peoples R China

[5] Nanchang Univ, Inst Metaverse, Nanchang 330031, Peoples R China

[6] Jiangxi Key Lab Smart City, Nanchang 330031, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

基金：

中国国家自然科学基金;

关键词：

Adaptation models; Training; Task analysis; Labeling; Computer science; Kernel; Feature extraction; Crowd counting; multi-head parallel network; need only one more point; perspective-adaptive;

D O I：

10.1109/TMM.2022.3230337

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, solving the crowd counting problem under occlusion and complex perspective is a hot but difficult topic. Existing methods mainly constructed counters in parallel perspective, but when facing complex perspective, such as the influences of height difference and heavy occlusions, they fail to get good accuracy. To alleviate these problems, this work proposes a novel and interesting framework NOOMP (Need Only One More Point) for perspective adaptation crowd counting task in complex nature scenes. Firstly, this work considers that the common scenes in our daily life usually have the height difference, which brings complex perspective to crowd counting. So, a new labeled method, Absolute-geometry Gaussian Generation is proposed, which only needs one more point for each person in image and gets better accuracy. Secondly, the NOOMP framework consists of meta-learning structure and uses the few-shot way to train the counting model, which can implement the perspective adaptation effective and solve the problem of high label cost. Thirdly, for fitting the characteristic of few-shot learning, this work proposes a new Multi-head Parallel Network (MPNet) for NOOMP. The feature of crowd is extracted by MPNet, which is a hybrid structure composed of shallow network and deep network. This network can save the features of shallow network and the deeper network effectively, which makes MPNet performs well in NOOMP. In addition, this work collects a new dataset, named Multiple Height Differences in Mall (MHDM) for NOOMP, which contains images of different views and height differences from shopping malls and supermarkets. Experiments based on MHDM and other benchmarks show that the NOOMP has good performances in model accuracy and works well for solving perspective change problem.

引用

页码：1414 / 1426

页数：13

共 5 条

[1] Adversarial Learning for Multiscale Crowd Counting Under Complex Scenes
Zhou, Yuan
Yang, Jianxing
Li, Hongru
Cao, Tao
Kung, Sun-Yuan
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (11) : 5423 - 5432
[2] Depth Information Guided Crowd Counting for complex crowd scenes
Xu, Mingliang
Ge, Zhaoyang
Jiang, Xiaoheng
Cui, Gaoge
Lv, Pei
Zhou, Bing
Xu, Changsheng
PATTERN RECOGNITION LETTERS, 2019, 125 : 563 - 569
[3] Crowd counting in complex scenes based on an attention aware CNN network
Li, Zhaoxin
Lu, Shuhua
Lan, Lingqiang
Liu, Qiyuan
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
[4] Conditional Marked Point Process-based Crowd Counting in Sparsely and Moderately Crowded Scenes
Yoon, Yongsang
Gwak, Jeonghwan
Song, Jong-In
Jeon, Moongu
2016 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2016, : 215 - 220
[5] PiPCS: Perspective Independent Point Cloud Simplifier for Complex 3D Indoor Scenes
Ebrahimi, Ali
Czarnuch, Stephen
IEEE ACCESS, 2024, 12 : 126983 - 127006

← 1 →