Real-Time Human Group Detection and Clustering in Crowded Environments Using Enhanced Multi-Object Tracking

被引:1
作者
Lee, Hyunmin [1 ]
Kang, Donggoo [2 ]
Park, Hasil [2 ]
Park, Sangwoo [2 ]
Jeong, Dasol [2 ]
Paik, Joonki [1 ,2 ]
机构
[1] Chung Ang Univ, Dept Artificial Intelligence, Seoul 06974, South Korea
[2] Chung Ang Univ, Dept Image, Seoul 06974, South Korea
基金
新加坡国家研究基金会;
关键词
Pedestrians; Real-time systems; Accuracy; Heuristic algorithms; Tracking; Object recognition; Faces; Deep learning; Airports; Visualization; Multi-object tracking; visual surveillance; group detection;
D O I
10.1109/ACCESS.2024.3503661
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Group detection is a critical yet challenging task in video-based applications such as surveillance analysis, especially in crowded and dynamic environments where complex pedestrian interactions occur. Traditional trajectory-based methods often struggle with occlusions and overlapping behaviors, leading to inaccurate group identification. To address these limitations, we propose a novel algorithm that integrates an optimized YOLOv8 model with DeepSORT tracking, enhancing both detection accuracy and real time performance. Our approach uniquely combines high-precision object detection with stable multi-object tracking, ensuring consistent identification of individuals and groups over time, even in high-density scenarios. Additionally, we introduce an innovative method of constructing an adjacency matrix by integrating Euclidean distances and bounding box diagonal ratios, which is transformed into a graph to intricately analyze and predict complex group dynamics in real time. Experimental results on real-world airport CCTV footage demonstrate that our method significantly outperforms existing approaches, achieving higher precision and recall rates. Furthermore, the algorithm operates efficiently on standard hardware, indicating strong practical feasibility for real-time applications in public spaces. While challenges such as misclassification due to incomplete data annotations and occlusions remain, our study showcases the potential of integrating spatial and temporal data to advance real-time group detection and tracking, aiming to improve crowd management systems in public spaces.
引用
收藏
页码:184028 / 184039
页数:12
相关论文
共 23 条
[1]  
Akbari H., 2021, Indonesian Sport Innov. Rev., V3, P47
[2]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[3]  
Chen ML, 2017, INT CONF ACOUST SPEE, P1378, DOI 10.1109/ICASSP.2017.7952382
[4]   StrongSORT: Make DeepSORT Great Again [J].
Du, Yunhao ;
Zhao, Zhicheng ;
Song, Yang ;
Zhao, Yanyun ;
Su, Fei ;
Gong, Tao ;
Meng, Hongying .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :8725-8737
[5]   JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection [J].
Ehsanpour, Mahsa ;
Saleh, Fatemeh ;
Savarese, Silvio ;
Reid, Ian ;
Rezatofighi, Hamid .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20951-20960
[6]   Panoramic Human Activity Recognition [J].
Han, Ruize ;
Yan, Haomin ;
Li, Jiacheng ;
Wang, Songmiao ;
Feng, Wei ;
Wang, Song .
COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 :244-261
[7]   Real-time Trajectory-based Social Group Detection [J].
Jahangard, Simindokht ;
Hayat, Munawar ;
Rezatofighi, Hamid .
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, :1901-1908
[8]  
Jie Shao, 2018, Pattern Recognition and Image Analysis, V28, P282
[9]   Frechet distance-based cluster analysis for multi-dimensional functional data [J].
Kang, Ilsuk ;
Choi, Hosik ;
Yoon, Young Joo ;
Park, Junyoung ;
Kwon, Soon-Sun ;
Park, Cheolwoo .
STATISTICS AND COMPUTING, 2023, 33 (04)
[10]   Self-supervised Social Relation Representation for Human Group Detection [J].
Li, Jiacheng ;
Han, Ruize ;
Yan, Haomin ;
Qian, Zekun ;
Feng, Wei ;
Wang, Song .
COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 :142-159