YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

被引：4896

作者：

Wang, Chien-Yao ^{[1
]}

Bochkovskiy, Alexey

Liao, Hong-Yuan Mark ^{[1
]}

机构：

[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00721

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real-time object detection is one of the most important research topics in computer vision. As new approaches regarding architecture optimization and training optimization are continually being developed, we have found two research topics that have spawned when dealing with these latest state-of-the-art methods. To address the topics, we propose a trainable bag-of-freebies oriented solution. We combine the flexible and efficient training tools with the proposed architecture and the compound scaling method. YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 120 FPS and has the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100. Source code is released in https://github.com/WongKinYiu/yolov7.

引用

页码：7464 / 7475

页数：12

共 96 条

[1] [Anonymous], 2020, P IEEE CVF C COMP VI
[2] Experimental and numerical study on structural performance of reinforced concrete box sewer with localized extreme defect
Bao, Yuequan
Feng, Dongyang
Ma, Nan
Zhu, Hehua
Rabczuk, Timon
[J]. UNDERGROUND SPACE, 2018, 3 (02) : 166 - 179
[3] Bello Irwan, 2021, ADV NEUR IN, V34
[4] Bochkovskiy A., 2020, ARXIV, DOI DOI 10.48550/ARXIV.2004.10934
[5] Ensemble deep learning in bioinformatics
Cao, Yue
Geddes, Thomas Andrew
Yang, Jean Yee Hwa
Yang, Pengyi
[J]. NATURE MACHINE INTELLIGENCE, 2020, 2 (09) : 500 - 508
[6] Carion N., 2020, P EUR C COMP VIS GLA, P213, DOI DOI 10.1007/978-3-030-58452-813
[7] AP-Loss for Accurate One-Stage Object Detection
Chen, Kean
Lin, Weiyao
Li, Jianguo
See, John
Wang, Ji
Zou, Junni
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 3782 - 3798
[8] Learning 3-D Face Shape From Diverse Sources With Cross-Domain Face Synthesis
Chen, Zhuo
Guan, Tao
Wang, Yuesong
Luo, Yawei
Xu, Luoyuan
Liu, Wenkai
[J]. IEEE MULTIMEDIA, 2023, 30 (01) : 7 - 16
[9] Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving
Choi, Jiwoong
Chun, Dayoung
Kim, Hyun
Lee, Hyuk-Jae
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 502 - 511
[10] Dynamic Head: Unifying Object Detection Heads with Attentions
Dai, Xiyang
Chen, Yinpeng
Xiao, Bin
Chen, Dongdong
Liu, Mengchen
Yuan, Lu
Zhang, Lei
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378

← 1 2 3 4 5 6 7 8 9 10 →