Rethinking the Impacts of Overfitting and Feature Quality on Small-scale Video Classification

被引:2
作者
Wu, Xuansheng [1 ]
Yang, Feichi [2 ]
Zhou, Tong [3 ]
Lin, Xinyue [4 ]
机构
[1] Univ Georgia, Athens, GA 30602 USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Cent Univ Finance & Econ, Beijing, Peoples R China
[4] Univ Virginia, Charlottesville, VA USA
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
关键词
Video Classification; Small Dataset; Overfitting; Feature Quality; Data Augment; NETWORKS;
D O I
10.1145/3474085.3479226
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While Transformers have yielded impressive results for video classification on large datasets recently, simpler models without the transformer architecture can be promising for small datasets. In this paper, we propose three major techniques to improve feature quality and another three to alleviate overfitting in an attempt to make lightweight models achieve higher performances. In particular, we enhance features of Image Flow by combining temporal information, multi-level features of CNNs, and Text embedding. We alleviate overfitting by removing redundant modal, fine-tuning dropout rate, and augmenting data. In the 2021 Tencent Advertisement Algorithm Competition, the baseline model achieved a GAP score of 0.8019 offline with our strategies. It's worth mentioning that our design works well with the 10-fold method, which produces our final submitting model with a GAP score of 0.8210 online, ranking the 5th among 287 teams. In addition, our solution is among the fastest within the top 10 teams.
引用
收藏
页码:4760 / 4764
页数:5
相关论文
共 38 条
[1]  
Abu-El-Haija Sami, 2016, ARXIV160908675CSCV
[2]  
Alberti Chris, 2019, ARXIV190805054CSCL
[3]  
Bertasius Gedas, 2021, ARXIV210205095CSCV
[4]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]  
Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261
[7]  
He, P IEEE C COMP VIS PA, P770, DOI DOI 10.1109/CVPR.2016.90
[8]  
Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132
[9]   Robust Speaker Verification Using GFCC Based i-Vectors [J].
Jeevan, Medikonda ;
Dhingra, Atul ;
Hanmandlu, M. ;
Panigrahi, B. K. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 1, 2017, 395 :85-91
[10]  
Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039