Low-resolution activity recognition using super-resolution and model ensemble networks

被引:2
作者
Liu, Tinglong [1 ]
Wang, Haiyan [2 ]
机构
[1] Dalian Polytech Univ, Ctr Informat Technol, Dalian, Peoples R China
[2] Digital Lib & Shared Engn Informat Network Ctr, Dalian Lib, Dalian, Peoples R China
关键词
activity recognition; attention mechanism; low-resolution video; model ensemble; super-resolution;
D O I
10.4218/etrij.2023-0523
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In real-world video super-resolution, the complexity and diversity of degradations pose substantial challenges during both training and inference. Videos captured in real-world settings often depict activities at varying resolutions. Typically, these activities are filmed from a distance that reduces the resolution of imagery, which thus lacks discriminative features. To address this problem, we introduce an activity recognition solution. First, a unique integration of data transformation and attention-based average discriminator are employed for super-resolution feature augmentation. This approach mitigates the lack of discriminative cues in low-resolution videos. Subsequently, high-resolution features extracted from the recovered data are directly fed into a model ensemble for activity recognition. We evaluate the resulting method on the TinyVIRAT-v2 and HMDB51 datasets, achieving improved visual quality by leveraging the super-resolution and model ensemble strategy. The proposed method enhances the quality of textures and boosts activity recognition in low-resolution videos.
引用
收藏
页码:303 / 311
页数:9
相关论文
共 28 条
[1]  
AtaerCansizoglu E., 2019, VERIFICATION VERY LO
[2]  
Bai Y., 2018, SOD MTGAN SMALL OBJE, P206
[3]  
Chen B., 2022, ARXIV, DOI DOI 10.48550/ARXIV.2209.14711
[4]   Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions [J].
Chen, Jiawei ;
Wu, Jonathan ;
Konrad, Janusz ;
Ishwar, Prakash .
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :139-147
[5]   TinyVIRAT: Low-resolution Video Action Recognition [J].
Demir, Ugur ;
Rawat, Yogesh S. ;
Shah, Mubarak .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7387-7394
[6]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[7]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[8]  
He J., DELVING HIGHQUALITY, P1
[9]   Extreme Low-Resolution Activity Recognition Using a Super-Resolution-Oriented Generative Adversarial Network [J].
Hou, Mingzheng ;
Liu, Song ;
Zhou, Jiliu ;
Zhang, Yi ;
Feng, Ziliang .
MICROMACHINES, 2021, 12 (06)
[10]   Simultaneous denoising and super-resolution of optical coherence tomography images based on a generative adversarial network [J].
Huang, Yongqiang ;
Lu, Zexin ;
Shao, Zhimin ;
Ran, Maosong ;
Zhou, Jiliu ;
Fang, Leyuan ;
Zhang, Yi .
OPTICS EXPRESS, 2019, 27 (09) :12289-12307