Low-resolution activity recognition using super-resolution and model ensemble networks

被引：2

作者：

Liu, Tinglong ^{[1
]}

Wang, Haiyan ^{[2
]}

机构：

[1] Dalian Polytech Univ, Ctr Informat Technol, Dalian, Peoples R China

[2] Digital Lib & Shared Engn Informat Network Ctr, Dalian Lib, Dalian, Peoples R China

来源：

ETRI JOURNAL | 2025年 / 47卷 / 02期

关键词：

activity recognition; attention mechanism; low-resolution video; model ensemble; super-resolution;

D O I：

10.4218/etrij.2023-0523

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In real-world video super-resolution, the complexity and diversity of degradations pose substantial challenges during both training and inference. Videos captured in real-world settings often depict activities at varying resolutions. Typically, these activities are filmed from a distance that reduces the resolution of imagery, which thus lacks discriminative features. To address this problem, we introduce an activity recognition solution. First, a unique integration of data transformation and attention-based average discriminator are employed for super-resolution feature augmentation. This approach mitigates the lack of discriminative cues in low-resolution videos. Subsequently, high-resolution features extracted from the recovered data are directly fed into a model ensemble for activity recognition. We evaluate the resulting method on the TinyVIRAT-v2 and HMDB51 datasets, achieving improved visual quality by leveraging the super-resolution and model ensemble strategy. The proposed method enhances the quality of textures and boosts activity recognition in low-resolution videos.

引用

页码：303 / 311

页数：9

共 28 条

[1]

AtaerCansizoglu E., 2019, VERIFICATION VERY LO

[2]

Bai Y., 2018, SOD MTGAN SMALL OBJE, P206

[3]

Chen B., 2022, ARXIV, DOI DOI 10.48550/ARXIV.2209.14711

[4] Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions [J].

Chen, Jiawei ;

Wu, Jonathan ;

Konrad, Janusz ;

Ishwar, Prakash .

2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :139-147

[5] TinyVIRAT: Low-resolution Video Action Recognition [J].

Demir, Ugur ;

Rawat, Yogesh S. ;

Shah, Mubarak .

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7387-7394

[6] Learning Spatiotemporal Features with 3D Convolutional Networks [J].

Du Tran ;

Bourdev, Lubomir ;

Fergus, Rob ;

Torresani, Lorenzo ;

Paluri, Manohar .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497

[7] Generative Adversarial Networks [J].

Goodfellow, Ian ;

Pouget-Abadie, Jean ;

Mirza, Mehdi ;

Xu, Bing ;

Warde-Farley, David ;

Ozair, Sherjil ;

Courville, Aaron ;

Bengio, Yoshua .

COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144

[8]

He J., DELVING HIGHQUALITY, P1

[9] Extreme Low-Resolution Activity Recognition Using a Super-Resolution-Oriented Generative Adversarial Network [J].

Hou, Mingzheng ;

Liu, Song ;

Zhou, Jiliu ;

Zhang, Yi ;

Feng, Ziliang .

MICROMACHINES, 2021, 12 (06)

[10] Simultaneous denoising and super-resolution of optical coherence tomography images based on a generative adversarial network [J].

Huang, Yongqiang ;

Lu, Zexin ;

Shao, Zhimin ;

Ran, Maosong ;

Zhou, Jiliu ;

Fang, Leyuan ;

Zhang, Yi .

OPTICS EXPRESS, 2019, 27 (09) :12289-12307

← 1 2 3 →