Tensor Dropout for Robust Learning

被引：15

作者：

Kolbeinsson, Arinbjorn ^{[1
]}

Kossaifi, Jean ^{[2
]}

Panagakis, Yannis ^{[3
]}

Bulat, Adrian ^{[4
]}

Anandkumar, Animashree ^{[2
]}

Tzoulaki, Ioanna ^{[5
,6
,7
]}

Matthews, Paul M. ^{[5
,6
]}

机构：

[1] Imperial Coll London, Dept Epidemiol & Biostat, 4615 London, London, England

[2] NVIDIA, Santa Clara, CA USA

[3] Natl & Kapodistrian Univ Athens, Dept Informat & Telecommun, Athens 1584, Greece

[4] Samsung AI, Cambridge CB1 2RE, England

[5] Imperial Coll London, 4615 London, London, England

[6] Imperial Coll London, UK Dementia Res Inst, 4615 London, London, England

[7] Univ Ioannina, Med Sch, Ioannina, Greece

来源：

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING | 2021年 / 15卷 / 03期

基金：

英国工程与自然科学研究理事会; 英国医学研究理事会;

关键词：

Tensors; Training; Robustness; Magnetic resonance imaging; Perturbation methods; Diseases; Deep learning; randomized tensor regression; robustness; stochastic regularization; tensor dropout; tensor methods; tensor regression; tensor regression layers; NEURAL-NETWORKS; BRAIN;

D O I：

10.1109/JSTSP.2021.3064182

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

CNNs achieve high levels of performance by leveraging deep, over-parametrized neural architectures, trained on large datasets. However, they exhibit limited generalization abilities outside their training domain and lack robustness to corruptions such as noise and adversarial attacks. To improve robustness and obtain more computationally and memory efficient models, better inductive biases are needed. To provide such inductive biases, tensor layers have been successfully proposed to leverage multi-linear structure through higher-order computations. In this paper, we propose tensor dropout, a randomization technique that can be applied to tensor factorizations, such as those parametrizing tensor layers. In particular, we study tensor regression layers, parametrized by low-rank weight tensors and augmented with our proposed tensor dropout. We empirically show that our approach improves generalization for image classification on ImageNet and CIFAR-100. We also establish state-of-the-art accuracy for phenotypic trait prediction on the largest available dataset of brain MRI (U.K. Biobank), where multi-linear structure is paramount. In all cases, we demonstrate superior performance and significantly improved robustness, both to noisy inputs and to adversarial attacks. We establish the theoretical validity of our approach and the regularizing effect of tensor dropout by demonstrating the link between randomized tensor regression with tensor dropout and deterministic regularized tensor regression.

引用

页码：630 / 640

页数：11

共 50 条

[31] Robust Channel Learning for Large-Scale Radio Speaker Verification
Yang, Wenhao
Wei, Jianguo
Lu, Wenhuan
Li, Lei
Lu, Xugang
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2025, 19 (01) : 248 - 259
[32] An Experimental Study of Byzantine-Robust Aggregation Schemes in Federated Learning
Li, Shenghui
Ngai, Edith
Voigt, Thiemo
IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 975 - 988
[33] Accurate and Robust Object Detection via Selective Adversarial Learning With Constraints
Chen, Jianpin
Li, Heng
Gao, Qi
Liang, Junling
Zhang, Ruipeng
Yin, Liping
Chai, Xinyu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5593 - 5605
[34] Occluded Target Recognition in SAR Imagery With Scattering Excitation Learning and Channel Dropout
He, Dunyun
Guo, Weiwei
Zhang, Tao
Zhang, Zenghui
Yu, Wenxian
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[35] Toward Compact and Robust Model Learning Under Dynamically Perturbed Environments
Luo, Hui
Zhuang, Zhuangwei
Li, Yuanqing
Tan, Mingkui
Chen, Cen
Zhang, Jianlin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4857 - 4873
[36] The dropout learning algorithm
Baldi, Pierre
Sadowski, Peter
ARTIFICIAL INTELLIGENCE, 2014, 210 : 78 - 122
[37] Robust Tensor Subspace Learning for Incomplete Multi-View Clustering
Liang, Cheng
Wang, Daoyuan
Zhang, Huaxiang
Zhang, Shichao
Guo, Fei
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6934 - 6948
[38] Robust Online Learning Over Networks
Bastianello, Nicola
Deplano, Diego
Franceschelli, Mauro
Johansson, Karl H.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (02) : 933 - 946
[39] Enhancing Federated Learning Robustness Using Locally Benignity-Assessable Bayesian Dropout
Xue, Jingjing
Sun, Sheng
Liu, Min
Li, Qi
Xu, Ke
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 2464 - 2479
[40] Dropout-Based Robust Self-Supervised Deep Learning for Seismic Data Denoising
Chen, Gui
Liu, Yang
Zhang, Mi
Zhang, Haoran
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

← 1 2 3 4 5 →