Tensor Dropout for Robust Learning

被引:15
|
作者
Kolbeinsson, Arinbjorn [1 ]
Kossaifi, Jean [2 ]
Panagakis, Yannis [3 ]
Bulat, Adrian [4 ]
Anandkumar, Animashree [2 ]
Tzoulaki, Ioanna [5 ,6 ,7 ]
Matthews, Paul M. [5 ,6 ]
机构
[1] Imperial Coll London, Dept Epidemiol & Biostat, 4615 London, London, England
[2] NVIDIA, Santa Clara, CA USA
[3] Natl & Kapodistrian Univ Athens, Dept Informat & Telecommun, Athens 1584, Greece
[4] Samsung AI, Cambridge CB1 2RE, England
[5] Imperial Coll London, 4615 London, London, England
[6] Imperial Coll London, UK Dementia Res Inst, 4615 London, London, England
[7] Univ Ioannina, Med Sch, Ioannina, Greece
基金
英国工程与自然科学研究理事会; 英国医学研究理事会;
关键词
Tensors; Training; Robustness; Magnetic resonance imaging; Perturbation methods; Diseases; Deep learning; randomized tensor regression; robustness; stochastic regularization; tensor dropout; tensor methods; tensor regression; tensor regression layers; NEURAL-NETWORKS; BRAIN;
D O I
10.1109/JSTSP.2021.3064182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
CNNs achieve high levels of performance by leveraging deep, over-parametrized neural architectures, trained on large datasets. However, they exhibit limited generalization abilities outside their training domain and lack robustness to corruptions such as noise and adversarial attacks. To improve robustness and obtain more computationally and memory efficient models, better inductive biases are needed. To provide such inductive biases, tensor layers have been successfully proposed to leverage multi-linear structure through higher-order computations. In this paper, we propose tensor dropout, a randomization technique that can be applied to tensor factorizations, such as those parametrizing tensor layers. In particular, we study tensor regression layers, parametrized by low-rank weight tensors and augmented with our proposed tensor dropout. We empirically show that our approach improves generalization for image classification on ImageNet and CIFAR-100. We also establish state-of-the-art accuracy for phenotypic trait prediction on the largest available dataset of brain MRI (U.K. Biobank), where multi-linear structure is paramount. In all cases, we demonstrate superior performance and significantly improved robustness, both to noisy inputs and to adversarial attacks. We establish the theoretical validity of our approach and the regularizing effect of tensor dropout by demonstrating the link between randomized tensor regression with tensor dropout and deterministic regularized tensor regression.
引用
收藏
页码:630 / 640
页数:11
相关论文
共 50 条
  • [31] Robust Channel Learning for Large-Scale Radio Speaker Verification
    Yang, Wenhao
    Wei, Jianguo
    Lu, Wenhuan
    Li, Lei
    Lu, Xugang
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2025, 19 (01) : 248 - 259
  • [32] An Experimental Study of Byzantine-Robust Aggregation Schemes in Federated Learning
    Li, Shenghui
    Ngai, Edith
    Voigt, Thiemo
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 975 - 988
  • [33] Accurate and Robust Object Detection via Selective Adversarial Learning With Constraints
    Chen, Jianpin
    Li, Heng
    Gao, Qi
    Liang, Junling
    Zhang, Ruipeng
    Yin, Liping
    Chai, Xinyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5593 - 5605
  • [34] Occluded Target Recognition in SAR Imagery With Scattering Excitation Learning and Channel Dropout
    He, Dunyun
    Guo, Weiwei
    Zhang, Tao
    Zhang, Zenghui
    Yu, Wenxian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [35] Toward Compact and Robust Model Learning Under Dynamically Perturbed Environments
    Luo, Hui
    Zhuang, Zhuangwei
    Li, Yuanqing
    Tan, Mingkui
    Chen, Cen
    Zhang, Jianlin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4857 - 4873
  • [36] The dropout learning algorithm
    Baldi, Pierre
    Sadowski, Peter
    ARTIFICIAL INTELLIGENCE, 2014, 210 : 78 - 122
  • [37] Robust Tensor Subspace Learning for Incomplete Multi-View Clustering
    Liang, Cheng
    Wang, Daoyuan
    Zhang, Huaxiang
    Zhang, Shichao
    Guo, Fei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6934 - 6948
  • [38] Robust Online Learning Over Networks
    Bastianello, Nicola
    Deplano, Diego
    Franceschelli, Mauro
    Johansson, Karl H.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (02) : 933 - 946
  • [39] Enhancing Federated Learning Robustness Using Locally Benignity-Assessable Bayesian Dropout
    Xue, Jingjing
    Sun, Sheng
    Liu, Min
    Li, Qi
    Xu, Ke
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 2464 - 2479
  • [40] Dropout-Based Robust Self-Supervised Deep Learning for Seismic Data Denoising
    Chen, Gui
    Liu, Yang
    Zhang, Mi
    Zhang, Haoran
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19