ModuleNet: Knowledge-Inherited Neural Architecture Search

被引:15
|
作者
Chen, Yaran [1 ,2 ]
Gao, Ruiyuan [3 ]
Liu, Fenggang [4 ]
Zhao, Dongbin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Coll Artificial Intelligence, Beijing 100049, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Beijing Inst Technol, Coll Automat, Beijing 100811, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer architecture; Task analysis; Knowledge based systems; Microprocessors; Statistics; Sociology; Computational modeling; Evaluation algorithm; knowledge inherited; neural architecture search (NAS); GENETIC ALGORITHM; MODEL;
D O I
10.1109/TCYB.2021.3078573
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although neural the architecture search (NAS) can bring improvement to deep models, it always neglects precious knowledge of existing models. The computation and time costing property in NAS also means that we should not start from scratch to search, but make every attempt to reuse the existing knowledge. In this article, we discuss what kind of knowledge in a model can and should be used for a new architecture design. Then, we propose a new NAS algorithm, namely, ModuleNet, which can fully inherit knowledge from the existing convolutional neural networks. To make full use of the existing models, we decompose existing models into different modules, which also keep their weights, consisting of a knowledge base. Then, we sample and search for a new architecture according to the knowledge base. Unlike previous search algorithms, and benefiting from inherited knowledge, our method is able to directly search for architectures in the macrospace by the NSGA-II algorithm without tuning parameters in these modules. Experiments show that our strategy can efficiently evaluate the performance of a new architecture even without tuning weights in convolutional layers. With the help of knowledge we inherited, our search results can always achieve better performance on various datasets (CIFAR10, CIFAR100, and ImageNet) over original architectures.
引用
收藏
页码:11661 / 11671
页数:11
相关论文
共 50 条
  • [31] Enhanced Few-Shot Malware Traffic Classification via Integrating Knowledge Transfer With Neural Architecture Search
    Zhang, Xixi
    Wang, Qin
    Qin, Maoyang
    Wang, Yu
    Ohtsuki, Tomoaki
    Adebisi, Bamidele
    Sari, Hikmet
    Gui, Guan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5245 - 5256
  • [32] RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning
    Tan, Hao
    Cheng, Ran
    Huang, Shihua
    He, Cheng
    Qiu, Changxiao
    Yang, Fan
    Luo, Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 475 - 489
  • [33] Attention-Based Neural Architecture Search for Person Re-Identification
    Zhou, Qinqin
    Zhong, Bineng
    Liu, Xin
    Ji, Rongrong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6627 - 6639
  • [34] Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer
    Liu, Dichao
    Yamasaki, Toshihiko
    Wang, Yu
    Mase, Kenji
    Kato, Jien
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 764 - 777
  • [35] AutoMER: Spatiotemporal Neural Architecture Search for Microexpression Recognition
    Verma, Monu
    Reddy, M. Satish Kumar
    Meedimale, Yashwanth Reddy
    Mandal, Murari
    Vipparthi, Santosh Kumar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6116 - 6128
  • [36] AutoNAS: Automatic Neural Architecture Search for Hyperspectral Unmixing
    Han, Zhu
    Hong, Danfeng
    Gao, Lianru
    Zhang, Bing
    Huang, Min
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [37] Collaborative Neural Architecture Search for Personalized Federated Learning
    Liu, Yi
    Guo, Song
    Zhang, Jie
    Hong, Zicong
    Zhan, Yufeng
    Zhou, Qihua
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (01) : 250 - 262
  • [38] Reinforcement Learning for Neural Architecture Search in Hyperspectral Unmixing
    Han, Zhu
    Hong, Danfeng
    Gao, Lianru
    Roy, Swalpa Kumar
    Zhang, Bing
    Chanussot, Jocelyn
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [39] Privacy-Preserving Federated Neural Architecture Search With Enhanced Robustness for Edge Computing
    Zhang, Feifei
    Li, Mao
    Ge, Jidong
    Tang, Fenghui
    Zhang, Sheng
    Wu, Jie
    Luo, Bin
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 2234 - 2252
  • [40] MedNAS: Multiscale Training-Free Neural Architecture Search for Medical Image Analysis
    Wang, Yan
    Zhen, Liangli
    Zhang, Jianwei
    Li, Miqing
    Zhang, Lei
    Wang, Zizhou
    Feng, Yangqin
    Xue, Yu
    Wang, Xiao
    Chen, Zheng
    Luo, Tao
    Goh, Rich Siow Mong
    Liu, Yong
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) : 668 - 681