ModuleNet: Knowledge-Inherited Neural Architecture Search
Cited by: 15
Authors: Chen, Yaran [1,2]; Gao, Ruiyuan [3]; Liu, Fenggang [4]; Zhao, Dongbin [1,2]
Affiliations:
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Coll Artificial Intelligence, Beijing 100049, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Beijing Inst Technol, Coll Automat, Beijing 100811, Peoples R China
Funding:
National Natural Science Foundation of China;
Keywords:
Computer architecture; Task analysis; Knowledge based systems; Microprocessors; Statistics; Sociology; Computational modeling; Evaluation algorithm; knowledge inherited; neural architecture search (NAS); GENETIC ALGORITHM; MODEL
DOI: 10.1109/TCYB.2021.3078573
CLC Number: TP [Automation Technology, Computer Technology];
Discipline Code: 0812;
Abstract:
Although neural architecture search (NAS) can improve deep models, it typically neglects the precious knowledge embedded in existing models. The high computation and time cost of NAS also means that we should not search from scratch but make every attempt to reuse existing knowledge. In this article, we discuss what kind of knowledge in a model can and should be used for a new architecture design. Then, we propose a new NAS algorithm, namely, ModuleNet, which can fully inherit knowledge from existing convolutional neural networks. To make full use of existing models, we decompose them into different modules that keep their weights, which together constitute a knowledge base. Then, we sample and search for a new architecture according to the knowledge base. Unlike previous search algorithms, and benefiting from the inherited knowledge, our method is able to directly search for architectures in the macrospace using the NSGA-II algorithm without tuning parameters in these modules. Experiments show that our strategy can efficiently evaluate the performance of a new architecture even without tuning weights in convolutional layers. With the help of the inherited knowledge, our search results consistently achieve better performance than the original architectures on various datasets (CIFAR10, CIFAR100, and ImageNet).
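The core idea above (decompose pretrained CNNs into weight-frozen modules, assemble candidates, and evaluate without tuning convolutional weights) can be illustrated with a minimal PyTorch sketch. This is an illustrative assumption, not the authors' implementation: the module names, the choice of backbones, the IN_CH table, and the 1x1 adapter scheme are all hypothetical, and the NSGA-II search itself (e.g., via a library such as pymoo) is omitted, with only the frozen-module candidate evaluation shown.

```python
# Hypothetical sketch of knowledge-inherited module reuse, loosely following
# the abstract: split pretrained CNNs into weight-preserving modules (the
# "knowledge base"), then assemble a candidate macro-architecture whose
# convolutional weights are never tuned. Names and wiring are illustrative.
import torch
import torch.nn as nn
from torchvision import models

def build_knowledge_base():
    """Split pretrained backbones into reusable modules and freeze them."""
    resnet = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
    vgg = models.vgg11(weights=models.VGG11_Weights.DEFAULT)
    kb = {
        "resnet_stage1": nn.Sequential(resnet.conv1, resnet.bn1,
                                       resnet.relu, resnet.maxpool,
                                       resnet.layer1),
        "resnet_stage2": resnet.layer2,
        "vgg_block": nn.Sequential(*list(vgg.features.children())[:6]),
    }
    for m in kb.values():
        for p in m.parameters():
            p.requires_grad = False  # inherit knowledge: no conv tuning
    return kb

# Assumed input channels of each module (needed to size the adapters).
IN_CH = {"resnet_stage1": 3, "resnet_stage2": 64, "vgg_block": 3}

class Candidate(nn.Module):
    """A macro-architecture: a sequence of inherited modules plus adapters."""
    def __init__(self, kb, module_names, num_classes=10):
        super().__init__()
        layers = []
        for name in module_names:
            # 1x1 adapter maps the incoming channels to the module's input;
            # adapters and head are the only trainable parts.
            layers.append(nn.LazyConv2d(IN_CH[name], kernel_size=1))
            layers.append(kb[name])
        self.features = nn.Sequential(*layers)
        self.head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                  nn.LazyLinear(num_classes))

    def forward(self, x):
        return self.head(self.features(x))

kb = build_knowledge_base()
net = Candidate(kb, ["resnet_stage1", "vgg_block"])
net(torch.randn(2, 3, 32, 32))  # dry run materializes the lazy layers
trainable = [p for p in net.parameters() if p.requires_grad]
```

Because the knowledge-base modules are frozen, `trainable` contains only the adapter and head parameters, so a candidate can be scored cheaply; a multi-objective search such as NSGA-II would then rank sequences of module names by, for example, validation accuracy and model size.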
Pages: 11661-11671 (11 pages)