Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices

被引:0
|
作者
Lee, Jooyeon [1 ]
Park, Junsang [1 ]
Lee, Seunghyun [1 ]
Kung, Jaeha [1 ]
机构
[1] Daegu Gyeongbuk Inst Sci & Technol DGIST, Daegu 42988, South Korea
关键词
Dataflow optimization; neural networks; neural architecture search; neural processing unit;
D O I
10.1145/3513085
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in deep learning have made it possible to implement artificial intelligence in mobile devices. Many studies have put a lot of effort into developing lightweight deep learning models optimized for mobile devices. To overcome the performance limitations of manually designed deep learning models, an automated search algorithm, called neural architecture search (NAS), has been proposed. However, studies on the effect of hardware architecture of the mobile device on the performance of NAS have been less explored. In this article, we show the importance of optimizing a hardware architecture, namely, NPU dataflow, when searching for a more accurate yet fast deep learning model. To do so, we first implement an optimization framework, named FlowOptimizer, for generating a best possible NPU dataflow for a given deep learning operator. Then, we utilize this framework during the latency-aware NAS to find the model with the highest accuracy satisfying the latency constraint. As a result, we show that the searched model with FlowOptimizer outperforms the performance by 87.1% and 92.3% on average compared to the searched model with NVDLA and Eyeriss, respectively, with better accuracy on a proxy dataset. We also show that the searched model can be transferred to a larger model to classify a more complex image dataset, i.e., ImageNet, achieving 0.2%/5.4% higher Top-1/Top-5 accuracy compared to MobileNetV2-1.0 with 3.6x lower latency.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Neural Architecture Search for Optimizing Deep Belief Network Models of fMRI Data
    Qiang, Ning
    Ge, Bao
    Dong, Qinglin
    Ge, Fangfei
    Liu, Tianming
    MULTISCALE MULTIMODAL MEDICAL IMAGING, MMMI 2019, 2020, 11977 : 26 - 34
  • [22] Neural architecture search for the estimation of relative positioning of the autonomous mobile robot
    Teso-Fz-Betono, Daniel
    Zulueta, Ekaitz
    Sanchez-Chica, Ander
    Fernandez-Gamiz, Unai
    Teso-Fz-Betono, Adrian
    Manuel Lopez-Guede, Jose
    LOGIC JOURNAL OF THE IGPL, 2023, 31 (04) : 634 - 647
  • [23] Multi-objective Cuckoo Algorithm for Mobile Devices Network Architecture Search
    Zhang, Nan
    Wang, Jianzong
    Yang, Jian
    Qu, Xiaoyang
    Xiao, Jing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 312 - 324
  • [24] Mobile search - Social network search using mobile devices
    Tiago, Pedro
    Kotilainen, Niko
    Vapa, Mikko
    Kokkinen, Heikki
    Nurminen, Jukka K.
    2008 5TH IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1-3, 2008, : 1201 - +
  • [25] Contrastive Neural Architecture Search with Neural Architecture Comparators
    Chen, Yaofo
    Guo, Yong
    Chen, Qi
    Li, Minli
    Zeng, Wei
    Wang, Yaowei
    Tan, Mingkui
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9497 - 9506
  • [26] An architecture for secure mobile devices
    Mayrhofer, Rene
    SECURITY AND COMMUNICATION NETWORKS, 2015, 8 (10) : 1958 - 1970
  • [27] Media Search in Mobile Devices
    Gilbert, Mazin
    Acero, Alex
    Cohen, Jordan
    Bourlard, Herve
    Chang, Shih-Fu
    Etoh, Minoru
    IEEE SIGNAL PROCESSING MAGAZINE, 2011, 28 (04) : 12 - U13
  • [28] HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices
    Zhou, Ao
    Yang, Jianlei
    Qi, Yingjie
    Qiao, Tong
    Shi, Yumeng
    Duan, Cenlin
    Zhao, Weisheng
    Hu, Chunming
    IEEE Transactions on Computers, 2024, 73 (12) : 2693 - 2707
  • [29] TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices
    Loni, Mohammad
    Mousavi, Hamid
    Riazati, Mohammad
    Daneshtalab, Masoud
    Sjodin, Mikael
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 1115 - 1118
  • [30] HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices
    Mecharbat, Lotfi Abdelkrim
    Benmeziane, Hadjer
    Ouarnoughi, Hamza
    Niar, Smail
    PROCEEDINGS 2023 IEEE/ACM INTERNATIONAL WORKSHOP ON COMPILERS, DEPLOYMENT, AND TOOLING FOR EDGE AI, CODAI 2023, 2023, : 41 - 45