Array-Aware Neural Architecture Search

Cited by: 3
Authors
Chitty-Venkata, Krishna Teja [1 ]
Somani, Arun K. [1 ]
Affiliations
[1] Iowa State University, Department of Electrical and Computer Engineering, Ames, IA 50011, USA
Source
2021 IEEE 32nd International Conference on Application-Specific Systems, Architectures and Processors (ASAP 2021), 2021
Keywords
Deep Convolutional Neural Networks; Array; Accelerators; Neural Architecture Search
DOI
10.1109/ASAP52443.2021.00026
CLC Number
TP3 [Computing Technology, Computer Technology]
Discipline Code
0812
Abstract
Convolutional Neural Networks (CNNs) have exceeded human accuracy in many Computer Vision tasks, such as Image Classification, Object Detection, and Image Segmentation. This advancement is due initially to the efficient manual design of CNNs, followed by automated design through Neural Architecture Search (NAS). In parallel with neural network design, accelerator hardware such as Google's Tensor Processing Unit (TPU) and Eyeriss has advanced for the efficient processing of CNN forward propagation. The heart of these accelerators is an array processor (Systolic Array) of fixed dimensions, which limits the amount of CNN computation that can be carried out in a single clock cycle. While NAS is able to produce efficient neural architectures, the networks need to be co-designed with respect to the underlying array dimensions to obtain the best performance. In this paper, we introduce "Array Aware Neural Architecture Search" to automatically design efficient CNNs for a fixed array-based neural network accelerator. Previous Hardware-Aware NAS methods consider a fixed search space for different hardware platforms and search within that predefined space. We instead construct the search space based on the underlying hardware array dimensions to design more efficient CNN architectures for optimal performance. We observe that our proposed NAS methods on the CIFAR-10 dataset produce accuracy similar to the baseline network while saving a substantial number of cycles on the array.
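To make the cycle-count argument concrete, the following minimal Python sketch estimates the compute cycles of one convolution layer on a fixed systolic array by counting how many array-sized tiles the layer's weight matrix requires. The function name conv_cycles_on_array and the weight-stationary-style mapping (input channels times kernel window on the rows, output filters on the columns, pipeline fill/drain ignored) are illustrative assumptions, not the exact cost model of the paper.

import math

def conv_cycles_on_array(in_ch, out_ch, kh, kw, out_h, out_w,
                         array_rows, array_cols):
    # Estimate compute cycles for one conv layer on a fixed
    # array_rows x array_cols systolic array (illustrative model).
    # Fold the kernel window into the row (reduction) dimension.
    rows_needed = in_ch * kh * kw
    # One array column per output filter.
    cols_needed = out_ch
    # Tiles required when the layer exceeds the physical array size.
    row_tiles = math.ceil(rows_needed / array_rows)
    col_tiles = math.ceil(cols_needed / array_cols)
    # Each tile pass produces one output pixel's partial results per
    # cycle once the pipeline fills; fill/drain overhead is ignored.
    passes = row_tiles * col_tiles
    return passes * out_h * out_w

# Example: a 3x3 conv from 64 channels, 32x32 output, on a 128x128 array.
print(conv_cycles_on_array(64, 96, 3, 3, 32, 32, 128, 128))   # 5120 cycles
print(conv_cycles_on_array(64, 128, 3, 3, 32, 32, 128, 128))  # 5120 cycles

Under these assumptions, a layer with 96 output filters occupies only 96 of the 128 array columns yet costs the same 5120 cycles as one with 128 filters; this is exactly the kind of slack an array-aware search space can exploit when choosing layer widths.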
Pages: 125-132
Number of Pages: 8
References
17 items in total
[1] Anonymous. CoRR, abs/1811.02883, 2018.
[2] Cai, H. arXiv:1812.00332, 2018.
[3] Chen, Yu-Hsin; Emer, Joel; Sze, Vivienne. Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks. 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), 2016, pp. 367-379.
[4] Choi, K. arXiv preprint, 2020.
[5] Gupta, Suyog. arXiv preprint, 2020.
[6] Jouppi, Norman P.; Young, Cliff; Patil, Nishant; Patterson, David; et al. In-Datacenter Performance Analysis of a Tensor Processing Unit. 44th Annual International Symposium on Computer Architecture (ISCA 2017), 2017, pp. 1-12.
[7] Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. ImageNet Classification with Deep Convolutional Neural Networks. Communications of the ACM, 2017, 60(6), pp. 84-90.
[8] Krizhevsky, Alex. cs.toronto.edu, 2009.
[9] Lin, Yujun. NeurIPS Workshop, 2019.
[10] Liu, H. Proceedings of the International Conference on Learning Representations, 2018.