共 44 条
[1]
An Adaptive Performance Modeling Tool for GPU Architectures
[J].
PPOPP 2010: PROCEEDINGS OF THE 2010 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING,
2010,
:105-114
[2]
PredJoule: A Timing-Predictable Energy Optimization Framework for Deep Neural Networks
[J].
2018 39TH IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2018),
2018,
:107-118
[3]
Estimating the WCET of GPU-Accelerated Applications using Hybrid Analysis
[J].
PROCEEDINGS OF THE 2013 25TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS (ECRTS 2013),
2013,
:193-202
[4]
Chetlur S., 2014, CUDNN EFFICIENT PRIM
[5]
The Cityscapes Dataset for Semantic Urban Scene Understanding
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:3213-3223
[6]
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
[J].
MOBICOM'18: PROCEEDINGS OF THE 24TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING,
2018,
:115-127
[7]
Girshick R., 2018, Detectron
[9]
Rich feature hierarchies for accurate object detection and semantic segmentation
[J].
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2014,
:580-587
[10]
Grubb Alex, 2012, P 15 INT C ARTIFICIA, P458