GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering

被引:0
作者
Chen, Longlong [1 ]
Zhu, Jianfeng [1 ]
Peng, Guiqiang [1 ]
Liu, Mingxu [2 ]
Wei, Shaojun [1 ]
Liu, Leibo [1 ]
机构
[1] Tsinghua Univ, Sch Integrated Circuits, Beijing 100084, Peoples R China
[2] Beijing Superstring Acad Memory Technol, Dept DRAM, Beijing 100176, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Data-centric computing; dividing and scattering; genome assembly; near memory computing; read mapping; reconfigurable computing; SEQUENCE; CLASSIFICATION; ARCHITECTURE; ALIGNMENT;
D O I
10.1109/TPDS.2023.3309462
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Read mapping, which maps billions of reads to a reference DNA, poses a significant performance bottleneck in genomic analysis. Current accelerators for read mapping are primarily bounded by the intensive and random memory access to huge datasets. Near-data processing (NDP) infrastructures are promising to provide extremely high bandwidth. However, existing frameworks failed to reach this potential due to poor locality and high redundancy. Our idea is to introduce prediction under the insight that candidate mapping positions become predictable when the reference is organized in coarse-grain slices. We present GEM (Genomic Memory), an ultra-efficient near-memory accelerator for read mapping. GEM adopts a novel data-centric framework, named dividing-and-predictive-scattering (DPS), which synthesizes information of seed existence to predict the target mapping locations to reduce memory access redundancy. During preparation, DPS divides the reference into coarse-grained slices and creates predictive filters to assess the likelihood of reads belonging to each slice. During mapping, DPS predicts and scatters reads to considerably fewer slices compared than without prediction. By employing small on-chip SRAM-based predictors with high accuracy, DPS minimizes unnecessary DRAM access and data movement from remote memory. In essence, DPS trades pre-seeding predictors for localized access patterns and low redundancy, hence achieving high throughput for data-intensive applications. We implement GEM by integrating coarse-grain reconfigurable architectures (CGRAs) in the logic layer of a 3D-stacked DRAM infrastructure, utilizing the massive banks as slices. GEM leverages CGRAs for their flexibility in supporting various algorithms tailored to different datasets. Bloom filters are leveraged for slice prediction, providing an error rate below 1%. Evaluation results demonstrate that GEM reduces memory requests by 95% and alignments by 87%, achieving a throughput improvement of 15.3 x and 11.0 x compared to compute-centric and broadcast-based baselines on the same NDP platform. Overall, GEM achieves a 3.5 x 3.5 x throughput improvement and 2.1 x 2.1 x energy efficiency compared to state-of-the-art ASIC accelerators.
引用
收藏
页码:3059 / 3072
页数:14
相关论文
共 82 条
[1]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[2]   Personalized copy number and segmental duplication maps using next-generation sequencing [J].
Alkan, Can ;
Kidd, Jeffrey M. ;
Marques-Bonet, Tomas ;
Aksay, Gozde ;
Antonacci, Francesca ;
Hormozdiari, Fereydoun ;
Kitzman, Jacob O. ;
Baker, Carl ;
Malig, Maika ;
Mutlu, Onur ;
Sahinalp, S. Cenk ;
Gibbs, Richard A. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, 41 (10) :1061-U29
[3]   Accelerating Genome Analysis: A Primer on an Ongoing Journey [J].
Alser, Mohammed ;
Bingol, Zulal ;
Cali, Damla Senol ;
Kim, Jeremie ;
Ghose, Saugata ;
Alkan, Can ;
Mutlu, Onur .
IEEE MICRO, 2020, 40 (05) :65-75
[4]   Shouji: a fast and efficient pre-alignment filter for sequence alignment [J].
Alser, Mohammed ;
Hassan, Hasan ;
Kumar, Akash ;
Mutlu, Onur ;
Alkan, Can .
BIOINFORMATICS, 2019, 35 (21) :4255-4263
[5]   Opportunities and challenges in long-read sequencing data analysis [J].
Amarasinghe, Shanika L. ;
Su, Shian ;
Dong, Xueyi ;
Zappia, Luke ;
Ritchie, Matthew E. ;
Gouil, Quentin .
GENOME BIOLOGY, 2020, 21 (01)
[6]   AlignS: A Processing-In-Memory Accelerator for DNA Short Read Alignment Leveraging SOT-MRAM [J].
Angizi, Shaahin ;
Sun, Jiao ;
Zhang, Wei ;
Fan, Deliang .
PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
[7]  
[Anonymous], 2015, DNA sequencing costs: Data from the NHGRI large-scale genome sequencing program
[8]  
[Anonymous], 2019, IBMWhite Paper
[9]   Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics [J].
Ardui, Simon ;
Ameur, Adam ;
Vermeesch, Joris R. ;
Hestand, Matthew S. .
NUCLEIC ACIDS RESEARCH, 2018, 46 (05) :2159-2168
[10]   Towards precision medicine [J].
Ashley, Euan A. .
NATURE REVIEWS GENETICS, 2016, 17 (09) :507-522