Merging VLIW and vector processing techniques for a simple, high-performance processor architecture

Author
Soliman, Mostafa I. [1, 2]
Affiliations
[1] Taibah Univ, Comp Sci & Informat Dept, Community Coll, Al Madinah Al Munawwarah 2898, Saudi Arabia
[2] Aswan Univ, Dept Elect Engn, Comp & Syst Sect, Fac Engn, Aswan 81542, Egypt
Keywords
Data-parallel applications; VLIW; Vector processing; VHDL; Performance evaluation; SUPERSCALAR; LEVEL; CORE
DOI
10.1016/j.mejo.2015.03.012
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
This paper proposes a new processor architecture called VVSHP for accelerating data-parallel applications, which are growing in importance and demanding increased performance from hardware. VVSHP merges VLIW and vector processing techniques for a simple, high-performance processor architecture. One key point of VVSHP is that the multiple scalar instructions within a VLIW and the vector instructions execute on unified parallel execution datapaths. Another key point is reducing the complexity of VVSHP with a two-part register file: (1) a shared scalar-vector part with eight read ports and four write ports, holding 64 x 32-bit registers (64 scalar or 16 x 4 vector registers) for scalar/vector data, and (2) a vector part with two read ports and one write port, holding 48 vector registers that each store 4 x 32-bit vector data. Moreover, processing vector data with lengths varying from 1 to 256 is a key point for reducing loop overheads. VVSHP can issue up to four scalar/vector operations in each cycle, processing a set of operands in parallel and producing up to four results to be written back into the VVSHP register file. However, it cannot issue more than one memory operation at a time; each memory operation loads/stores 128-bit scalar/vector data from/to data memory. The proposed VVSHP processor is implemented in VHDL targeting the Xilinx Virtex-5 FPGA, and its performance is evaluated. (C) 2015 Elsevier Ltd. All rights reserved.
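The figures quoted in the abstract can be checked with a little arithmetic. The Python sketch below is illustrative only and is not taken from the paper: the constant and function names are hypothetical, and the cycle count assumes an idealized model in which each of the four issue slots handles one 32-bit element per cycle, with no stalls and no memory bottleneck.

```python
from math import ceil

# Figures taken from the abstract.
LANES = 4                # up to four scalar/vector operations issued per cycle
ELEMENT_BITS = 32        # width of one register / vector element
MAX_VECTOR_LENGTH = 256  # vector lengths vary from 1 to 256
SHARED_REGS = 64         # shared scalar-vector part: 64 x 32-bit registers
VECTOR_REGS = 48         # vector part: 48 registers, each 4 x 32-bit


def register_file_bits():
    """Return (shared_part_bits, vector_part_bits) for the two-part register file."""
    shared = SHARED_REGS * ELEMENT_BITS
    vector = VECTOR_REGS * LANES * ELEMENT_BITS
    return shared, vector


def ideal_issue_cycles(vector_length):
    """Lower bound on issue cycles for one vector instruction.

    Assumption (not from the paper): every cycle processes a full group of
    LANES elements, so the count is ceil(vector_length / LANES).
    """
    if not 1 <= vector_length <= MAX_VECTOR_LENGTH:
        raise ValueError("vector length must be between 1 and 256")
    return ceil(vector_length / LANES)


if __name__ == "__main__":
    print(register_file_bits())
    print(ideal_issue_cycles(256))
```

Under these assumptions, a single vector instruction of maximum length stands in for 64 four-element groups, which is one way to read the abstract's claim that long vectors reduce loop overheads. The 128-bit memory operation matches one such group (4 x 32 bits).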
Pages: 637-655
Page count: 19