The Floating-Point Unit of the Jaguar x86 Core

被引:16
|
作者
Rupley, Jeff [1 ]
King, John [1 ]
Quinnell, Eric [1 ]
Galloway, Frank [1 ]
Patton, Ken [1 ]
Seidel, Peter-Michael [1 ]
Dinh, James [1 ]
Bui, Hai [1 ]
Bhowmik, Anasua [1 ]
机构
[1] AMD Austin & Bangalore, Bangalore, Karnataka, India
来源
2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH) | 2013年
关键词
AMD Jaguar; floating-point unit; x87; SSE; AVX; MMX; AES; CLMUL; F16C; industry implementation;
D O I
10.1109/ARITH.2013.24
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The AMD Jaguar x86 core uses a fully-synthesized, 128-bit native floating-point unit (FPU) built as a co-processor model. The Jaguar FPU supports several x86 ISA extensions, including x87, MMX, SSE1 through SSE4.2, AES, CLMUL, AVX, and F16C instruction sets. The front end of the unit decodes two complex operations per cycle and uses a dedicated renamer (RN), free list (FL), and retire queue (RQ) for in-order dispatch and retire. The FPU issues to the execution units with a dedicated out-of-order, dual-issue scheduler. Execution units source operands from a synthesized physical register file (PRF) and bypass network. The back end of the unit has two execution pipes: the first pipe contains a vector integer ALU, a vector integer MUL unit, and a floating-point adder (FPA); the second pipe contains a vector integer ALU, a store-convert unit, and a floating-point iterative multiplier (FPM). The implementation of the unit focused on low-power design and on vectorized single-precision (SP) performance optimizations. The verification of the unit required complex pseudo-random and formal verification techniques. The Jaguar FPU is built in a 28nm CMOS process.
引用
收藏
页码:7 / 16
页数:10
相关论文
共 50 条
  • [1] The floating-point unit of the Jaguar x86 core
    Rupley, Jeff
    King, John
    Quinnell, Eric
    Galloway, Frank
    Patton, Ken
    Seidel, Peter-Michael
    Dinh, James
    Bui, Hai
    Bhowmik, Anasua
    Proceedings - Symposium on Computer Arithmetic, 2013, : 7 - 16
  • [2] Secure, Precise, and Fast Floating-Point Operations on x86 Processors
    Rane, Ashay
    Lin, Calvin
    Tiwari, Mohit
    PROCEEDINGS OF THE 25TH USENIX SECURITY SYMPOSIUM, 2016, : 71 - 86
  • [3] Floating-Point Division Algorithms for an x86 Microprocessor with a Rectangular Multiplier
    Schulte, Michael J.
    Tan, Dimitri
    Lemonds, Carl E.
    2007 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, VOLS, 1 AND 2, 2007, : 304 - +
  • [4] Formal Verification of an Iterative Low-Power x86 Floating-Point Multiplier with Redundant Feedback
    Seidel, Peter-Michael
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2011, (70): : 70 - 83
  • [5] "JAGUAR" AMD's Next Generation Low Power x86 Core
    Rupley, Jeff
    2012 IEEE HOT CHIPS 24 SYMPOSIUM (HCS), 2012,
  • [6] Fused Floating-Point Magnitude Unit
    Min, Jae Hong
    Swartzlander, Earl E., Jr.
    2013 IEEE 56TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2013, : 1383 - 1386
  • [7] THE SUPRENUM VECTOR FLOATING-POINT UNIT
    KAMMER, H
    PARALLEL COMPUTING, 1988, 7 (03) : 315 - 323
  • [8] A FLOATING-POINT RESIDUE ARITHMETIC UNIT
    TAYLOR, FJ
    HUANG, CH
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1981, 311 (01): : 33 - 53
  • [9] Efficient Floating-Point Givens Rotation Unit
    Hormigo, Javier
    Munoz, Sergio D.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (05) : 2419 - 2442
  • [10] A combined interval and floating-point reciprocal unit
    Kucukkabak, Umut
    Akkas, Ahmet
    2005 39TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2005, : 1366 - 1371