Optimization model for memory bandwidth usage in X-ray image enhancement

Cited by: 0
Authors
Albers, Rob [1, 2]
Suijs, Eric [2]
de With, Peter H. N. [1, 3]
Affiliations
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Philips Med Syst CardioVasc X Ray, Best, Netherlands
[3] LogicaCMG Nederland BV, TSE3, Eindhoven, Netherlands
Source
REAL-TIME IMAGE PROCESSING 2008 | 2008 / Vol. 6811
Keywords
bandwidth reduction; low latency; multi-core; memory optimization; X-ray; image enhancement
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
In cardiovascular minimally invasive interventions, physicians require low-latency X-ray imaging applications, because their actions must be directly visible on the screen. The image-processing system should enable the simultaneous execution of multiple functions. Because dedicated hardware lacks flexibility, there is growing interest in using off-the-shelf computer technology. Because memory bandwidth is a scarce resource, we focus on optimization methods for bandwidth reduction within multiprocessor systems at the chip level. We create a practical, realistic model of the compute and memory bandwidth required for a given set of image-processing functions. Similar modeling is applied to the available system resources. We concentrate in particular on X-ray image processing based on multi-resolution decomposition, noise reduction and image-enhancement techniques. We derive formulas with which we can optimize the mapping of the application onto processors, cache and memory for different configurations. The data-block granularity is matched to the memory hierarchy, so that caching is optimized for low latency. More specifically, we exploit the locality of the signal-processing functions to streamline the memory communication. A substantial performance improvement is realized by a new memory-communication model that incorporates the data dependencies of the image-processing functions. Results show a memory-bandwidth reduction on the order of 60% and a latency reduction on the order of 30-60% compared to straightforward implementations.
Pages: 12
Related papers
27 in total
  • [1] ALBERS R, 2007, SPIE P, V6509
  • [2] AUVRAY V, 2005, IMAGE PROCESSING 200, V1
  • [3] BEYLS K, 2004, THESIS
  • [4] THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE
    BURT, PJ
    ADELSON, EH
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) : 532 - 540
  • [5] Memory performance optimizations for real-time software HDTV decoding
    Chen, H
    Li, K
    Wei, B
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2005, 41 (02) : 193 - 207
  • [6] A Framework for System-Level Modeling and Simulation of Embedded Systems Architectures
    Erbas, Cagkan
    Pimentel, Andy D.
    Thompson, Mark
    Polstra, Simon
    [J]. EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2007, (01)
  • [7] FARIN D, 2007, P SPIE, V6496
  • [8] Gerber R., 2005, SOFTWARE OPTIMIZATIO
  • [9] Real-time implementation of a multi-resolution motion compensating temporal filter on general-purpose hardware
    Groth, A
    Eck, K
    [J]. Real-Time Imaging IX, 2005, 5671 : 132 - 141
  • [10] Hennessy John L., 2003, COMPUTER ARCHITECTUR