In this paper, it is shown that the 2-D 1-multiply 1-add structure, i.e. a structure with minimum cycle time introduced by Gnanasekaran [1] can be implemented without shift registers, but with a RAM-implementation of the element z1(-1) z2(+1). A systolic realization for this structure is developed which is suited for VLSI. It is shown that the systolic realization can be extended to higher order simply by adding processing elements (PEs).