The task of mapping a nested loop algorithm onto a multidimensional systolic array is considered. A buffer structure for the processing elements (PEs) that allows the data tokens to arrive at the PE earlier than when they are needed is proposed. Necessary and sufficient conditions for valid mappings using this buffer structure are then given. A refinement technique for deriving efficient statement level mappings from iteration level mappings is then proposed.<
Published in:
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
(Volume:1
,
Issue:
4
)
Date of Publication: Dec. 1993