Apostol Natsev, Alexander Haubold, et al.
MMSP 2007
Several illustrations of a general technique called the Algorithm and Architecture approach was presented. The programmer controlled unrolling of loops was demonstrated equivalent to customized vectorization of RISC-type code. Its use was illustrated to show that RS/6000 processors could compute the distribution (-1, 1) at the rate of 3.25 multiply-adds. A linear congruential generators, related to the multiplicative congruential generators was also specified.
Apostol Natsev, Alexander Haubold, et al.
MMSP 2007
Bowen Zhou, Bing Xiang, et al.
SSST 2008
Fan Jing Meng, Ying Huang, et al.
ICEBE 2007
David S. Kung
DAC 1998