Beomseok Nam, Henrique Andrade, et al.
ACM/IEEE SC 2006
Several illustrations of a general technique called the Algorithm and Architecture approach was presented. The programmer controlled unrolling of loops was demonstrated equivalent to customized vectorization of RISC-type code. Its use was illustrated to show that RS/6000 processors could compute the distribution (-1, 1) at the rate of 3.25 multiply-adds. A linear congruential generators, related to the multiplicative congruential generators was also specified.
Beomseok Nam, Henrique Andrade, et al.
ACM/IEEE SC 2006
Michael C. McCord, Violetta Cavalli-Sforza
ACL 2007
Corneliu Constantinescu
SPIE Optical Engineering + Applications 2009
Maurice Hanan, Peter K. Wolff, et al.
DAC 1976