Hans Becker, Frank Schmidt, et al.
Photomask and Next-Generation Lithography Mask Technology 2004
A 16-way cache-coherent nonuniform memory access (ccNUMA) Intel system was built from four commodity four-processor Fujitsu Teamserver SMPs connected by a Synfinity cache-coherent switch. Results from a performance-evaluation study confirm the success of the combined hardware/software approach to performance tuning for computation-intensive workloads. The results also show that poor local-memory bandwidth in commodity Intel-based systems is often the main contributor to poor scalability and performance.
Charles H. Bennett, Aram W. Harrow, et al.
IEEE Trans. Inf. Theory
Zohar Feldman, Avishai Mandelbaum
WSC 2010
Joel L. Wolf, Mark S. Squillante, et al.
IEEE Transactions on Knowledge and Data Engineering