Paul J. Steinhardt, P. Chaudhari
Journal of Computational Physics
Techniques and algorithms for efficient in-place conversion to and from standard and blocked matrix storage formats are described. Such functionality is required by numerical libraries that use different data layouts internally. Parallel algorithms and a software package for in-place matrix storage format conversion based on in-place matrix transposition are presented and evaluated. A new algorithm for in-place transposition which efficiently determines the structure of the transposition permutation a priori is one of the key ingredients. It enables effective load balancing in a parallel environment. © 2012 ACM.
Paul J. Steinhardt, P. Chaudhari
Journal of Computational Physics
Kenneth L. Clarkson, K. Georg Hampel, et al.
VTC Spring 2007
Shashanka Ubaru, Lior Horesh, et al.
Journal of Biomedical Informatics
Ronen Feldman, Martin Charles Golumbic
Ann. Math. Artif. Intell.