Khalid Abdulla, Andrew Wirth, et al.
ICIAfS 2014
In this paper, we present several algorithms for performing all-to-many personalized communication on distributed memory parallel machines. We assume that each processor sends a different message (of potentially different size) to a subset of all the processors involved in the collective communication. The algorithms are based on decomposing the communication matrix into a set of partial permutations. We study the effectiveness of our algorithms from both the view of static scheduling and runtime scheduling. © 1995 Academic Press, Inc.
Khalid Abdulla, Andrew Wirth, et al.
ICIAfS 2014
Hannah Kim, Celia Cintas, et al.
IJCAI 2023
Ismail Akhalwaya, Shashanka Ubaru, et al.
ICLR 2024
Chen-chia Chang, Wan-hsuan Lin, et al.
ICML 2025