SSpMM: Efficiently Scalable SpMM Kernels Across Multiple Generations of Tensor Cores
Published in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025
Zeyu Xue, Mei Wen, Jianchao Yang, Minjin Tang, Zhongdi Luo, Jing Feng, Yang Shi, Zhaoyun Chen, Junzhong Shen and Johannes Langguth. SSpMM: Efficiently Scalable SpMM Kernels Across Multiple Generations of Tensor Cores[J]. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025.
