SSpMM: Efficiently Scalable SpMM Kernels Across Multiple Generations of Tensor Cores

Published in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025

Recommended citation: Zeyu Xue, Mei Wen, Jianchao Yang, Minjin Tang, Zhongdi Luo, Jing Feng, Yang Shi, Zhaoyun Chen, Junzhong Shen and Johannes Langguth. SSpMM: Efficiently Scalable SpMM Kernels Across Multiple Generations of Tensor Cores[J]. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025.
Download Paper | Download Slides