The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries
Published as an arXiv preprint, 2025
The Fused Kernel Library (FKL) provides a C++ API for developing highly efficient fused GPU kernels, which can significantly improve the performance of GPU-accelerated applications.
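To make the idea of fusion concrete, here is a minimal CPU-only sketch in standard C++. It is not FKL's API, and every name in it (`Scale`, `Offset`, `apply_chain`, `transform_fused`, `transform_single`) is hypothetical. It assumes simple element-wise operations and only illustrates the general technique: a chain of operations is composed at compile time and applied in a single pass, so intermediate values stay in a local variable instead of being written to and re-read from a buffer between steps.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical element-wise operations; the names are illustrative only
// and are not taken from FKL.
struct Scale {
    float factor;
    float operator()(float x) const { return x * factor; }
};

struct Offset {
    float value;
    float operator()(float x) const { return x + value; }
};

// Base case: no operations left, return the value unchanged.
inline float apply_chain(float x) { return x; }

// Recursive case: apply the first operation, then the remaining ones in order.
template <typename Op, typename... Rest>
float apply_chain(float x, const Op& op, const Rest&... rest) {
    return apply_chain(op(x), rest...);
}

// Fused version: a single pass over the input. Each element is read once,
// run through the whole chain, and written once.
template <typename... Ops>
std::vector<float> transform_fused(const std::vector<float>& in, const Ops&... ops) {
    std::vector<float> out(in.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = apply_chain(in[i], ops...);
    }
    return out;
}

// Unfused baseline: one pass per operation. Chaining calls to this function
// materializes a full intermediate buffer between every pair of steps.
template <typename Op>
std::vector<float> transform_single(const std::vector<float>& in, const Op& op) {
    std::vector<float> out(in.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = op(in[i]);
    }
    return out;
}
```

Calling `transform_fused(in, Scale{2.0f}, Offset{1.0f})` gives the same values as `transform_single(transform_single(in, Scale{2.0f}), Offset{1.0f})`, but it makes one pass over the data instead of two. On a GPU the analogous saving would be fewer kernel launches and fewer round trips through global memory.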
Recommended citation: O. Amoros, A. Andaluz, J. Nunez, A.J. Pena. (2025). "The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries." arXiv preprint arXiv:2508.07071.
