Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

One of the main author here, the readme isn't really well up-to-date. We have our own gemm implementation based on CubeCL. It's still moving a lot, but we support tensor cores, use warp operations (Plane Operations in CubeCL), we even added TMA instructions for CUDA.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: