Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
From 800ms to ~25ms: harness-driven optimization of a CUDA matmul kernel (github.com/yupenghan)
3 points by icyace 11 hours ago | past | discuss

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: