Hacker Newsnew | past | comments | ask | show | jobs | submit | icyace's submissionslogin
1.From 800ms to ~25ms: harness-driven optimization of a CUDA matmul kernel (github.com/yupenghan)
3 points by icyace 5 days ago | past | discuss

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: