Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I made an explicit reverse pass (no autodiff), it was 8x faster in Python
 help



I made an explicit double-reverse pass (no code!), it was 80x faster in my head!

tradeoff worth naming: you avoid the autodiff graph overhead (hence the speedup), but any architecture change means rewriting every gradient by hand. fine for a pedagogical project, but that's exactly why autodiff exists.

Can you share a link?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: