I wrote a C++ translation of it: https://github.com/verma7/microgpt/blob/main/mi...

WithinReason · 2026-03-01T09:21:35 1772356895

I made an explicit reverse pass (no autodiff), it was 8x faster in Python

bear3r · 2026-03-02T05:13:02 1772428382

tradeoff worth naming: you avoid the autodiff graph overhead (hence the speedup), but any architecture change means rewriting every gradient by hand. fine for a pedagogical project, but that's exactly why autodiff exists.

hu3 · 2026-03-02T04:56:42 1772427402

I made an explicit double-reverse pass (no code!), it was 80x faster in my head!

spopejoy · 2026-03-02T14:20:21 1772461221

"I've got an ipod -- In My Mind"

https://theonion.com/i-have-an-ipod-in-my-mind-1819584018/

WithinReason · 2026-03-02T21:26:22 1772486782

code here, it's just not interesting to look at:

https://news.ycombinator.com/item?id=47220542

love2read · 2026-03-01T23:05:08 1772406308

Can you share a link?

WithinReason · 2026-03-02T16:54:14 1772470454

https://www.ideone.com/VAz4Nn

Doesn't run inside IDEone due to the external download link, but you can copy&paste the code over