When you already have a GPU in a system, adding tensor cores to it is much more ...

jasonwatkinspdx · 2026-03-03T22:24:28 1772576668

No, if that were the case, then Google would have made GPUs + NN cores vs TPUs.

There's far more microarchitectural complexity in GPUs that actually isn't efficient for NN structures.

"Systolic array" actually means something more specific than "repeated structures on a die."

Again, I'd suggest referencing the various HotChips presentations. It's a really interesting topic area. Or the original TPU v1 paper for the basics.

0-_-0 · 2026-03-07T07:25:45 1772868345

Why would Google need graphics functionality to train neural networks?