But starcraft training is not through mimicking human strategies - it was pure R... | Hacker News


Mentlo 18 days ago | on: Claude Code Unpacked : A visual guide

But StarCraft training did not work by mimicking human strategies - it was pure RL with a reward function shaped around winning, which allows non-human and eventually super-human strategies (such as worker oversaturation) to emerge. The current training loop for coding is RL as well - so a departure from human coding patterns is not unexpected (even if a departure from human coding structure is unexpected, as that would require development of a new coding language).

syphia 10 days ago

AlphaStar (2019) was refined through self-play but was initially trained on human data. I don't know of any other high-level StarCraft AI, but if you do, let me know.
