Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

According to my understanding they are referring to parameter count. If we go by that logic, BERT has 340M parameters. GPT3 has 175B. So this will have 340B parameters?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: