Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There's the nominal context length, and the effective one. You need a benchmark like the needle-in-a-haystack or RULER to determine the latter.

https://github.com/NVIDIA/RULER



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: