Hacker News
PragmaticPulp on April 6, 2023 | on: Using mmap to make LLaMA load faster
The initial claims about memory savings and sparse models weren’t correct at all. This was immediately clarified by people who tested it, but the headline had already moved toward the top of HN.