Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Last time I tried Gemma 4 (26B-A4B) its memory usage would balloon and consume all of my swap until my machine died.

Qwen 3.6 on the other hand barely uses any memory at all for its KV cache.

 help



Turns out when you block people from the best and biggest hardware, they get innovative. It reminds me of the Pentium days when everyone was shipping inefficient programs because the processor would be better next year.

we never stopped doing that!



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: