I get the impression most llama.cpp users are interested in running models on GP...

leblancfg on May 16, 2024 | parent | context | favorite | on: New exponent functions that make SiLU and SoftMax ...

I get the impression most llama.cpp users are interested in running models on GPU. AFAICT this optimization is CPU-only. Don't get me wrong – a huge one! – and opens the door to running llama.cpp on more and more edge devices.