> Spinning up a GPU seems like something to avoid. You can do inference on GPUs ... | Hacker News

lxgr on Sept 13, 2024 | on: Exploring the scalable matrix extension of the App...

> Spinning up a GPU seems like something to avoid.

You can do inference on GPUs as well, and for anything other than very small/lightweight models, such as noise cancellation or maybe speech recognition, it's probably worth the initial overhead.

I believe Core ML already splits workloads between CPU, NPU, and GPU as appropriate.

jhugo on Sept 14, 2024

It’s likely not worth the additional energy usage though, at least when running on battery.

bee_rider on Sept 14, 2024

Yeah, this is what I was getting at. In some sense, the list of “capabilities which don’t require spinning up the GPU” is expanded. Whether something could be done by spinning up the GPU is beside the point.
