ryujin470@fedia.io to Technology@beehaw.org · 26 days ago
OpenAI releases a free GPT model that can run on your laptop (www.theverge.com)
Nate@piefed.alphapuggle.dev · 26 days ago
*if you have a laptop with 16 GB of VRAM. Otherwise you'll be watching Ollama hit your CPU for 5 minutes with no output
Seefra 1@lemmy.zip · 26 days ago
Isn't that true for most models until someone distills and quantizes them so they can run on common hardware?
fuckwit_mcbumcrumble@lemmy.dbzer0.com · 26 days ago
This is the internet, we're only allowed to be snarky here.
Ghoelian@lemmy.dbzer0.com · 26 days ago (edited)
I mean yeah, but that doesn't make the title any more true.
CyberSeeker@discuss.tchncs.de · 25 days ago
Yes, but 20 billion parameters is too much for most GPUs, regardless of quantization. You would need at least 14 GB, and even that's unlikely without offloading major parts to the CPU and system RAM (which kills the token rate).
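The VRAM figures in the thread come from simple back-of-the-envelope math: parameter count times bytes per parameter, before any allowance for the KV cache and activations. A minimal sketch of that estimate (the quantization levels shown are illustrative, not what any particular gpt-oss build ships with):

```python
# Rough lower bound on memory needed just to hold a model's weights.
# Real usage is higher: KV cache, activations, and runtime overhead
# add several GB on top, which is why a 4-bit 20B model still wants
# ~14-16 GB rather than the ~10 GB the weights alone would suggest.
def weights_gb(params_billion: float, bits_per_param: int) -> float:
    """Gigabytes required for the weights at a given quantization width."""
    return params_billion * 1e9 * bits_per_param / 8 / 1e9

for label, bits in [("fp16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"20B @ {label}: ~{weights_gb(20, bits):.0f} GB of weights")
# 20B @ fp16:  ~40 GB
# 20B @ 8-bit: ~20 GB
# 20B @ 4-bit: ~10 GB
```

Even at 4-bit, the weights alone roughly fill a mid-range GPU, consistent with the comment above that anything less than ~14 GB forces offloading to CPU and system RAM.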