nicodemus · 4w
This is true: GPUs are faster for inference. But you'll also be consuming 1500 watts, have to deal with the thermal issues, and still struggle to fit a model larger than 32B with decent quantization. Alternatively, the 395 chips and their NPU are doing pretty well. Combine 2 of them and you're lo...
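The "larger than 32B" ceiling falls out of simple memory arithmetic. A minimal sketch of that back-of-envelope math (my own illustration, not from the comment; the 10% runtime overhead is an assumption, and the KV cache is ignored):

```python
# Rough estimate of weight memory at common quantization levels,
# to see why ~32B is about the ceiling for a single consumer GPU.

def weights_gb(params_b: float, bits_per_weight: float, overhead: float = 1.1) -> float:
    """Approximate weight memory in GB; `overhead` covers runtime buffers (assumed 10%)."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9 * overhead

for params in (7, 14, 32, 70):
    for bits in (4, 5, 8):
        print(f"{params}B @ {bits}-bit ~= {weights_gb(params, bits):.1f} GB")
```

At 4-bit, 32B weights come to roughly 18 GB, just squeezing into a 24 GB card before the KV cache eats the rest; a 70B model at the same quantization needs close to 40 GB and simply doesn't fit, which matches the comment's point.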