m4d4m · 2w
Afaik it won't help much: if the model has to be split between VRAM and anything else, it's slow. The KV cache for the context adds to the load as well. Iirc the inference, in case it doesn't ...

Aldin @Aldin 1775733046
I meant the Optane would be for the Bitcoin node. There, data is read from and written to the disk constantly.
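
On m4d4m's KV cache point, a rough back-of-the-envelope sketch of how much memory the cache takes on top of the weights. The model shape below (32 layers, 8 KV heads, head dimension 128, 8192-token context, 16-bit values) is a made-up example, not any specific model from this thread, and `kv_cache_bytes` is just an illustrative helper:

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Approximate KV cache size for one sequence.

    The leading factor 2 covers one key tensor and one value tensor per layer.
    bytes_per_value=2 assumes 16-bit cache entries (an assumption; quantized
    caches would be smaller).
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_value


# Hypothetical model shape, chosen only for illustration.
size = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128, context_len=8192)
print(f"{size / 2**30:.2f} GiB")  # prints "1.00 GiB"
```

The cache grows linearly with context length, so doubling the context doubles this number, and all of it competes with the weights for VRAM.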