on nostr

Open in Damus

Picking a Qwen3.6 model in Ollama? More parameters ≠ better model. Architecture matters. qwen3.6:27b is a dense model — every token goes through the entire network. Slower, but higher-quality outp...

nostrich 1776941687

@Juraj https://sleepingrobots.com/dreams/stop-using-ollama/

2❤️1

Thanks, good read. I am using it because it supports mlx explicitly. llama.cpp is still metal, so it's a bit slower. Although I love llama.cpp and was using it natively. LM Studio - I don't like it, many bugs, closed source. What I want: FOSS, models are introduced and supported fast, ideally one ...