Damus
NonMetalCoin · 4d
Will you continue or use cloud LLM?
Leito · 4d
How does their performance compare?
Kieran · 4d
yes, the tool calls in gemma4 are not great yet unless you use ollama's latest build. Generally tool calling doesn't work well at release until the engines support the model
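One way to check whether a given build actually supports tool calls is to send a request with a tool definition and see if the reply contains `tool_calls`. A minimal sketch of assembling such a request body for Ollama's `/api/chat` endpoint, which uses the OpenAI-style function schema — the model name and the `get_weather` tool are placeholders, and actually sending it requires a running Ollama server:

```python
import json

def build_tool_call_request(model, prompt, tools):
    """Assemble an Ollama /api/chat request body with tool definitions.

    Whether the model actually emits tool_calls depends on both the
    model and the engine build, as noted in the thread.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "tools": tools,
        "stream": False,
    }

# Hypothetical weather tool, just to exercise the schema.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# "some-local-model" is a placeholder, not a real tag.
body = build_tool_call_request(
    "some-local-model", "What's the weather in Oslo?", [weather_tool]
)
print(json.dumps(body, indent=2))
# POST this to http://localhost:11434/api/chat and inspect
# response["message"].get("tool_calls") to see if tool calling works.
```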
ABH3PO · 4d
I can corroborate; I've had a similar experience.
Tubii · 4d
Gemma is smart but not for agentic stuff. Best for chats and so on. Qwen coder next is the best stuff right now. Running it on CPU, which is a bit slow, but it is looocaaal!
jon martins · 4d
Running gemma 4 on ollama, latest llama.cpp, or something else?
Reichard Könige · 4d
So should I then just try to get the biggest version I can fit into an RTX 3060 12 GB, or get an RTX 3090 24 GB?
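A rough way to sanity-check that tradeoff is to estimate weight bytes from parameter count and quantization bits, plus some headroom for KV cache and runtime buffers. A sketch — the 1.2x overhead factor and the ~4.8 effective bits per weight for a Q4-style quant are rough assumptions, not measured figures:

```python
def est_vram_gb(params_b, bits_per_weight, overhead=1.2):
    """Rough VRAM estimate: weight bytes times a fudge factor for
    KV cache, activations, and runtime buffers (overhead is a guess)."""
    weight_gb = params_b * bits_per_weight / 8  # billions of params -> GB
    return weight_gb * overhead

def fits(params_b, bits_per_weight, vram_gb):
    """True if the estimated footprint fits in the given VRAM."""
    return est_vram_gb(params_b, bits_per_weight) <= vram_gb

# A ~12B model at a Q4-style quant (~4.8 effective bits) on a 12 GB card:
print(fits(12, 4.8, 12))                    # True  (~8.6 GB estimated)
# A ~27B model at the same quant overflows 12 GB but fits in 24 GB:
print(fits(27, 4.8, 12), fits(27, 4.8, 24)) # False True
```

By this estimate the 12 GB card handles mid-size quantized models, while the 24 GB card is what opens up the larger ones; actual headroom also depends on context length, since KV cache grows with it.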