Update May 3, 2025 tracked by Updatify
v0.6.8
What’s Changed
-
Performance improvements for Qwen 3 MoE models (
30b-a3band235b-a22b) on NVIDIA and AMD GPUs -
Fixed
GGML_ASSERT(tensor->op == GGML_OP_UNARY) failedissue caused by conflicting installations - Fixed a memory leak that occurred when providing images as input
-
ollama showwill now correctly label older vision models such asllava - Reduced out of memory errors by improving worst-case memory estimations
-
Fix issue that resulted in a
context cancelederror
New Contributors
- @AliAhmedNada made their first contribution in https://github.com/ollama/ollama/pull/10522
- @HarshNevse made their first contribution in https://github.com/ollama/ollama/pull/10465
Full Changelog: https://github.com/ollama/ollama/compare/v0.6.7…v0.6.8