Update Oct 30, 2025 tracked by Updatify
v0.12.8
What’s Changed
- qwen3-vl performance improvements, including flash attention support by default
- qwen3-vl will now output less leading whitespace in the response when thinking
- Fixed issue where deepseek-v3.1 thinking could not be disabled in Ollama’s new app
- Fixed issue where qwen3-vl would fail to interpret images with transparent backgrounds
- Ollama will now stop running a model before removing it via ollama rm
- Fixed issue where prompt processing would be slower on Ollama’s engine
- Ignore unsupported iGPUs when doing device discovery on Windows
New Contributors
- @athshh made their first contribution in https://github.com/ollama/ollama/pull/12822
Full Changelog: https://github.com/ollama/ollama/compare/v0.12.7...v0.12.8