Update Sep 26, 2025 tracked by Updatify

v0.12.3

New models

  • DeepSeek-V3.1-Terminus: DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode. It delivers more stable & reliable outputs across benchmarks compared to the previous version:

    Run on Ollama’s cloud:

    ollama run deepseek-v3.1:671b-cloud

    Run locally (requires 500GB+ of VRAM)

    ollama run deepseek-v3.1
  • Kimi-K2-Instruct-0905: Kimi K2-Instruct-0905 is the latest, most capable version of Kimi K2. It is a state-of-the-art mixture-of-experts (MoE) language model, featuring 32 billion activated parameters and a total of 1 trillion parameters.

    ollama run kimi-k2:1t-cloud

What’s Changed

  • Fixed issue where tool calls provided as stringified JSON would not be parsed correctly
  • ollama push will now provide a URL to follow to sign in
  • Fixed issues where qwen3-coder would output unicode characters incorrectly
  • Fix issue where loading a model with /load would crash

New Contributors

Full Changelog: https://github.com/ollama/ollama/compare/v0.12.2…v0.12.3