v0.12.3 — Ollama - Product release notes & changelog tool

New models

DeepSeek-V3.1-Terminus: DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode. It delivers more stable & reliable outputs across benchmarks compared to the previous version:

Run on Ollama’s cloud:
```
ollama run deepseek-v3.1:671b-cloud
```
Run locally (requires 500GB+ of VRAM)
```
ollama run deepseek-v3.1
```
Kimi-K2-Instruct-0905: Kimi K2-Instruct-0905 is the latest, most capable version of Kimi K2. It is a state-of-the-art mixture-of-experts (MoE) language model, featuring 32 billion activated parameters and a total of 1 trillion parameters.
```
ollama run kimi-k2:1t-cloud
```

Fixed issue where tool calls provided as stringified JSON would not be parsed correctly
ollama push will now provide a URL to follow to sign in
Fixed issues where qwen3-coder would output unicode characters incorrectly
Fix issue where loading a model with /load would crash

Full Changelog: https://github.com/ollama/ollama/compare/v0.12.2…v0.12.3