v0.11.7 — Ollama - Product release notes & changelog tool

DeepSeek-V3.1

DeepSeek-V3.1 is now available to run via Ollama.

This model supports hybrid thinking, meaning thinking can be enabled or disabled by setting think in Ollama’s API:

curl http://localhost:11434/api/chat -d '{
  "model": "deepseek-v3.1",
  "messages": [
    {
      "role": "user",
      "content": "why is the sky blue?"
    }
  ],
  "think": true
}'

In Ollama’s CLI, thinking can be enabled or disabled by running the /set think or /set nothink commands.

Turbo (in preview)

DeepSeek-V3.1 has over 671B parameters, and so a large amount of VRAM is required to run it. Ollama’s Turbo mode (in preview) provides access to powerful hardware in the cloud you can use to run the model.

Turbo via Ollama’s app

Download Ollama for macOS or Windows
Select deepseek-v3.1:671b from the model selector
Enable Turbo

Turbo via Ollama’s CLI and libraries

Create an account on ollama.com/signup
Follow the docs for Ollama’s CLI to upload authenticate your Ollama installation
Run the following:

OLLAMA_HOST=ollama.com ollama run deepseek-v3.1

For instructions on using Turbo with Ollama’s Python and JavaScript library, see the docs

What’s Changed

Fixed issue where multiple models would not be loaded on CPU-only systems
Ollama will now work with models who skip outputting the initial<think> tag (e.g. DeepSeek-V3.1)
Fixed issue where text would be emitted when there is no opening <think> tag from a model
Fixed issue where tool calls containing { or } would not be parsed correctly

New Contributors

@zoupingshi made their first contribution in https://github.com/ollama/ollama/pull/12028

Full Changelog: https://github.com/ollama/ollama/compare/v0.11.6…v0.11.7