Update Aug 25, 2025 tracked by Updatify
v0.11.7
DeepSeek-V3.1
DeepSeek-V3.1 is now available to run via Ollama.
This model supports hybrid thinking, meaning thinking can be enabled or disabled by setting think in Ollama’s API:
curl http://localhost:11434/api/chat -d '{
"model": "deepseek-v3.1",
"messages": [
{
"role": "user",
"content": "why is the sky blue?"
}
],
"think": true
}'
In Ollama’s CLI, thinking can be enabled or disabled by running the /set think or /set nothink commands.
Turbo (in preview)
DeepSeek-V3.1 has over 671B parameters, and so a large amount of VRAM is required to run it. Ollama’s Turbo mode (in preview) provides access to powerful hardware in the cloud you can use to run the model.
Turbo via Ollama’s app
- Download Ollama for macOS or Windows
-
Select
deepseek-v3.1:671bfrom the model selector - Enable Turbo
Turbo via Ollama’s CLI and libraries
- Create an account on ollama.com/signup
- Follow the docs for Ollama’s CLI to upload authenticate your Ollama installation
- Run the following:
OLLAMA_HOST=ollama.com ollama run deepseek-v3.1
For instructions on using Turbo with Ollama’s Python and JavaScript library, see the docs
What’s Changed
- Fixed issue where multiple models would not be loaded on CPU-only systems
-
Ollama will now work with models who skip outputting the initial
<think>tag (e.g. DeepSeek-V3.1) -
Fixed issue where text would be emitted when there is no opening
<think>tag from a model -
Fixed issue where tool calls containing
{or}would not be parsed correctly
New Contributors
- @zoupingshi made their first contribution in https://github.com/ollama/ollama/pull/12028
Full Changelog: https://github.com/ollama/ollama/compare/v0.11.6…v0.11.7