Update Nov 19, 2025 tracked by Updatify

v0.13.0

New models

  • DeepSeek-OCR: DeepSeek-OCR uses optical 2D mapping to compress long contexts, achieving high OCR precision with reduced vision tokens and demonstrating practical value in document processing.
  • Cogito-V2.1: instruction tuned generative models, currently the best open-weight LLM by a US company

DeepSeek-OCR

DeepSeek-OCR is now available on Ollama. Example inputs:

ollama run deepseek-ocr "/path/to/image\n<|grounding|>Given the layout of the image."
ollama run deepseek-ocr "/path/to/image\nFree OCR."
ollama run deepseek-ocr "/path/to/image\nParse the figure."
ollama run deepseek-ocr "/path/to/image\nExtract the text in the image."
ollama run deepseek-ocr "/path/to/image\n<|grounding|>Convert the document to markdown."

New bench tool

Ollama’s GitHub repo now includes a bench tool that can be used to test model performance. For the time being this is a separate tool that can be built in the Ollama GitHub repository:

First, install Go. Then from the root of the Ollama repository run:

go run ./cmd/bench -model gpt-oss:20b

For more information see the tool’s documentation

What’s Changed

  • DeepSeek-OCR is now supported
  • DeepSeek-V3.1 architecture is now supported in Ollama’s engine
  • Fixed performance issues that arose in Ollama 0.12.11 on CUDA
  • Fixed issue where Linux install packages were missing required Vulkan libraries
  • Improved CPU and memory detection while in containers/cgroups
  • Improved VRAM information detection for AMD GPUs
  • Improved KV cache performance to no longer require defragmentation

New Contributors

Full Changelog: https://github.com/ollama/ollama/compare/v0.12.11…v0.13.0