Update Jun 3, 2026 tracked by Updatify

v0.30.2

What’s Changed

  • ollama launch now supports Qwen Code and can guide users through installing the Cline CLI when it is missing.
  • ollama launch codex now uses an isolated launch configuration, avoiding conflicts with a user’s existing Codex settings.
  • Added llama.cpp backend compatibility support for Poolside’s Laguna architecture.
  • The llama.cpp backend now includes cached prompt tokens in token accounting, improving usage reporting for requests with prompt cache hits.
  • The llama.cpp backend now ignores SSE ping comments, improving streaming compatibility with newer backend behavior.
  • The llama.cpp backend now detects load stalls from server output so failed model loads surface more reliably instead of hanging.
  • Radeon 8060S integrated GPUs are now allowed by default.
  • Template details are included in logs to make troubleshooting model prompts easier.
  • Added Hermes Desktop configuration docs.
  • Fixed a build issue in the Laguna compatibility patch, restoring Laguna support in release builds.

Full Changelog: https://github.com/ollama/ollama/compare/v0.30.0…v0.30.2