Update Jun 3, 2026 tracked by Updatify
v0.30.2
What’s Changed
-
ollama launchnow supports Qwen Code and can guide users through installing the Cline CLI when it is missing. -
ollama launch codexnow uses an isolated launch configuration, avoiding conflicts with a user’s existing Codex settings. - Added llama.cpp backend compatibility support for Poolside’s Laguna architecture.
- The llama.cpp backend now includes cached prompt tokens in token accounting, improving usage reporting for requests with prompt cache hits.
- The llama.cpp backend now ignores SSE ping comments, improving streaming compatibility with newer backend behavior.
- The llama.cpp backend now detects load stalls from server output so failed model loads surface more reliably instead of hanging.
- Radeon 8060S integrated GPUs are now allowed by default.
- Template details are included in logs to make troubleshooting model prompts easier.
- Added Hermes Desktop configuration docs.
- Fixed a build issue in the Laguna compatibility patch, restoring Laguna support in release builds.
Full Changelog: https://github.com/ollama/ollama/compare/v0.30.0…v0.30.2