Update Jun 5, 2026 tracked by Updatify
v0.30.6
New models
- Gemma 4 QAT weights: the Gemma 4 family is now optimized with Quantization-Aware Training (QAT) to dramatically reduce memory requirements and maximize on-device performance. Look for the tags ending in -qat:
  - gemma4:e2b-it-qat
  - gemma4:e4b-it-qat
  - gemma4:12b-it-qat
  - gemma4:26b-a4b-it-qat
  - gemma4:31b-it-qat
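As a rough illustration of picking out the QAT variants by their tag suffix, here is a minimal Python sketch. The helper names (`is_qat_tag`, `pull_command`) and the `some-model:latest` tag are hypothetical and only there for contrast; the five `gemma4` tags are the ones listed above. The sketch only builds `ollama pull` command lines and does not run them.

```python
# Tags listed in these release notes, plus one hypothetical non-QAT tag
# ("some-model:latest") included purely to show the filter at work.
CANDIDATE_TAGS = [
    "gemma4:e2b-it-qat",
    "gemma4:e4b-it-qat",
    "gemma4:12b-it-qat",
    "gemma4:26b-a4b-it-qat",
    "gemma4:31b-it-qat",
    "some-model:latest",  # hypothetical, not from the release notes
]


def is_qat_tag(tag: str) -> bool:
    """Return True when the tag ends in the -qat suffix."""
    return tag.endswith("-qat")


def pull_command(tag: str) -> list[str]:
    """Build (but do not execute) the argument list for `ollama pull <tag>`."""
    return ["ollama", "pull", tag]


# Keep only the QAT tags and prepare one pull command per tag.
qat_tags = [tag for tag in CANDIDATE_TAGS if is_qat_tag(tag)]
commands = [pull_command(tag) for tag in qat_tags]

for command in commands:
    print(" ".join(command))
```

Each resulting list could be passed to `subprocess.run` on a machine with Ollama installed; that step is left out so the sketch stays side-effect free.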
What’s Changed
- ollama launch omp now integrates with Oh My Pi, an AI coding agent with IDE integration
- MLX embedding layers now use NVFP4 global scale for improved quantization on Apple Silicon
Full Changelog: https://github.com/ollama/ollama/compare/v0.30.5...v0.30.6