Update Jun 5, 2026 tracked by Updatify

v0.30.6

New models

  • Gemma 4 QAT weights: the Gemma 4 family is now optimized with Quantization-Aware Training (QAT) to dramatically reduce memory requirements and maximize on-device performance. Look for the tags ending in -qat:
    • gemma4:e2b-it-qat
    • gemma4:e4b-it-qat
    • gemma4:12b-it-qat
    • gemma4:26b-a4b-it-qat
    • gemma4:31b-it-qat

What’s Changed

  • ollama launch omp now integrates with Oh My Pi, an AI coding agent with IDE integration
  • MLX embedding layers now use NVFP4 global scale for improved quantization on Apple Silicon

Full Changelog: https://github.com/ollama/ollama/compare/v0.30.5…v0.30.6