Google previewed a new AI model, Gemini Diffusion, that it claims is “state-of-the-art” on certain coding and math tasks — leveraging parallel generation to achieve low latency. The version being released today generates answers five times faster than Google’s Gemini 2.0 Flash-Lite model while matching its programming performance, the company claims.
Google says that Gemini Diffusion is currently testing with a small group.