News Grower

Independent coverage of AI, startups, and technology.

Ars Technica, Jun 10, 2026 at 19:29

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

Diffusion AI is most common in image generation, but it can make text outputs much faster.

By Ryan Whitwam

Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model family, but it's fundamentally different from the rest of the lineup. DiffusionGemma doesn't generate outputs linearly like most AI models. Instead, it can produce an entire block of text in parallel. Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU.

Most AI models are designed to be autoregressive: they generate text left to right, one token at a time. DiffusionGemma has more in common with image generation models, which start with static and then denoise it to create the desired content. This model starts with a field of placeholder tokens and runs over that canvas multiple times, generating likely tokens and using them to improve its estimates of the others. At the end of the process, the model finalizes its token outputs in one large block, the "denoised" text canvas.

DiffusionGemma is fairly large in the realm of Google's open models. It's a Mixture of Experts (MoE) model with a total of 26 billion parameters, but only 3.8 billion are activated during inference. That means it should fit in the 18GB RAM allotment of a high-end GPU. In testing with an RTX 5090, DiffusionGemma spits out around 700 tokens per second. With a single Nvidia H100 AI accelerator, DiffusionGemma can produce 1,000+ tokens per second. That's about four times the output of the similarly sized autoregressive Gemma models.
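To make the placeholder-canvas idea concrete, here is a toy Python sketch of iterative parallel denoising. It is not DiffusionGemma's actual algorithm or API. The names `MASK`, `toy_predict`, and `denoise` are hypothetical, and the stand-in "model" cheats by reading the target text, so only the control flow is meaningful: every open slot is estimated on each pass, the most confident guesses are committed, and the rest are re-estimated with more context.

```python
import random

MASK = "<mask>"  # hypothetical placeholder token


def toy_predict(canvas, target, rng):
    """Stand-in for a model (assumption: it peeks at the target text).

    Returns {position: (token, confidence)} for every masked slot.
    Confidence rises when neighboring slots are already filled, which
    mimics using committed tokens to improve the remaining estimates.
    """
    proposals = {}
    for i, tok in enumerate(canvas):
        if tok != MASK:
            continue
        neighbors = [canvas[j] for j in (i - 1, i + 1) if 0 <= j < len(canvas)]
        filled = sum(t != MASK for t in neighbors)
        confidence = 0.3 + 0.3 * filled + 0.1 * rng.random()
        proposals[i] = (target[i], confidence)
    return proposals


def denoise(target, steps=4, seed=0):
    """Fill a fully masked canvas in at most `steps` parallel passes."""
    rng = random.Random(seed)
    canvas = [MASK] * len(target)
    history = [list(canvas)]
    for step in range(steps):
        proposals = toy_predict(canvas, target, rng)
        if not proposals:
            break
        # Commit an even share of the open slots each pass (ceiling
        # division), so the final pass always fills whatever is left.
        k = -(-len(proposals) // (steps - step))
        ranked = sorted(proposals, key=lambda i: proposals[i][1], reverse=True)
        for i in ranked[:k]:
            canvas[i] = proposals[i][0]
        history.append(list(canvas))
    return canvas, history


if __name__ == "__main__":
    words = "the quick brown fox jumps over the lazy dog".split()
    final, passes = denoise(words, steps=4)
    for n, snapshot in enumerate(passes):
        print(n, " ".join(snapshot))
```

In this sketch the nine-word canvas is finished in four passes, where a left-to-right generator would need nine sequential steps. That gap between passes and tokens is the rough intuition behind the speedup described above, though the toy says nothing about real-world throughput.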
