Google announces Gemma 4 open AI models, switches to Apache 2.0 license
Gemma 4 brings the first major update to Google's open models in a year.
Google's Gemini AI models have improved by leaps and bounds over the past year, but you can only use Gemini on Google's terms. The company's Gemma open-weight models have provided more freedom, but Gemma 3, which launched over a year ago, is getting a bit long in the tooth. Starting today, developers can start working with Gemma 4, which comes in four sizes optimized for local usage. Google has also acknowledged developer frustrations with AI licensing, so it's dumping the custom Gemma license. Like past versions of its open-weight models, Google has designed Gemma 4 to be usable on local machines. That can mean plenty of things, of course. The two large Gemma variants, 26B Mixture of Experts and 31B Dense, are designed to run unquantized in bfloat16 format on a single 80GB Nvidia H100 GPU. Granted, that's a $20,000 AI accelerator, but it's still local hardware. If quantized to run at lower precision, these big models will fit on consumer GPUs. Google also claims it has focused on reducing latency to really take advantage of Gemma's local processing. The 26B Mixture of Experts model activates only 3.8 billion of its 26 billion parameters in inference mode, giving it much higher tokens-per-second than similarly sized models. Meanwhile, 31B Dense is more about quality than speed, but Google expects developers to fine-tune it for specific uses.Read full article Comments
Related tags
Companies and people
Story threads
Announces
Последние материалы и связанный контекст по теме Announces.
Apache
Latest coverage and related links about Apache.
Apache
Последние материалы и связанный контекст по теме Apache.
Ars Technica
Latest coverage and related links about Ars Technica.
Ars Technica
Последние материалы и связанный контекст по теме Ars Technica.
Gemma
Последние материалы и связанный контекст по теме Gemma.
Gemma
Latest coverage and related links about Gemma.
Latest coverage and related links about Google.
Последние материалы и связанный контекст по теме Google.
Continue with this story
Follow the same topic through connected articles, entity pages, and active story threads.
Google releases Gemma 4 open models
Comments
This Ford is the quickest production car at the Nürburgring, ever
Only three race cars have ever gone quicker around this famous track.
Google now lets you direct avatars through prompts in its Vids app
Google is adding a way to customize and instruct avatars for video creation in the Vids app.
Anthropic says its leak-focused DMCA effort unintentionally hit legit GitHub forks
But the effort to stop the spread of leaked Claude Code client code is an uphill battle.
Why is NASA bothering to go back to the Moon if we've already been there?
NASA has struggled to deal with the widespread sentiment that NASA has “been there, done that."
Tesla sales grew by 6% in Q1, but company has an overproduction problem
Between January and March, Tesla built 50,000 more cars than it could sell.
Entity pages
Ad slot
Article inline monetization block
A reserved partner slot for relevant tools, services, and contextual editorial integrations.
Related articles
More stories that share tags, source, or category context.
Microsoft takes on AI rivals with three new foundational models
MAI released models that can transcribe voice into text as well as generate audio and images after the group's formation six months ago.
Google releases Gemma 4 open models
Comments
This Ford is the quickest production car at the Nürburgring, ever
Only three race cars have ever gone quicker around this famous track.
Google now lets you direct avatars through prompts in its Vids app
Google is adding a way to customize and instruct avatars for video creation in the Vids app.
More from Ars Technica
Fresh reporting and follow-up coverage from the same newsroom.
This Ford is the quickest production car at the Nürburgring, ever
Only three race cars have ever gone quicker around this famous track.
Anthropic says its leak-focused DMCA effort unintentionally hit legit GitHub forks
But the effort to stop the spread of leaked Claude Code client code is an uphill battle.
Why is NASA bothering to go back to the Moon if we've already been there?
NASA has struggled to deal with the widespread sentiment that NASA has “been there, done that."
Tesla sales grew by 6% in Q1, but company has an overproduction problem
Between January and March, Tesla built 50,000 more cars than it could sell.