Ars Technica May 1, 2026 at 15:32 Big Tech Stable Warm

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."

Signal weather

Stable

The story has moved beyond the first headline and now acts as a reliable context anchor.

By Kyle Orland Original source

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Last month, Anthropic made a big deal about the supposedly outsize cybersecurity threat represented by its Mythos Preview model, leading the company to restrict the initial release to “critical industry partners.” But new research from the UK's AI Security Institute (AISI) suggests that OpenAI's GPT-5.5, which launched publicly last week, reached "a similar level of performance on our cyber evaluations" as Mythos Preview, which the group evaluated last month. Since 2023, the AISI has run a variety of frontier AI models through 95 different Capture the Flag challenges designed to test capabilities on cybersecurity tasks, such as reverse engineering, web exploitation, and cryptography. On the highest-level "Expert" tasks, GPT-5.5 passed an average of 71.4 percent, slightly higher than the 68.6 percent achieved by Mythos Preview (though within the margin of error). In one particularly difficult task that involved building a disassembler to decode a Rust binary, AISI notes that "GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73" in API calls. GPT-5.5 also matched Mythos Preview in its progress on "The Last Ones" (TLO), an AISI test range set up to simulate a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts on TLO, compared to 2 of 10 for Mythos Preview—no previous model had ever succeeded at the test even once. But GPT-5.5 still fails at AISI's more difficult "Cooling Tower" simulation of an attempted disruption of the control software for a power plant, as every previously tested AI model also has. Read full article Comments

Read the full article

Stay on the signal

Follow GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Follow this story beyond a single article: new follow-ups, adjacent sources, and the evolving storyline.

Story map

Understand this topic fast

A quick entry into the story: why it matters now, who is involved, and where to go next for context.

Why it matters now

This story is still moving and pulling follow-up coverage.

There are already 6 connected articles in the same storyline to continue from here.

The story keeps orbiting around Ars Technica, Breakthrough, and Breakthrough Specific, so the entity pages are the fastest way to build context.

Ars Technica already has 4 follow-up stories on the same theme.

Topic constellation

Open the live map for this story

See which entities, story threads, sources, and follow-up articles shape this story right now.

Click nodes to continue

Entity Cluster Article Hub Source

Entity pages

Ars Technica Breakthrough Breakthrough Specific Cybersecurity GPT-5.5 Matches Heavily

Story threads

Ars Technica

Latest coverage and related links about Ars Technica.

Ars Technica

Последние материалы и связанный контекст по теме Ars Technica.

Breakthrough

Последние материалы и связанный контекст по теме Breakthrough.

Cybersecurity

Последние материалы и связанный контекст по теме Cybersecurity.

Story timeline

Continue with this story

A short sequence of events and follow-up stories to understand the arc quickly.

Jun 21, 2026 at 17:49 Ars Technica

Trump admin’s coal investments assist plants with repeated violations

At least three coal plants have been repeatedly cited for violating environmental regulations.

Jun 21, 2026 at 10:00 Ars Technica

Review: Widow's Bay is a boldly original take on comedic horror

An eminently binge-able series that honors classic horror tropes while reinventing them in surprising ways

Jun 20, 2026 at 11:15 Ars Technica

The UK will scan asylum-seekers’ faces for age checks—despite knowing the tech is flawed

Tests of age-verification technology show the risks of life-altering errors.

Jun 19, 2026 at 22:40 TechCrunch

From PGP to Mythos: a brief history of export controls that didn’t stop anyone

For the last 30 years, stopping the flow of cybersecurity-related software has proven to be ineffective. It's unclear why it would work n...

Jun 19, 2026 at 16:11 Hacker News

GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2

Comments

May 1, 2026 at 15:32 Ars Technica

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."

How reliable this looks

Signal and trust for Ars Technica

This source works at a steady pace: 100% of recent stories land in the hot window, and 0% carry visible search signal.

Trusted

Reliability

Freshness

100

Sources in storyline

More stories that share tags, source, or category context.

Hacker News Jun 21, 2026 at 21:15 Developer Tools

Rising Hot

NSA chief says Mythos breached 'almost all' classified systems in hours

Comments

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Almost Breached Classified Classified Systems

Read article Follow story

bankwatch.ca

Trump admin’s coal investments assist plants with repeated violations

Ars Technica Jun 21, 2026 at 17:49 Big Tech

Rising Hot

Trump admin’s coal investments assist plants with repeated violations

At least three coal plants have been repeatedly cited for violating environmental regulations.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Coal Coal Investments Environmental

Read article Follow story

arstechnica.com

Review: Widow's Bay is a boldly original take on comedic horror

Ars Technica Jun 21, 2026 at 10:00 Big Tech

Rising Hot

Review: Widow's Bay is a boldly original take on comedic horror

An eminently binge-able series that honors classic horror tropes while reinventing them in surprising ways

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Bay Binge Able Binge Able Series

Read article Follow story

arstechnica.com

Hacker News Jun 21, 2026 at 09:45 Developer Tools

Rising Hot

NSA director: 'Mythos "broke into almost all of our classified systems in hours"

Comments

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Almost Classified Classified Systems Comments

Read article Follow story

economist.com

More from Ars Technica

Fresh reporting and follow-up coverage from the same newsroom.

Open source page

Ars Technica Jun 21, 2026 at 17:49 Big Tech

Rising Hot

Trump admin’s coal investments assist plants with repeated violations

At least three coal plants have been repeatedly cited for violating environmental regulations.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Coal Coal Investments Environmental

Read article Follow story

arstechnica.com

Ars Technica Jun 21, 2026 at 10:00 Big Tech

Rising Hot

Review: Widow's Bay is a boldly original take on comedic horror

An eminently binge-able series that honors classic horror tropes while reinventing them in surprising ways

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Bay Binge Able Binge Able Series

Read article Follow story

arstechnica.com

The UK will scan asylum-seekers’ faces for age checks—despite knowing the tech is flawed

Ars Technica Jun 20, 2026 at 11:15 Big Tech

Rising Hot

The UK will scan asylum-seekers’ faces for age checks—despite knowing the tech is flawed

Tests of age-verification technology show the risks of life-altering errors.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Age Verification Age Verification Technology Ars Technica Asylum Seekers

Read article Follow story

arstechnica.com

Rocket Report: Rebuild begins at Blue Origin launch pad; Relativity targets Mars

Ars Technica Jun 19, 2026 at 13:36 Big Tech

Rising Hot

Rocket Report: Rebuild begins at Blue Origin launch pad; Relativity targets Mars

A French launch startup is scrapping the name of its rocket, apparently due to a trademark issue.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Blue Origin Mars A French Rebuild

Read article Follow story

arstechnica.com

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Follow GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Understand this topic fast

Why it matters now

Open the live map for this story

Entity pages

Story threads

Continue with this story

Signal and trust for Ars Technica

Related articles

More from Ars Technica