AI Revolution Week in Review – April 25, 2026
Saturday, April 25, 2026·10:03
Enjoy the show? Subscribe to never miss an episode.
Show Notes
AI Revolution – April 25, 2026
Daily AI briefing — frontier models, research, and infrastructure.
Episode Summary
Today's episode covers 17 stories across 5 topic areas, including: OpenAI unveils GPT-5.5, claims a "new class of intelligence" at double the API price; Google pours up to $40 billion into ChatGPT rival Anthropic; DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs.
Stories Covered
• Model_Release
OpenAI unveils GPT-5.5, claims a "new class of intelligence" at double the API price
The Decoder · Apr 25 · Relevance: ██████████ 10/10
Why it matters: GPT-5.5 represents OpenAI's push into natively agentic models that autonomously switch between tools, marking a significant architectural shift. The doubled API pricing signals the rising cost floor for frontier capabilities and forces enterprises to recalculate AI deployment economics.
- GPT-5.5 is an agentic model designed for autonomous multi-tool task completion
- API pricing roughly doubled compared to previous generation
- Tops benchmarks but hallucination rates remain a significant concern
- OpenAI's chief scientist Jakub Pachocki calls recent progress 'surprisingly slow' and promises bigger leaps ahead
DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs
The Register AI · Apr 24 · Relevance: █████████░ 9/10
Why it matters: DeepSeek V4 dramatically reduces inference costs and explicitly supports Huawei Ascend NPUs, demonstrating China's progress toward a fully domestic AI stack independent of NVIDIA. This has major geopolitical implications for the US chip export control strategy.
- DeepSeek V4 claims performance rivaling best proprietary American LLMs
- Dramatically reduced inference costs compared to R1
- Extended support for Huawei's Ascend AI accelerators
- Available in preview with open weights
GPT-5.5 tops benchmarks but still hallucinates frequently and costs 20 percent more over the API
The Decoder · Apr 24 · Relevance: ████████░░ 8/10
Why it matters: Independent benchmark evaluation shows GPT-5.5 reclaims the top position for OpenAI but persistent hallucination issues temper enthusiasm. This highlights the ongoing reliability gap that remains the primary blocker for high-stakes enterprise deployments.
- GPT-5.5 leads across major AI benchmarks
- Hallucination rates remain high despite capability gains
- Price increase described as 20% over API in some configurations
- Considered best value among proprietary models despite price hike
• Industry
Google pours up to $40 billion into ChatGPT rival Anthropic
The Decoder · Apr 25 · Relevance: ██████████ 10/10
Why it matters: Google's up-to-$40B investment in Anthropic, combined with Amazon's $25B pledge, represents an unprecedented $65B flowing into a single AI company in weeks. This reshapes the competitive landscape and cements a new model where cloud hyperscalers fund frontier labs as strategic proxies.
- Google investing up to $40 billion in Anthropic
- Amazon separately pledged $25 billion, bringing total to potentially $65 billion
- Investment includes both cash and compute resources
- Follows Anthropic's limited release of its Mythos cybersecurity model
Anthropic gets $5B investment from Amazon, will use it to buy Amazon chips
Ars Technica AI · Apr 21 · Relevance: ████████░░ 8/10
Why it matters: Amazon's latest $5B tranche for Anthropic explicitly earmarked for Amazon's custom silicon reveals how hyperscaler investments create vertically integrated compute dependencies. Anthropic securing 5 GW of custom silicon underscores the scale of infrastructure needed to compete at the frontier.
- Anthropic secures $5 billion from Amazon
- Investment will fund purchases of Amazon's custom AI chips
- Anthropic secures 5 gigawatts of Amazon's custom silicon
- Claude demand described as soaring
Cohere takes over Aleph Alpha shortly after the German startup ousted its original founder
The Decoder · Apr 24 · Relevance: ███████░░░ 7/10
Why it matters: Cohere's acquisition of Aleph Alpha, backed by $600M from Schwarz Group, signals the consolidation wave hitting European AI startups that couldn't independently reach frontier scale. Europe's sovereign AI ambitions are being absorbed by North American players.
- Canadian AI company Cohere is acquiring German AI startup Aleph Alpha
- Schwarz Group investing $600 million in the deal
- Aleph Alpha's original founder Jonas Andrulis was recently ousted
- Aleph Alpha was considered Germany's answer to OpenAI
The billion-dollar startup with a different idea for AI
AI News · Apr 23 · Relevance: ███████░░░ 7/10
Why it matters: Yann LeCun's AMI Labs raising $1B with just 12 employees to pursue alternatives to LLMs represents the most prominent bet yet that the current paradigm may not be the path to general intelligence. This could redirect significant research capital toward non-LLM approaches.
- AMI Labs founded by Yann LeCun raised $1 billion with only 12 employees
- LeCun argues that large language models are not the path to advanced AI
- Pursuing alternative approaches to artificial intelligence
- Represents the largest investment in explicitly non-LLM AI research
Anthropic admits it dumbed down Claude when trying to make it smarter
The Register AI · Apr 23 · Relevance: ██████░░░░ 6/10
Why it matters: Anthropic confirming that system changes and bugs caused Claude quality degradation validates widespread user complaints and exposes the fragility of production LLM systems. It also raises questions about testing practices when overlapping updates can silently degrade model performance.
- Anthropic confirmed Claude quality declined due to overlapping system changes and bugs
- Users had been complaining about lower-quality responses for a month
- Anthropic separately faced backlash over Opus 4.7's overzealous content filtering
- Company also tested removing Claude Code from Pro plan amid unsustainable demand
• Research
What Anthropic’s Mythos Means for the Future of Cybersecurity
IEEE Spectrum AI · Apr 23 · Relevance: █████████░ 9/10
Why it matters: Anthropic's Mythos can autonomously find and weaponize software vulnerabilities into working exploits — capabilities that eluded thousands of developers. The restricted-release model establishes a new paradigm for capability-gated AI distribution with profound implications for offensive and defensive security.
- Mythos can autonomously find and weaponize software vulnerabilities into working exploits
- Anthropic restricting access to limited number of companies
- Model finds vulnerabilities in operating systems and internet infrastructure
- Security community angered by lack of details in announcement
Mozilla: Anthropic's Mythos found 271 security vulnerabilities in Firefox 150
Ars Technica AI · Apr 21 · Relevance: ████████░░ 8/10
Why it matters: Mozilla's real-world test of Mythos finding 271 Firefox vulnerabilities provides the first concrete evidence of the model's defensive potential. Mozilla's CTO calling it a watershed moment validates the claim that AI is shifting the balance of power toward defenders — at least temporarily.
- Mythos found 271 security vulnerabilities in Firefox 150
- Mozilla CTO says AI represents a watershed moment for software defenders
- CTO says model is 'every bit as capable' as world's best security researchers
- None of the vulnerabilities were beyond what a human could find, but the speed and scale are unprecedented
AI Agent Designs a RISC-V CPU Core From Scratch
IEEE Spectrum AI · Apr 22 · Relevance: ████████░░ 8/10
Why it matters: An agentic AI system designing a complete RISC-V CPU core at 1.5GHz (comparable to 2011 laptop performance) from scratch represents a major milestone in AI-driven hardware design. This could dramatically accelerate chip development cycles and reduce design costs.
- Verkor.io's agentic AI system designed a complete RISC-V CPU core called VerCore
- CPU achieves 1.5 GHz clock speed, similar to 2011-era laptop CPU
- Entirely designed by AI rather than AI-assisted human design
- Key finding: end-to-end agentic approach outperforms using specialized AI for individual design tasks
It's a myth that you need Mythos to find bugs: Open source models can do it just as well
The Register AI · Apr 24 · Relevance: ███████░░░ 7/10
Why it matters: OpenAI's first security hire argues open-source models match Mythos for bug-finding, challenging Anthropic's restricted-access approach. If true, the genie is already out of the bottle — restricting Mythos won't prevent AI-powered vulnerability discovery from becoming ubiquitous.
- RunSybil CEO Ari Herbert-Voss (OpenAI's first security hire) claims open source models match Mythos
- Argues more automated bug finding will improve security without costing jobs
- Challenges the premise behind Anthropic's restricted distribution model
- Presented at Black Hat Asia
• Infrastructure
Google unveils two new TPUs designed for the "agentic era"
Ars Technica AI · Apr 22 · Relevance: ████████░░ 8/10
Why it matters: Google's split of TPU 8 into separate training and inference chips reflects the industry's recognition that agentic workloads have fundamentally different hardware requirements. This dual-chip strategy could reshape AI infrastructure economics as inference costs become the dominant concern.
- Google announced two new TPU chips: one for training, one for inference
- Designed specifically for agentic AI workloads
- Part of Google Cloud Next announcements
- Paired with Arm-based Axion cores, moving away from x86
In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs
TechCrunch AI · Apr 24 · Relevance: ████████░░ 8/10
Why it matters: Meta sourcing millions of Amazon CPUs (not GPUs) for agentic workloads signals a fundamental shift in what hardware AI systems need — inference-heavy agent operations favor different silicon than training. A new chip race beyond NVIDIA GPUs is emerging.
- Meta signed deal for millions of Amazon's homegrown CPUs
- Specifically for AI agentic workloads, not training
- Signals that inference demands different hardware than training
- Represents a new dimension of the chip competition beyond NVIDIA GPUs
NVIDIA and Google infrastructure cuts AI inference costs
AI News · Apr 23 · Relevance: ███████░░░ 7/10
Why it matters: NVIDIA and Google's co-designed A5X bare-metal instances on Vera Rubin NVL72 racks promise 10x lower inference costs, directly addressing the economics bottleneck preventing widespread agentic AI deployment at enterprise scale.
- New A5X bare-metal instances run on NVIDIA Vera Rubin NVL72 rack-scale systems
- Claims up to 10x lower inference costs through hardware-software codesign
- Announced at Google Cloud Next conference
- Targets cost of AI inference at scale
Greenhouse gases from data center boom could outpace entire nations
Ars Technica AI · Apr 23 · Relevance: ███████░░░ 7/10
Why it matters: The environmental footprint of AI infrastructure is reaching nation-state scale, with planned facilities from OpenAI, Meta, xAI, and Microsoft potentially emitting 129M+ tons of CO2 annually. This creates regulatory and reputational risk that could constrain the pace of AI infrastructure buildout.
- Planned data centers from OpenAI, Meta, xAI, and Microsoft could emit over 129 million tons of CO2 annually
- Emissions would outpace those of entire nations
- Highlights tension between AI infrastructure expansion and climate commitments
- Growing pressure for regulatory intervention on AI energy consumption
• Applications
AI-Designed Drugs by a DeepMind Spinoff Are Headed to Human Trials
Wired · Apr 24 · Relevance: ███████░░░ 7/10
Why it matters: Isomorphic Labs moving AI-designed drugs to human trials represents the transition from theoretical AI drug discovery to clinical validation. If successful, this could fundamentally change pharmaceutical R&D timelines and economics.
- Isomorphic Labs (DeepMind spinoff) has built a pipeline of AI-designed new medicines
- Drugs are headed to human clinical trials
- Announced by president Max Jaderberg at WIRED Health in London
- Represents progression from AlphaFold protein structure prediction to actual drug candidates
Further Reading
- • OpenAI unveils GPT-5.5, claims a "new class of intelligence" at double the API price — The Decoder
- • Google pours up to $40 billion into ChatGPT rival Anthropic — The Decoder
- • DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs — The Register AI
- • What Anthropic’s Mythos Means for the Future of Cybersecurity — IEEE Spectrum AI
- • GPT-5.5 tops benchmarks but still hallucinates frequently and costs 20 percent more over the API — The Decoder
- • Anthropic gets $5B investment from Amazon, will use it to buy Amazon chips — Ars Technica AI
- • Mozilla: Anthropic's Mythos found 271 security vulnerabilities in Firefox 150 — Ars Technica AI
- • Google unveils two new TPUs designed for the "agentic era" — Ars Technica AI
- • In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs — TechCrunch AI
- • AI Agent Designs a RISC-V CPU Core From Scratch — IEEE Spectrum AI
- • Cohere takes over Aleph Alpha shortly after the German startup ousted its original founder — The Decoder
- • It's a myth that you need Mythos to find bugs: Open source models can do it just as well — The Register AI
- • NVIDIA and Google infrastructure cuts AI inference costs — AI News
- • AI-Designed Drugs by a DeepMind Spinoff Are Headed to Human Trials — Wired
- • The billion-dollar startup with a different idea for AI — AI News
- • Greenhouse gases from data center boom could outpace entire nations — Ars Technica AI
- • Anthropic admits it dumbed down Claude when trying to make it smarter — The Register AI
Full Transcript
Click to expand full episode transcript
Sam: GPT-5.5 dropped this week, and OpenAI is calling it a new class of intelligence. The more important signal isn't the benchmark numbers — it's that this is their first model architected from the ground up for autonomous, multi-tool agentic operation. The design philosophy shifted, not just the scale.
Priya: Welcome back to AI Revolution. I'm Priya Nair, here with Sam Kim, and this is our Saturday Week in Review for the week ending April 25th, 2026. This was a genuinely dense week, so we're going to synthesize rather than recap. Four themes dominated: the arrival of natively agentic frontier models and what that costs; an extraordinary capital concentration story around Anthropic; a serious inflection point in AI-powered cybersecurity; and a hardware architecture shift that's been building for months and is now becoming impossible to ignore. Let's get into it.
Sam: So let's start with the model releases, because GPT-5.5 is the headline, but the interesting story is architectural. Previous GPT generations were fundamentally completion engines — you gave them a context, they produced output. Tool use and multi-step reasoning were bolted on. GPT-5.5 is designed so that task decomposition, tool selection, and context management across a long-running job are first-class behaviors, not afterthoughts. That's a real difference in how the model is trained and how it reasons about its own action space.
Priya: And the pricing tells you something. API costs roughly doubled. OpenAI's own chief scientist Jakub Pachocki said progress has been "surprisingly slow" recently and promised bigger leaps ahead — which is a fascinating thing to say at a launch event. Read that as: this is a meaningful step, but they know the hard problems aren't solved.
Sam: The unsolved problems are visible in the benchmarks. GPT-5.5 reclaims the top position across major evals, but hallucination rates remain high. And here's why that matters more for agentic systems than it did for chat: when a model hallucinates in a conversation, a human catches it. When an agent hallucinates in step three of a twelve-step workflow and then builds on that mistake for nine more steps, you get compounding error that's genuinely hard to detect. High hallucination rates are a much more serious reliability problem in agentic contexts.
Priya: Which means the doubled price and the benchmark leadership don't automatically translate to enterprise deployment. The reliability gap is still the primary blocker for anything high-stakes.
Sam: Now, while OpenAI was announcing GPT-5.5, the Anthropic funding story was unfolding in parallel, and the numbers are staggering even by recent standards. Google announced up to forty billion dollars in Anthropic. Amazon, which had already pledged twenty-five billion, put in another five-billion-dollar tranche this week specifically earmarked for Amazon's custom silicon — Anthropic is essentially committing to buy Amazon chips with Amazon's own investment. Total capital flowing into Anthropic from just these two hyperscalers is approaching sixty-five billion dollars.
Priya: To put that in context, that's more than the GDP of many mid-sized countries, going to a single AI lab. And the structure here matters. This isn't passive financial investment — it's compute dependency by design. Google gets a strategic stake in a company building on Google infrastructure. Amazon gets a major customer locked into its chip ecosystem. Anthropic gets the resources to compete at the frontier. All three parties are building interlocking dependencies.
Sam: And Cohere acquiring Aleph Alpha this week, with six hundred million from Schwarz Group, is the other side of this story. Aleph Alpha was Germany's bid at sovereign AI capability — a frontier lab that could be trusted with European data and European values. It's now being absorbed by a North American company. The consolidation math is brutal: frontier model training requires capital at a scale that almost nobody can access independently.
Priya: Yann LeCun's AMI Labs raising a billion dollars with twelve employees is worth a mention here too. He's explicitly betting that large language models aren't the path to general intelligence and using serious capital to pursue alternatives. The runway to know if he's right is probably years, but it represents the most prominent institutional bet yet that the current paradigm has a ceiling.
Sam: Let's move to what I think was actually the most technically significant theme of the week: AI and cybersecurity. Anthropic's Mythos model has been out for a couple weeks in limited preview, but this week we got real data on what it actually does. Mozilla ran it against Firefox 150 and it found two hundred and seventy-one security vulnerabilities. The CTO called it a watershed moment and said the model is "every bit as capable" as the world's best security researchers.
Priya: Let's be precise about what's technically happening here, because it matters. Vulnerability discovery has historically required a human who deeply understands a codebase, can hold a mental model of how components interact, and can reason about edge cases in memory management, type handling, input validation — the kinds of subtle flaws that don't show up in static analysis. What Mythos appears to be doing is autonomous reasoning over code at a scale and speed that human researchers can't match. Two hundred and seventy-one vulnerabilities in a mature, heavily audited codebase is not a small number.
Sam: And crucially, Anthropic is restricting access — only a limited number of companies can use it. That restriction is exactly what the security community is arguing about. The counterargument came this week from Ari Herbert-Voss, who was OpenAI's first security hire and now runs an AI security startup. His position, presented at Black Hat Asia: open-source models can find bugs just as effectively as Mythos. If that's true, and it's a meaningful if, then Anthropic's restricted distribution model doesn't actually contain the capability — it just slows the most responsible deployment of it.
Priya: This is genuinely uncertain territory. The honest answer is we don't have enough independent data to know whether the open-source models are actually at parity or whether Herbert-Voss is extrapolating from a narrower set of tests. But the policy question is real regardless: if the capability is becoming widely available anyway, is restriction achieving safety goals or just competitive ones?
Sam: The hardware story this week deserves its own segment because several things happened simultaneously and they're all pointing the same direction. Google announced two new TPU chips — one for training, one for inference. They've split what used to be a single chip architecture into purpose-built silicon for two fundamentally different workload profiles. And they paired it with Arm-based Axion cores, moving away from x86.
Priya: The reason training and inference have divergent hardware needs comes down to arithmetic intensity and memory access patterns. Training is compute-bound — you want maximum throughput on matrix multiplications, you're tolerating high memory bandwidth requirements. Inference, especially for agentic workloads, is often memory-bandwidth-bound. You're serving many concurrent requests, you need fast token generation, and the optimal silicon is quite different. Google splitting these into dedicated chips is an acknowledgment that the inference cost problem needs hardware-level solutions, not just software optimization.
Sam: Meta's deal for millions of Amazon CPUs — not GPUs — for agentic workloads is the same signal from a different angle. The inference-dominated economics of running agents at scale favor different silicon than what you use for training runs. And NVIDIA and Google announced co-designed infrastructure at Cloud Next claiming ten-times lower inference costs through hardware-software codesign on Vera Rubin NVL72 rack-scale systems. That ten-times number is aggressive, but the direction of travel is clear: inference cost reduction is the critical path to making agentic AI economically viable at scale.
Priya: There's also an AI-designed chip story worth flagging. Verkor.io published results this week showing their agentic AI system designed a complete RISC-V CPU core from scratch — not AI-assisted human design, but end-to-end autonomous design. The VerCore hits 1.5 gigahertz, which is roughly 2011-era laptop performance. Not state of the art, but an entire working CPU designed by an AI system is a different category of result than helping write HDL code. The finding that end-to-end agentic design outperformed using specialized AI tools for individual subtasks is worth paying attention to — it's consistent with what we're seeing in software as well.
Priya: We should also quickly note that all of this infrastructure expansion has an environmental cost that's reaching a scale that's hard to ignore. Planned data centers from OpenAI, Meta, xAI, and Microsoft are projected to emit over a hundred and twenty-nine million tons of CO2 annually — comparable to entire mid-sized nations. That's starting to attract serious regulatory attention and it's a real constraint on how fast this buildout can proceed.
Sam: And before we wrap — DeepSeek V4 dropped this week with open weights, claiming performance rivaling frontier proprietary models at dramatically reduced inference costs, and with extended support for Huawei's Ascend NPUs. The Huawei piece is geopolitically significant. US chip export controls were designed to limit China's access to frontier AI compute. A fully capable Chinese AI model running efficiently on Chinese hardware is a direct test of whether that strategy is working.
Priya: So stepping back — what does this week mean? The convergence I keep coming back to is that agentic AI is no longer an architectural aspiration, it's the design center. GPT-5.5, the hardware splits at Google and Meta, the inference cost focus, even the cybersecurity story — it all assumes models that run extended autonomous tasks, not single-turn interactions. The infrastructure is being rebuilt around that assumption right now.
Sam: What I'm watching is the reliability question. The hallucination problem that persists in GPT-5.5 is not a minor footnote — it's the thing that determines whether agentic systems can be deployed in production for anything that matters. The hardware and capital questions are largely solved by throwing money at them. The reliability question requires fundamental progress in how these models reason and verify their own outputs. That's what I want to see evidence of in the next wave of releases.
Priya: And the Anthropic cybersecurity story is going to continue developing. Two hundred and seventy-one vulnerabilities in Firefox is the kind of result that shifts how security teams think about their tooling. Whether that capability stays restricted or becomes broadly available — through Mythos, through open-source equivalents, or both — is one of the most consequential near-term questions in applied AI.
Sam: A genuinely significant week. A lot of pieces moving at once, and most of them in the same direction.
Priya: That's the Week in Review for April 25th, 2026. We cover the daily stories with the same depth on AI Revolution every weekday — subscribe wherever you get your podcasts. We'll see you Monday.
AI Revolution is an automated daily podcast covering AI advancements. Generated 2026-04-25.
Sources: MIT Technology Review, VentureBeat AI, The Verge, Wired, TechCrunch AI, Ars Technica, IEEE Spectrum, The Decoder, The Gradient, Hugging Face Blog, Google AI Blog, AI News, SemiAnalysis, and The Register.