AI Weekly #22: The Trillion-Dollar Era Begins, Agent Coding Becomes the New Battleground

I. Funding & Industry Landscape: The Trillion-Dollar Club Expands

Anthropic's $965B Valuation: AI's "Apple Moment"

Anthropic completed a $65B Series H round led by Altimeter Capital, reaching a post-money valuation of $965B with annualized revenue exceeding $47B. This valuation approaches the world's top 10 tech companies by market cap. Anthropic's revenue growth is primarily driven by enterprise deployments — Claude's paid usage in coding agent scenarios has surged, with enterprise plans shifting from fixed discounts to usage-based billing.

DeepSeek's STAR Market IPO: China's AI Capitalization Accelerates

DeepSeek plans to file for a STAR Market IPO immediately after completing approximately $50B in funding — the first Chinese AI foundation model company to clearly define an IPO path. Combined with Alibaba Cloud being named the leader in agent AI by Omdia and Qwen3.7-Max topping OpenRouter's popularity chart, Chinese AI's global presence continues to rise.

OpenRouter's $113M Series B: Model Aggregation Layer Validated

OpenRouter secured a Series B led by CapitalG, confirming that "model routing" as a middleware layer has become essential. As model count explodes, developers need a unified gateway to switch, compare, and reduce costs.

II. Model Releases: Agent Capabilities as Core Selling Point

Claude Opus 4.8: Incremental Upgrades, Coding is King

Anthropic released Claude Opus 4.8, positioned as an upgrade to 4.7 with improvements in coding, agent skills, and reasoning at the same price. The simultaneously launched "Dynamic Workflows" feature enables Claude Code to run dozens to hundreds of sub-agents in parallel within a single session, handling cross-codebase bug hunting and large-scale migrations.

Grok Build 0.1: xAI Enters the Coding Agent Arena

xAI's Grok Build is specifically trained for agent coding tasks, supporting web development, debugging, and MCP with inference speeds exceeding 100 tok/s. This marks xAI's move beyond chat models into direct competition with Cursor/Claude Code.

Step 3.7 Flash: Chinese Models Take the Efficiency Route

Step 3.7 Flash (198B MoE) ranked first in ClawEval and SimpleVQA Search benchmarks, focusing on agent workflow efficiency. Chinese models are shifting from "catching up on general capabilities" to "optimizing for specific scenarios."

III. Products & Tools: Agent Ecosystem Taking Shape

This week's product launches reveal a clear trend: AI is moving from "chat box" to "workbench."

Replit Canvas: Agent design tool, moving design work from chat to canvas
Perplexity Computer: Integrates Microsoft Office suite, invoking AI directly in Excel/Word/PPT
Data Formulator: Microsoft's enterprise data AI analysis tool
Google Pay MCP Server: Connecting AI dev assistants directly to payment APIs
Alibaba Cloud Bailian CLI: Open-source Agent with full model and application capabilities

MCP (Model Context Protocol) is becoming the de facto standard for connecting AI with external tools.

IV. Research Frontiers: New Paradigms for Agent Training & Evaluation

NVIDIA Polar Framework: Breakthrough in Agent RL Training

NVIDIA open-sourced Polar, an agent reinforcement learning framework that doesn't require rewriting existing agent frameworks (Codex CLI, Claude Code, etc.), connecting to GRPO training by placing agents at model API boundaries. Experiments show Qwen-based models improved Codex benchmark scores by 594.74%.

SIA Framework: AI Recursive Self-Improvement

hexoai's open-source SIA (Self-Improving AI) framework demonstrates that AI agents can not only optimize external workflows but also directly update their own model weights through task feedback — another proof of "AI training AI."

KPop: Stable RL Training for Large-Scale MoE Models

The KPop method pushed Ring-2.6-1T past 76 on SWE-bench Verified, using adaptive masking to replace fixed-ratio masks, solving stability issues in large-scale MoE model RL training.

V. Trend Signals

Coding Agent PMF Confirmed: Cursor reports developer weekly code output grew from 3.6K to 8.6K lines, AI code retention rates continue improving
Safety Governance Catching Up: OpenAI released "Frontier Governance Framework" aligned with EU and California regulations, ITBench shows frontier models still score below 50% on enterprise IT tasks
Hardware Race Intensifies: Samsung samples HBM4E, NVIDIA invests ~$150B annually in Taiwan, Huawei releases new Kirin chip
Prompt Engineering Goes Scientific: FaceMind proposes Adam's Law (text frequency law), providing quantifiable theoretical basis for prompt optimization

Key Numbers This Week

Metric	Value	Significance
Anthropic Valuation	$965B	Approaching trillion, first AI company at this level
Anthropic Annual Revenue	$47B	Enterprise AI payments at scale
DeepSeek Funding Valuation	$50B	Among China's highest AI unicorn valuations
Cognition Annual Revenue	$4.92B	Agent lab commercialization validated
Polar Framework Improvement	594.74%	Massive potential of agent RL training
Developer Weekly Code Output	8.6K lines	Productivity doubled with AI assistance

Overall Analysis

The most significant signal in AI this week: capital scale and technical competition have simultaneously entered a new magnitude.

From the capital side, Anthropic's $965B valuation, DeepSeek's $50B IPO push, OpenRouter's $113M Series B — the market is no longer debating "whether AI has value" but competing for "who becomes the next trillion-dollar platform." Anthropic and OpenAI simultaneously shifting from "subsidized acquisition" to "usage-based billing" indicates coding agent usage has grown large enough to support massive revenue — PMF confirmed.

From the technology side, focus has shifted from "bigger, stronger models" to "how agents land in production." Claude Opus 4.8's Dynamic Workflows, NVIDIA Polar's 594% improvement, SIA's recursive self-improvement — these advances point in the same direction: AI coding capabilities can continuously evolve through automated RL training, no longer dependent on human annotation. Anthropic's strategy is particularly clear: no pursuit of shocking launches, but high-frequency iteration, with each upgrade focused on "making daily developer use smoother."

From the product side, "embedded AI" has become mainstream — not making users find AI, but making AI appear in existing workflows. For developers, API costs will become an increasingly important selection factor, and model aggregation platforms (like OpenRouter) are being validated by capital for this reason.

In one sentence: In Q2 2026, the AI industry has officially moved from "technology validation" to "commercial scaling," with coding agents as the first confirmed killer application.

Next Week Outlook

Google I/O 2026 follow-up product launches (Gemini Omni, Gemini 3.5 Flash)
Kling AI showcases 4K original short films at AI on the Lot, AI filmmaking enters new phase
Watch DeepSeek IPO progress and Anthropic's post-funding product moves