Claude Sonnet 5 is launching at $2 per million input tokens and $10 per million output tokens through August 31, 2026, giving Anthropic a cheaper way to sell agentic AI without forcing every workflow onto its flagship Opus model.

Claude Sonnet 5 Slashes AI Agent Costs for Developers
XOOMAR Intelligence
Analyst Take
The new model is Anthropic’s midsize answer to the rising cost of AI agents that browse, code, call tools, and run multi-step tasks with less human steering, according to TechCrunch. Anthropic is pitching Claude Sonnet 5 as close to Claude Opus 4.8 on performance, but at a lower price point for developers and enterprises.
“It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models,” Anthropic said in its launch post.
Anthropic launches Claude Sonnet 5 for cheaper AI agents
Claude Sonnet 5 is available starting June 30, 2026, across Anthropic’s plans. It becomes the default model for Free and Pro users, while Max, Team, and Enterprise customers also get access.
Anthropic says the model is available in Claude Code, on the Claude Platform, and through the Claude API under the model name claude-sonnet-5, according to Anthropic. That matters because the launch is aimed at developers building agents, not just consumers chatting in the Claude app.
The pricing is the sharper move. Anthropic’s own announcement lists introductory API pricing at $2 per million input tokens and $10 per million output tokens through August 31, 2026. After that, it rises to $3 per million input tokens and $15 per million output tokens.
| Model | Input price | Output price | Positioning from supplied sources |
|---|---|---|---|
| Claude Sonnet 5 | $2 per million tokens through Aug. 31, then $3 | $10 per million tokens through Aug. 31, then $15 | Lower-cost agentic model close to Opus 4.8 |
| Claude Opus 4.8 | $5 per million tokens | $25 per million tokens | Higher-accuracy flagship option |
| Gemini 3.5 Flash | Not supplied | Not supplied | Cheaper than Sonnet 5, per source material |
| GPT-5.5 and Gemini 3.1 Pro | Not supplied | Not supplied | Sonnet 5 is cheaper, per source material |
Anthropic says Sonnet 5 improves on Sonnet 4.6, which was released in February, across reasoning, tool use, software coding, and knowledge work. On one agentic coding benchmark, Sonnet 5 scored 63.2%, compared with 69.2% for Opus 4.8 and 58.1% for Sonnet 4.6.
The more interesting claim is in knowledge work. Source material says Sonnet 5 slightly outperformed Opus 4.8 on one knowledge-work benchmark, even though Opus remains Anthropic’s preferred model for higher accuracy on harder tasks.
Claude Sonnet 5 targets the cost problem inside long-running agents
AI agents don’t just answer once. They plan, inspect files, browse data, call tools, write code, check results, and sometimes retry when they fail. That sequence can burn through tokens fast.
XOOMAR analysis: Sonnet 5 is Anthropic’s bid to make those repeat workflows cheaper without dropping all the way to a lightweight model tier. The company is not saying Sonnet 5 beats Opus 4.8 across the board. It is saying the gap has narrowed enough that developers can pick cost and performance more deliberately.
Anthropic frames the choice this way: Opus 4.8 remains the model for higher accuracy, while Sonnet 5 offers a lower-priced option that is “of much higher quality than what was previously available.” In practical terms, that gives teams a reason to reserve Opus for the hardest cases and run more routine agent work through Sonnet.
OpenAI and Google are pushing the same broad theme. TechCrunch notes that OpenAI’s GPT-5.6 Sol launched in preview last week as its most agentic model yet, while Google’s Gemini 3.5 Flash, launched in May, was pitched around planning, building, and iterating with minimal human input.
That puts pressure on price, not just benchmark scores. If agentic behavior is now expected across model tiers, the contest shifts to how reliably these systems can complete work without making the economics painful.
For related XOOMAR coverage on the economics and product tradeoffs around AI deployment, see AI Token Costs Threaten to Break Cybersecurity Budgets and Free Gemini AI Image Generation Mines Your Google Data.
Improved safety gives Anthropic a sharper enterprise pitch for Claude Sonnet 5
Anthropic is also selling Claude Sonnet 5 as safer for agentic use than Sonnet 4.6. The company says the new model has a lower rate of “undesirable behaviors,” including cooperation with misuse and deception.
The model is also described as better at refusing malicious requests and handling prompt-injection attacks. Those details matter more for agents than for simple chatbots because agents can touch software, change files, use terminals, and act across connected tools.
Anthropic says Sonnet 5 hallucinates less and shows less sycophantic behavior than Sonnet 4.6. It also says the model has “a much lower ability to perform dangerous cybersecurity tasks” than current Opus models.
There is a ceiling to that safety claim. Source material says Sonnet 5 is not at the same level as Opus 4.8 and Claude Mythos Preview when it comes to misaligned behavior.
Lovable co-founder Fabian Hedin said in a statement that Claude Sonnet 5 “refuses unsafe requests cleanly and consistently.”
“At Lovable, we’re putting powerful tools in the hands of millions of builders,” Hedin said. “A model that knows when to say no is just as important as one that knows how to build.”
Claude Sonnet 5 now faces live trials against GPT-5.5 and Gemini Pro
Early testers cited by Anthropic said Sonnet 5 finishes complex tasks that previous versions left incomplete. Daniel Shepard, a senior engineer at Zapier, said the model completed a two-part job involving Salesforce account tiers and a launch announcement to enterprise contacts, a task that previously stalled halfway.
The supplied materials do not show independent production data on failure rates, latency, or cost per completed workflow. That is the next test.
XOOMAR analysis: Developers will likely judge Claude Sonnet 5 against the exact claims Anthropic is making: agentic coding, tool use, knowledge work, refusal behavior, and whether the model can recover when a multi-step task goes off track. Benchmark gains help, but agents live or die in messy workflows.
The immediate pressure now falls on GPT-5.5, Gemini Pro, and cheaper model tiers that want the same developer workloads. Anthropic’s bet is clear: the next AI agent race won’t be won only by the smartest model. It will be won by the model companies can afford to keep running.
The Bottom Line
- Claude Sonnet 5 gives developers a lower-cost option for running agentic AI workflows.
- The model is positioned as close to Claude Opus 4.8 in performance without requiring flagship-model spending.
- Its pricing jump after August 31, 2026 gives enterprises a limited window to test costs at introductory rates.
Claude Sonnet 5 API Pricing
| Pricing period | Input price | Output price |
|---|---|---|
| Through August 31, 2026 | $2 per million tokens | $10 per million tokens |
| After August 31, 2026 | $3 per million tokens | $15 per million tokens |
Claude Sonnet 5 Token Pricing
Sources
Written by
XOOMAR Insights Team
Research and Editorial Desk
The XOOMAR Insights Team pairs automated research with human editorial judgment. We track hundreds of sources across technology, fintech, trading, SaaS, and cybersecurity, cross-check the facts, and explain what happened, why it matters, and what to watch next. We do not just rewrite headlines. Every article is fact-checked and scored for reliability before it goes live, and we link back to the original sources so you can verify anything yourself.
Explore More Topics
Related Articles
Technology$50M Raise Arms Patronus for AI Agent Testing Boom
Patronus AI raised $50M to test agents in simulated worlds as trust becomes AI’s next battleground.
TechnologyElite Researchers Bolt Google AI for OpenAI, Anthropic
Gemini and AlphaFold veterans are leaving Google for OpenAI and Anthropic, turning AI talent into an IPO-era power play.
Technology1,000 Skills Push Acti AI Keyboard Beyond Autocomplete
Acti wants the smartphone keyboard to become mobile AI’s action layer, with 1,000 early Skills pointing to real user behavior.
TechnologyTidal Cuts AI Music Royalties in Crackdown on Fake Songs
Tidal will stop paying royalties on fully AI-generated tracks, targeting fake songs, fraud, and artist impersonation.
TechnologyAI Music Loses Royalties under Tidal's New Pay Rules
Tidal will keep fully AI-generated tracks online, but it won't pay them royalties. Listener labels arrive July 15.
CybersecurityAI Token Costs Threaten to Break Cybersecurity Budgets
Palo Alto Networks spent over $1 million testing Claude, showing agentic AI can expose flaws while blowing up SOC budgets.
FintechOKX AI Turns Crypto Wallets Into Agent Payroll Machines
OKX AI opens agent payments to developers, betting wallets can become payroll accounts for autonomous software.
Global TrendsSupreme Court Throws Assault Weapons Bans Into Peril
The Supreme Court will test whether AR-15-style rifle bans in about a dozen states can survive the Second Amendment.
FintechSVB FDIC Trial Throws $1.7 Billion Into Legal Fire
SVB’s trust wants $1.7 billion back. The FDIC says executives’ pre-collapse conduct should keep the money out of reach.
Global Trends14 Children Die as Lahore Roof Collapse Crushes Class
A roof collapse at a Lahore tuition centre killed 14 children as construction continued above classrooms.
Don't miss the signal
Get our weekly roundup of the stories that matter across tech, fintech, and trading. No noise, just signal.
Free forever. No spam. Unsubscribe anytime.