News/Anthropic, Business Standard, Android Headlines, Galaxy.ai

Anthropic Launches Claude Sonnet 4.6 With 1M Token Context Window and Flagship-Level Performance at Budget Pricing

VirtualAssistantVA Research Team·

Anthropic has launched Claude Sonnet 4.6, the most capable model in its Sonnet line, delivering performance that previously required its flagship Opus-class models - at a fraction of the cost. The release marks a significant shift in how businesses can access top-tier AI capabilities without enterprise-level pricing.

Sonnet 4.6 is now the default model for Free and Pro users on claude.ai and Claude Cowork, signaling Anthropic's strategy to make high-performance AI broadly accessible across its platform.

Key Performance Benchmarks

The headline number is a 72.5% score on benchmarks designed to test how well AI navigates web and desktop applications - a meaningful jump from 61.4% achieved by its predecessor. This improvement reflects real-world capability gains, not just synthetic benchmark tuning.

Metric Sonnet 4.5 Sonnet 4.6 Change
App Navigation Benchmark 61.4% 72.5% +18.1%
Max Output Tokens 64k 64k Same
Context Window 200k 1M (beta) +5x
Input Pricing $3/M tokens $3/M tokens Same
Output Pricing $15/M tokens $15/M tokens Same

For coding tasks specifically, early-access developers preferred Sonnet 4.6 to its predecessor by a wide margin, with improvements in consistency, instruction following, and overall reliability.

What Makes Sonnet 4.6 Different

Opus-Class Performance at Sonnet Pricing

The most significant development is not any single feature but the overall performance tier. Tasks that previously required reaching for an Opus-class model - including economically valuable office work, complex reasoning chains, and multi-step agent planning - are now achievable with Sonnet 4.6. This effectively compresses the price-performance gap between Anthropic's model tiers.

1M Token Context Window

Both Opus 4.6 and Sonnet 4.6 now support a 1 million token context window, currently in beta. For context, 1M tokens translates to roughly 750,000 words - enough to process entire codebases, lengthy legal contracts, or months of business communications in a single prompt.

This capability is particularly relevant for enterprise use cases involving:

  • Full codebase analysis - reviewing entire repositories without chunking
  • Document processing - ingesting complete regulatory filings or contract sets
  • Business intelligence - analyzing large datasets of customer interactions

Extended Thinking and Agent Planning

Sonnet 4.6 supports extended thinking alongside all existing Claude API features, enabling the model to work through complex multi-step problems before generating responses. Combined with improved agent planning capabilities, this makes the model significantly more effective at autonomous task execution.

Enterprise Implications

Cost-Performance Equation

The pricing structure - unchanged from Sonnet 4.5 at $3 per million input tokens and $15 per million output tokens - means enterprises can now access what is functionally flagship performance without upgrading to Opus pricing. For companies running large-scale automation, customer support, or document processing workflows, this represents substantial cost savings.

Competitive Positioning

Anthropic's release puts additional pressure on OpenAI and Google to deliver comparable price-performance ratios. The trend toward making high-capability models affordable accelerates the timeline for widespread enterprise AI adoption.

Developer Adoption

The coding improvements are particularly noteworthy. Developers report better instruction following and consistency - two pain points that have historically limited AI pair programming tools. With Sonnet 4.6 as the default model, the barrier to entry for AI-assisted development drops significantly.

Market Context

The release comes as the enterprise AI market continues its rapid expansion, with global AI spending forecast to exceed $300 billion in 2026. Anthropic's strategy of delivering flagship-level capabilities at mid-tier pricing positions it to capture a larger share of the enterprise market, particularly among organizations that have been cost-constrained in their AI adoption.

The 1M context window also addresses one of the most requested enterprise features - the ability to process large documents and datasets without splitting them across multiple API calls. This reduces engineering complexity and improves output quality for workflows that depend on full-context understanding.

What This Means for Virtual Assistant Services

Claude Sonnet 4.6 has direct implications for virtual assistant services and the businesses that rely on them. The 1M token context window means virtual assistants can now use AI tools that process entire client portfolios, full project histories, or complete business documentation sets in a single interaction - dramatically improving the quality and speed of research, analysis, and reporting tasks.

For businesses considering virtual assistant support, the cost-performance improvements in models like Sonnet 4.6 mean that AI-augmented VA services are becoming more powerful without proportional cost increases. Tasks that once required hours of manual research or analysis can be completed faster, with AI handling the heavy lifting while human VAs provide the strategic judgment, client relationship management, and quality assurance that remain distinctly human capabilities.

The compression of the price-performance gap also means smaller businesses can now access the same caliber of AI-powered assistance that was previously available only to enterprise clients - leveling the playing field for companies looking to scale operations efficiently.


Explore how businesses use virtual assistant services to delegate tasks and scale operations.

See our guide on hiring a virtual assistant to get started.