1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
| ---
title: "Anthropic Ships Sonnet 4.6 With 1M Context Window"
date: 2026-02-19
author: "Digital Frontier"
draft: false
categories: ["Technical"]
tags: ["anthropic", "claude", "sonnet", "model releases", "benchmarks"]
description: "Anthropic releases Claude Sonnet 4.6 with doubled context window, improved coding benchmarks, and a notable 60.4% ARC-AGI-2 score."
summary: "Claude Sonnet 4.6 ships as the new default for Free and Pro users, doubling the context window to 1M tokens. Benchmark results show strong gains in coding and computer use, with a 60.4% ARC-AGI-2 score trailing only Opus 4.6, Gemini 3 Deep Think, and a refined GPT 5.2 variant."
article:
type: "analysis"
technologies: ["Claude Sonnet 4.6", "Anthropic API", "OpenClaw"]
keywords: ["claude sonnet 4.6", "anthropic model release", "1 million context window", "arc-agi-2", "swe-bench", "claude coding", "anthropic update cycle"]
---
Anthropic released Claude Sonnet 4.6 on February 17, continuing the company's roughly four-month update cadence for its midsized model line. The release lands two weeks after Opus 4.6 and makes Sonnet 4.6 the default model for both Free and Pro plan users.
The headline change: a 1 million token context window in beta, doubling the previous maximum for Sonnet. Anthropic positions this as sufficient for entire codebases, lengthy contracts, or dozens of research papers in a single request. Improvements target three areas โ coding, instruction-following, and computer use.
## Benchmarks
Sonnet 4.6 posts new records on several evaluations:
| Benchmark | Domain | Sonnet 4.6 | Position |
|-----------|--------|------------|----------|
| SWE-Bench | Software Engineering | Record | Top of class |
| OS World | Computer Use | Record | Top of class |
| ARC-AGI-2 | Human-like Intelligence | 60.4% | Trails Opus 4.6, Gemini 3 Deep Think, refined GPT 5.2 |
The ARC-AGI-2 result is the most telling. At 60.4%, Sonnet 4.6 outperforms most comparable midsized models but remains behind the flagship tier โ Opus 4.6, Gemini 3 Deep Think, and a tuned GPT 5.2 variant all score higher.
## Update Cycle Context
Anthropic's release cadence has settled into a predictable pattern. Opus 4.6 shipped February 5 with agent teams support. Sonnet 4.6 follows 12 days later. An updated Haiku model is likely next, completing the trio within a few weeks.
For infrastructure operators using model aliases (e.g., `sonnet` โ `anthropic/claude-sonnet-4-6`), this is a drop-in upgrade. The 1M context window in beta may require testing for applications that push context limits, particularly around latency and cost at high token counts.
## Practical Implications
- **Coding agents:** SWE-Bench records suggest measurable improvement for automated code repair and generation workflows.
- **Computer use:** OS World scores indicate better reliability for browser and desktop automation tasks.
- **Context-heavy workloads:** 1M tokens opens use cases previously requiring chunking or retrieval โ full repo analysis, multi-document legal review, long-form research synthesis.
- **Cost:** Pricing not yet detailed. Historically, Sonnet occupies the mid-tier price point between Haiku and Opus.
## References
1. [Anthropic announcement](https://www.anthropic.com/news/claude-sonnet-4-6)
2. [TechCrunch coverage](https://techcrunch.com/2026/02/17/anthropic-releases-sonnet-4-6/)
3. [Opus 4.6 release](https://techcrunch.com/2026/02/05/anthropic-releases-opus-4-6-with-new-agent-teams/)
|