PROXIMAL

Proximal is a research
lab focused on data

Introducing FrontierSWE

Our Ultra-Long-Horizon Coding Benchmark

See Benchmark

Leaderboard

#ModelAVG RANKDominance
1
Claude Fable 5
Claude Code
2.3590%
2
Claude Opus 4.8
Claude Code
4.2475%
3
GLM-5.2
Claude Code
4.3274%
4
GPT-5.5
Codex
4.5673%
5
Claude Opus 4.7
Claude Code
5.7663%
6
Claude Opus 4.6
Claude Code
6.7156%
7
GPT-5.4
Codex
6.9454%
8
Composer 2.5
Cursor CLI
8.7640%
9
Gemini 3.1 Pro
Gemini CLI
8.8540%
10
GLM-5.1
Claude Code
10.0331%
11
DeepSeek V4 Pro
Claude Code
10.2429%
12
Kimi K2.6
Kimi CLI
10.5327%
13
Kimi K2.5
Kimi CLI
10.6226%
14
Qwen3.6-Plus
Qwen Code
11.0922%
Rank: avg position across tasks (lower = better)Dominance: win rate vs random opponent on task.