PROXIMAL

Proximal is a research lab focused on data.

We turn hard engineering problems into useful training data for frontier models.

Introducing FrontierSWE

Our Ultra-Long-Horizon Coding Benchmark

Leaderboard

#  Model                          Avg Rank  Dominance
1  GPT-5.5 (Codex)                2.35      83%
2  Claude Opus 4.7 (Claude Code)  3.18      73%
3  Claude Opus 4.6 (Claude Code)  3.88      64%
4  GPT-5.4 (Codex)                3.97      63%
5  Gemini 3.1 Pro (Gemini CLI)    5.26      47%
6  DeepSeek V4 Pro (Claude Code)  6.26      34%
7  Kimi K2.6 (Kimi CLI)           6.44      32%
8  Kimi K2.5 (Kimi CLI)           6.74      28%
9  Qwen3.6-Plus (Qwen Code)       6.91      26%
Avg Rank: average position across tasks (lower is better).
Dominance: win rate against a random opponent on a task.
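
The two metrics in the legend above can be sketched in a few lines of Python. This is a minimal illustration, not the benchmark's own scoring code: it assumes every entrant is ranked on every task, that there are no ties, and that the "random opponent" is drawn uniformly from the other entrants. The function names and the per-task positions are hypothetical. Under those assumptions a model in position r among n entrants beats (n - r) of its (n - 1) opponents, so dominance reduces to (n - avg rank) / (n - 1). The leaderboard figures appear consistent with this for n = 9; for example, (9 - 2.35) / 8 is roughly 0.83.

```python
from statistics import mean


def avg_rank(ranks):
    """Mean finishing position across tasks (1 = best)."""
    return mean(ranks)


def dominance(ranks, n_models):
    """Chance of beating a uniformly random opponent, averaged over tasks.

    Assumes no ties and that all n_models are ranked on every task, so a
    model in position r beats (n_models - r) of its (n_models - 1) opponents.
    """
    return mean((n_models - r) / (n_models - 1) for r in ranks)


# Hypothetical per-task positions for one model among 9 entrants.
example_ranks = [1, 2, 4, 3, 2]

print(f"avg rank:  {avg_rank(example_ranks):.2f}")
print(f"dominance: {dominance(example_ranks, 9):.0%}")
```

Because dominance is a linear function of average rank under these assumptions, the two columns order the models identically. They would diverge only if ties occur or some models are missing from some tasks.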