Showing 131 of 131 submissions
| # | Name | Resolvedโ | % | Model | Org | Open | Date | Links |
|---|---|---|---|---|---|---|---|---|
| 1 | TRAE + Doubao-Seed-Code | 394 | 78.8% | Doubao | ByteDance | S | 9/28/2025 | |
| 2 | live-SWE-agent + Gemini 3 Pro Preview (2025-11-18) | 387 | 77.4% | Gemini | UIUC | S | 11/20/2025 | |
| 3 | Atlassian Rovo Dev (2025-09-02) | 384 | 76.8% | Claude | Atlassian | 9/2/2025 | ||
| 4 | EPAM AI/Run Developer Agent v20250719 + Claude 4 Sonnet | 384 | 76.8% | Claude | EPAM Systems, Inc. | 8/4/2025 | ||
| 5 | ACoder | 382 | 76.4% | Claude | ACoder | 8/19/2025 | ||
| 6 | Warp | 378 | 75.6% | GPT | Warp | 9/1/2025 | ||
| 7 | TRAE | 376 | 75.2% | Claude | TRAE | S | 6/12/2025 | |
| 8 | Sonar Foundation Agent + Claude 4.5 Sonnet | 374 | 74.8% | Claude | Sonar | 11/3/2025 | ||
| 9 | Harness AI | 374 | 74.8% | Claude | Harness | 7/31/2025 | ||
| 10 | JoyCode | 373 | 74.6% | Claude | JoyCode | S | 9/15/2025 | |
| 11 | Lingxi-v1.5_claude-4-sonnet-20250514 | 373 | 74.6% | Claude | Lingxi | S | 7/20/2025 | |
| 12 | Prometheus-v1.2.1 + GPT-5 | 372 | 74.4% | GPT | EuniAI, Delysium | S | 10/15/2025 | |
| 13 | Refact.ai Agent | 372 | 74.4% | Claude | Refact.ai | S | 6/3/2025 | |
| 14 | Salesforce AI Research SAGE (OpenHands) | 369 | 73.8% | Claude | Salesforce AI Research | 11/3/2025 | ||
| 15 | Tools + Claude 4 Opus (2025-05-22) | 366 | 73.2% | Claude | โ | 5/22/2025 | ||
| 16 | Salesforce AI Research SAGE (bash-only) | 365 | 73.0% | Claude | Salesforce AI Research | 10/21/2025 | ||
| 17 | Tools + Claude 4 Sonnet (2025-05-22) | 362 | 72.4% | Claude | โ | 5/22/2025 | ||
| 18 | OpenHands + GPT-5 | 359 | 71.8% | GPT | OpenHands | Sโ | 8/7/2025 | |
| 19 | Lingxi v1.5 x Kimi K2 | 356 | 71.2% | Kimi | Huawei | WS | 10/14/2025 | |
| 20 | Prometheus-v1.2 + GPT-5 | 356 | 71.2% | GPT | EuniAI, Delysium | S | 9/29/2025 | |
| 21 | Qodo Command | 356 | 71.2% | Claude | Qodo | 7/15/2025 | ||
| 22 | Bloop | 356 | 71.2% | Claude | Bloop AI | 7/10/2025 | ||
| 23 | Warp | 355 | 71.0% | Claude | Warp | 6/23/2025 | ||
| 24 | Moatless Tools + Claude 4 Sonnet | 354 | 70.8% | Claude | Moatless AI | Sโ | 6/11/2025 | |
| 25 | TRAE | 353 | 70.6% | Other | โ | 5/19/2025 | ||
| 26 | Augment Agent v1 | 352 | 70.4% | Other | โ | S | 6/10/2025 | |
| 27 | OpenHands + Claude 4 Sonnet | 352 | 70.4% | Claude | OpenHands | Sโ | 5/24/2025 | |
| 28 | Refact.ai Agent | 352 | 70.4% | Claude | Refact.ai | S | 5/15/2025 | |
| 29 | devlo | 351 | 70.2% | Claude | devlo | 5/19/2025 | ||
| 30 | Zencoder (2025-04-30) | 350 | 70.0% | Claude | โ | 4/30/2025 | ||
| 31 | OpenHands + Qwen3-Coder-480B-A35B-Instruct | 348 | 69.6% | Qwen | Qwen, OpenHands | WS | 8/5/2025 | |
| 32 | GLM-4.6 | 341 | 68.2% | GLM | Z.ai | WS | 9/30/2025 | |
| 33 | Nemotron-CORTEXA | 341 | 68.2% | Other | NVIDIA | S | 5/16/2025 | |
| 34 | SWE-agent + Claude 4 Sonnet | 333 | 66.6% | Claude | SWE-agent | Sโ | 5/22/2025 | |
| 35 | Aime-coder v1 + Anthopic Claude 3.7 Sonnet | 332 | 66.4% | Claude | ByteDance DevInfra | 5/14/2025 | ||
| 36 | OpenHands | 329 | 65.8% | Other | OpenHands | Sโ | 4/15/2025 | |
| 37 | OpenHands + Kimi K2 | 327 | 65.4% | Kimi | OpenHands | WSโ | 7/16/2025 | |
| 38 | Amazon Q Developer Agent (v20250405-dev) | 327 | 65.4% | Other | AWS | 4/5/2025 | ||
| 39 | Augment Agent v0 | 327 | 65.4% | Other | โ | S | 3/16/2025 | |
| 40 | PatchPilot-v1.1 | 323 | 64.6% | GPT | โ | S | 5/3/2025 | |
| 41 | W&B Programmer O1 crosscheck5 | 323 | 64.6% | GPT | โ | 1/17/2025 | ||
| 42 | GLM-4.5 | 321 | 64.2% | GLM | Z.ai | WS | 7/28/2025 | |
| 43 | AgentScope | 317 | 63.4% | Claude | Alibaba | 2/6/2025 | ||
| 44 | Tools + Claude 3.7 Sonnet (2025-02-24) | 316 | 63.2% | Claude | Anthropic | 2/24/2025 | ||
| 45 | EPAM AI/Run Developer Agent v20250219 + Anthopic Claude 3.5 Sonnet | 314 | 62.8% | Claude | โ | 2/28/2025 | ||
| 46 | Blackbox AI Agent | 314 | 62.8% | Other | โ | 1/10/2025 | ||
| 47 | SWE-agent + Claude 3.7 Sonnet w/ Review Heavy | 312 | 62.4% | Claude | SWE-agent | Sโ | 2/25/2025 | |
| 48 | CodeStory Midwit Agent + swe-search | 311 | 62.2% | Claude | โ | 12/21/2024 | ||
| 49 | OpenHands + 4x Scaled (2024-02-03) | 304 | 60.8% | Other | OpenHands | Sโ | 2/3/2025 | |
| 50 | EntroPO + R2E + Qwen3-Coder-30B-A3B-Instruct | 302 | 60.4% | Qwen | 42-b3yond-6ug | WS | 9/1/2025 | |
| 51 | Learn-by-interact | 301 | 60.2% | Claude | 1/10/2025 | |||
| 52 | DeepSWE-Preview + TTS(Bo16) | 294 | 58.8% | Other | Agentica | WS | 6/29/2025 | |
| 53 | Nemotron-CORTEXA | 291 | 58.2% | Other | NVIDIA | โ | 4/10/2025 | |
| 54 | devlo | 291 | 58.2% | Other | โ | 12/13/2024 | ||
| 55 | Emergent E1 (v2024-12-23) | 286 | 57.2% | Claude | โ | 12/23/2024 | ||
| 56 | Artemis Agent v2 (2025-09-24) | 285 | 57.0% | Claude | Turintech | 9/24/2025 | ||
| 57 | Gru(2024-12-08) | 285 | 57.0% | Other | โ | 12/8/2024 | ||
| 58 | SWE-Rizzo | 283 | 56.6% | Claude | โ | S | 4/5/2025 | |
| 59 | EPAM AI/Run Developer Agent v20241212 + Anthopic Claude 3.5 Sonnet | 277 | 55.4% | Claude | โ | 12/12/2024 | ||
| 60 | Amazon Q Developer Agent (v20241202-dev) | 275 | 55.0% | Other | AWS | 12/2/2024 | ||
| 61 | devlo | 271 | 54.2% | Claude | โ | 11/8/2024 | ||
| 62 | FrogBoss-32B-2510 | 268 | 53.6% | Other | Microsoft | S | 11/10/2025 | |
| 63 | CodeSweep - SWE-agent - Kimi K2 Instruct | 267 | 53.4% | Kimi | CodeSweep Inc. | WS | 8/4/2025 | |
| 64 | Bracket.sh | 266 | 53.2% | Other | โ | 1/20/2025 | ||
| 65 | OpenHands + CodeAct v2.1 (claude-3-5-sonnet-20241022) | 265 | 53.0% | Claude | OpenHands | Sโ | 10/29/2024 | |
| 66 | EntroPO + R2E + Qwen3-Coder-30B-A3B-Instruct | 261 | 52.2% | Qwen | 42-b3yond-6ug | WS | 9/1/2025 | |
| 67 | Google Jules + Gemini 2.0 Flash (v20241212-experimental) | 261 | 52.2% | Gemini | 12/12/2024 | |||
| 68 | Engine Labs (2024-11-25) | 259 | 51.8% | Claude | Engine Labs | 11/25/2024 | ||
| 69 | OpenHands + Qwen3-Coder-30B-A3B-Instruct | 258 | 51.6% | Qwen | Qwen, OpenHands | WS | 8/5/2025 | |
| 70 | AutoCodeRover-v2.1 (Claude-3.5-Sonnet-20241022) | 258 | 51.6% | Claude | โ | 1/22/2025 | ||
| 71 | Agentless-1.5 + Claude-3.5 Sonnet (20241022) | 254 | 50.8% | Claude | Agentless | S | 12/2/2024 | |
| 72 | Bytedance MarsCode Agent | 250 | 50.0% | Other | Bytedance | 11/25/2024 | ||
| 73 | Solver (2024-10-28) | 250 | 50.0% | Other | โ | 10/28/2024 | ||
| 74 | nFactorial (2024-11-05) | 246 | 49.2% | Claude | โ | 11/5/2024 | ||
| 75 | Tools + Claude 3.5 Sonnet (2024-10-22) | 245 | 49.0% | Claude | Anthropic | 10/22/2024 | ||
| 76 | Composio SWE-Kit (2024-10-25) | 243 | 48.6% | Claude | โ | Sโ | 10/25/2024 | |
| 77 | AppMap Navie v2 | 236 | 47.2% | Claude | โ | Sโ | 11/6/2024 | |
| 78 | Skywork-SWE-32B + TTS(Bo8) | 235 | 47.0% | Qwen | Skywork AI | WSโ | 6/16/2025 | |
| 79 | OpenHands + DevStral Small 2505 | 234 | 46.8% | Mistral | OpenHands, Mistral | WSโ | 5/20/2025 | |
| 80 | Emergent E1 (v2024-10-12) | 233 | 46.6% | Claude | โ | 10/23/2024 | ||
| 81 | AutoCodeRover-v2.0 (Claude-3.5-Sonnet-20241022) | 231 | 46.2% | Claude | โ | S | 11/8/2024 | |
| 82 | PatchPilot + Co-PatcheR | 230 | 46.0% | Other | โ | WS | 5/28/2025 | |
| 83 | Solver (2024-09-12) | 227 | 45.4% | Other | โ | 9/24/2024 | ||
| 84 | Gru(2024-08-24) | 226 | 45.2% | Other | โ | 8/24/2024 | ||
| 85 | FrogMini-14B-2510 | 225 | 45.0% | Other | Microsoft | S | 11/10/2025 | |
| 86 | CodeShellAgent + Gemini 2.0 Flash (Experimental) | 221 | 44.2% | Gemini | โ | 1/18/2025 | ||
| 87 | Solver (2024-09-12) | 218 | 43.6% | Other | โ | 9/20/2024 | ||
| 88 | Amazon Nova Premier 1.0 (2025-04-30) | 212 | 42.4% | Nova | Amazon Nova | 5/27/2025 | ||
| 89 | Agentless Lite + O3 Mini (20250214) | 212 | 42.4% | GPT | Agentless | Sโ | 2/14/2025 | |
| 90 | DeepSWE-Preview | 211 | 42.2% | Other | Agentica | WS | 6/29/2025 | |
| 91 | SWE-Exp | 210 | 42.0% | DeepSeek | SWE-Exp | WS | 8/6/2025 | |
| 92 | ugaiforge | 208 | 41.6% | Other | โ | 1/12/2025 | ||
| 93 | nFactorial (2024-10-30) | 208 | 41.6% | Claude | โ | 10/30/2024 | ||
| 94 | SWE-RL (Llama3-SWE-RL-70B + Agentless Mini) (20250226) | 206 | 41.2% | Llama | Agentless | S | 2/26/2025 | |
| 95 | Nebius AI Qwen 2.5 72B Generator + LLama 3.1 70B Critic | 203 | 40.6% | Llama | โ | 11/13/2024 | ||
| 96 | Tools + Claude 3.5 Haiku | 203 | 40.6% | Claude | Anthropic | 10/22/2024 | ||
| 97 | Composio SWEkit + Claude 3.5 Sonnet (2024-10-16) | 203 | 40.6% | Claude | โ | S | 10/16/2024 | |
| 98 | Honeycomb | 203 | 40.6% | Other | โ | 8/20/2024 | ||
| 99 | SWE-agent + SWE-agent-LM-32B | 201 | 40.2% | Qwen | SWE-agent | WSโ | 5/11/2025 | |
| 100 | EPAM AI/Run Developer Agent v20241029 + Anthopic Claude 3.5 Sonnet | 198 | 39.6% | Claude | โ | 10/29/2024 | ||
| 101 | Agentless-1.5 + GPT 4o (2024-05-13) | 194 | 38.8% | GPT | Agentless | S | 10/28/2024 | |
| 102 | Amazon Q Developer Agent (v20240719-dev) | 194 | 38.8% | Other | AWS | 7/21/2024 | ||
| 103 | AutoCodeRover (v20240620) + GPT 4o (2024-05-13) | 192 | 38.4% | GPT | โ | 6/28/2024 | ||
| 104 | SWE-agent + DevStral Small 2507 | 190 | 38.0% | Mistral | SWE-agent | WSโ | 7/25/2025 | |
| 105 | Skywork-SWE-32B | 190 | 38.0% | Qwen | Skywork AI | WSโ | 6/16/2025 | |
| 106 | Factory Code Droid | 185 | 37.0% | Other | โ | 6/17/2024 | ||
| 107 | SWE-agent + Claude 3.5 Sonnet | 168 | 33.6% | Claude | SWE-agent | Sโ | 6/20/2024 | |
| 108 | SWE-Fixer (Qwen2.5-7b retriever + Qwen2.5-72b editor) | 164 | 32.8% | Qwen | โ | WSโ | 3/6/2025 | |
| 109 | MASAI + GPT 4o (2024-06-12) | 163 | 32.6% | GPT | โ | 6/12/2024 | ||
| 110 | Artemis Agent v1 (2024-11-20) | 160 | 32.0% | Other | โ | 11/20/2024 | ||
| 111 | nFactorial (2024-10-07) | 158 | 31.6% | Other | โ | 10/7/2024 | ||
| 112 | SWE-Fixer (Qwen2.5-7b retriever + Qwen2.5-72b editor) 20241128 | 151 | 30.2% | Qwen | โ | S | 11/28/2024 | |
| 113 | Lingma Agent + Lingma SWE-GPT 72b (v0925) | 144 | 28.8% | GPT | Alibaba | S | 10/2/2024 | |
| 114 | EPAM AI/Run Developer Agent + GPT4o | 135 | 27.0% | GPT | โ | 10/16/2024 | ||
| 115 | AppMap Navie + GPT 4o (2024-05-13) | 131 | 26.2% | GPT | โ | Sโ | 6/15/2024 | |
| 116 | nFactorial (2024-10-01) | 129 | 25.8% | Other | โ | 10/1/2024 | ||
| 117 | Amazon Q Developer Agent (v20240430-dev) | 128 | 25.6% | Other | AWS | 5/9/2024 | ||
| 118 | Lingma Agent + Lingma SWE-GPT 72b (v0918) | 125 | 25.0% | GPT | Alibaba | S | 9/18/2024 | |
| 119 | EPAM AI/Run Developer Agent + GPT4o | 120 | 24.0% | GPT | โ | 8/20/2024 | ||
| 120 | MCTS-Refine-7B | 116 | 23.2% | Other | โ | WS | 6/27/2025 | |
| 121 | SWE-agent + GPT 4o (2024-05-13) | 116 | 23.2% | GPT | SWE-agent | Sโ | 7/28/2024 | |
| 122 | SWE-agent + GPT 4 (1106) | 112 | 22.4% | GPT | SWE-agent | Sโ | 4/2/2024 | |
| 123 | Lingma Agent + Lingma SWE-GPT 7b (v0925) | 91 | 18.2% | GPT | Alibaba | S | 10/2/2024 | |
| 124 | SWE-agent + Claude 3 Opus | 79 | 15.8% | Claude | SWE-agent | Sโ | 4/2/2024 | |
| 125 | Lingma Agent + Lingma SWE-GPT 7b (v0918) | 51 | 10.2% | GPT | Alibaba | S | 9/18/2024 | |
| 126 | RAG + Claude 3 Opus | 35 | 7.0% | Claude | โ | Sโ | 4/2/2024 | |
| 127 | RAG + Claude 2 | 22 | 4.4% | Claude | โ | Sโ | 10/10/2023 | |
| 128 | RAG + GPT 4 (1106) | 14 | 2.8% | GPT | โ | Sโ | 4/2/2024 | |
| 129 | RAG + SWE-Llama 7B | 7 | 1.4% | Llama | SWE-agent | Sโ | 10/10/2023 | |
| 130 | RAG + SWE-Llama 13B | 6 | 1.2% | Llama | SWE-agent | Sโ | 10/10/2023 | |
| 131 | RAG + ChatGPT 3.5 | 2 | 0.4% | GPT | SWE-agent | Sโ | 10/10/2023 |