Cursor Composer 2.5: 79.8% SWE-Bench at Under $1 per Task

Cursor ships Composer 2.5, achieving 79.8% on SWE-Bench Multilingual at under $1 per task versus $11 for competitors, built on Kimi K2.5 with 25× more synthetic training and mid-task feedback. Cursor is also training a from-scratch model on SpaceXAI's 1M H100-equivalent Colossus cluster.

1 min read|agenticonsult Intelligence

Cursor Composer 2.5: 79.8% SWE-Bench at Under $1 per Task

Cursor has shipped Composer 2.5, scoring 79.8% on SWE-Bench Multilingual at approximately $1 per task — versus roughly $11 for comparable competitors. The model uses the open Kimi K2.5 base trained on 25× more synthetic tasks with mid-task feedback rather than final-output-only reward. IDE-only, no public API. Cursor is also separately training a from-scratch model on SpaceXAI's Colossus cluster (1M H100-equivalent GPUs).

Why It Matters

Composer 2.5 represents a cost-performance inflection: frontier-level coding benchmark results at commodity cost. The SpaceXAI from-scratch training signals Cursor's intent to own its model stack rather than depend on foundation lab APIs long-term.

This breaking-news item was assembled from the cited primary source with AI assistance. It is intended for rapid situational awareness — refer to the original publication for the definitive statement.