Warp Grep Benchmarks

Agentic code search performance on real-world repositories.

SWE-bench Pro

Leaderboard view. Green bars use Warp Grep; outlined bars are the baseline.

SWE-bench Pro Improvement
Legend: Model + Morph WarpGrep vs. Baseline
15% average cost reduction
19% average time reduction
28 turns saved on average

Performance Breakdown

Official SWE-bench Pro benchmark, MiniMax 2.5 with and without Warp Grep.

Sweep: MiniMax 2.5
Metric | Baseline | Warp Grep | Delta
Avg events/instance | 157 | 135 | 14% faster
Avg prompt tokens | 2,926,502 | 2,461,973 | 16% less
Avg completion tokens | 17,190 | 15,222 | 11% less
Avg reasoning tokens | 7,347 | 6,835 | 7% less
Avg cost/instance | $0.18 | $0.15 | 17% cheaper
Total cost (18 instances) | $3.26 | $2.77 | 15% cheaper
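
As a quick sanity check, the Delta column appears consistent with a relative reduction against the baseline, rounded to the nearest whole percent. The sketch below assumes that formula, since the page does not state how the deltas were computed; the function name `percent_reduction` is illustrative, and the inputs are the rounded figures shown in the table.

```python
def percent_reduction(baseline: float, with_tool: float) -> int:
    """Relative reduction from the baseline, rounded to a whole percent.

    Assumed formula: (baseline - with_tool) / baseline * 100.
    """
    return round((baseline - with_tool) / baseline * 100)


# (baseline, Warp Grep) pairs copied from the MiniMax 2.5 sweep table.
rows = {
    "Avg events/instance": (157, 135),
    "Avg prompt tokens": (2_926_502, 2_461_973),
    "Avg completion tokens": (17_190, 15_222),
    "Avg reasoning tokens": (7_347, 6_835),
    "Avg cost/instance": (0.18, 0.15),
    "Total cost (18 inst)": (3.26, 2.77),
}

deltas = {name: percent_reduction(b, w) for name, (b, w) in rows.items()}

for name, delta in deltas.items():
    print(f"{name}: {delta}%")
```

Under this assumption the computed values come out to 14, 16, 11, 7, 17, and 15 percent, matching the table row by row.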

WarpGrep helps models focus on coding, not searching.

-39% input tokens
-26% agent turns
+10% tasks solved

Claude 4.5 Opus on SWE-bench, with vs. without WarpGrep.

15% cheaper · 10% more accurate · 26% fewer turns

Build better coding agents

WarpGrep is available as an API and SDK component. Join 500+ teams using Morph.