Getting Started - BenchmarkMD

Quick Start

Measures code quality based on:

Cost is based on API pricing for the selected agent and includes both input and output tokens.
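As a rough illustration of how a token-based cost figure like this might be computed, here is a minimal sketch. It is not BenchmarkMD's actual implementation: the `TokenPricing` class, the `estimate_cost` function, and the per-million-token prices are all hypothetical placeholders chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPricing:
    """Hypothetical API prices in USD per one million tokens."""
    input_per_million: float
    output_per_million: float


def estimate_cost(input_tokens: int, output_tokens: int, pricing: TokenPricing) -> float:
    """Return the estimated USD cost of a run: input cost plus output cost."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * pricing.input_per_million
    output_cost = output_tokens / 1_000_000 * pricing.output_per_million
    return input_cost + output_cost


# Placeholder prices (assumed for illustration, not real vendor pricing).
example_pricing = TokenPricing(input_per_million=3.0, output_per_million=15.0)

# A run that consumed 120,000 input tokens and 8,000 output tokens.
cost = estimate_cost(120_000, 8_000, example_pricing)
print(f"Estimated cost: ${cost:.4f}")
```

Because input and output tokens are usually priced differently, the sketch keeps the two rates separate rather than applying one blended rate.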

Time is how long the agent took to complete the task.

Plan	Price	Features
Free	$0	3 benchmarks/day
Pro	$29/mo	Unlimited benchmarks + API access
Enterprise	Custom	Custom audits + support

Our benchmarks use standardized tasks to ensure fair comparison across agents.

You can enter any custom task in the text field. The more specific the task, the better the results.

Currently supported agents: Claude Code, Cursor, GitHub Copilot, Devin, and Bolt.new.