Next.js AI Model Performance Evaluations
Blackbox CLI has achieves a State of the art results on Next.js Evals benchmark :| Model | Success Rate | Tasks Completed |
|---|---|---|
| Claude Sonnet 4.5 | 52% | 26/50 |
| Claude Opus 4.5 | 60% | 30/50 |
About the benchmark
Next.js Evals provides comprehensive performance evaluations of AI models and agents on Next.js code generation and migration tasks. The evaluation framework measures several key metrics:- Success Rate: Percentage of successful code generation and migration tasks
- Execution Time: Average duration to complete tasks
- Token Usage: Total tokens consumed during evaluations
- Quality Improvements: Assessment of code quality and best practices