@reaatech/agent-eval-harness-cli
End-to-end agent evaluation — trajectory eval, tool-use correctness, cost-per-task, latency budgets, regression suites with golden trajectories, LLM-as-judge with calibration. For full agent runs, not just classifiers.
- Latest release
- 1d ago
- Releases
- 2
- Known CVEs
- 0
- First release
- May 06, 2026
- License
- MIT
Repository
Source
- Stars
- 0
- Forks
- 0
- Open issues
- 1
- Language
- TypeScript
- agent-evaluation
- agentic-ai
- ai-agents
- cost-tracking
- latency
- llm-as-a-judge
- llm-as-judge
- llm-eval
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
Insights
Activity
- Total releases
- 2
- Last 12 months
- 2
- Cadence
- ~33 days
- Dependencies
- 16
Releases per month
last 12 monthsRelease mix
- patch 1
2
releases
Dependencies
Depends on
0.1.1-
chalk ^5.3.0
-
cli-progress ^3.12.0
-
commander ^15.0.0
-
@reaatech/agent-eval-harness-cost 0.1.0
-
@reaatech/agent-eval-harness-gate 0.1.1
-
@reaatech/agent-eval-harness-golden 0.1.0
-
@reaatech/agent-eval-harness-judge 0.1.0
-
@reaatech/agent-eval-harness-latency 0.1.0
-
@reaatech/agent-eval-harness-mcp-server 0.1.1
-
@reaatech/agent-eval-harness-observability 0.1.1
1–10 of 16
Used by
Nothing tracked depends on this yet.
Releases