@reaatech/agent-eval-harness-gate
End-to-end agent evaluation — trajectory eval, tool-use correctness, cost-per-task, latency budgets, regression suites with golden trajectories, LLM-as-judge with calibration. For full agent runs, not just classifiers.
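The description names three checks that can be read as a pass/fail gate: matching a golden trajectory, staying within a cost-per-task budget, and staying within a latency budget. The sketch below illustrates that idea only. Every name in it (`AgentRun`, `Budget`, `GateResult`, `checkRun`) is hypothetical and is not taken from this package's API, and the exact-match comparison of tool calls is an assumed, simplified stand-in for trajectory evaluation.

```typescript
// Hypothetical types for illustration; NOT the package's actual API.
interface AgentRun {
  toolCalls: string[]; // ordered names of the tools the agent invoked
  costUsd: number;     // total spend for the task
  latencyMs: number;   // wall-clock time for the task
}

interface Budget {
  maxCostUsd: number;
  maxLatencyMs: number;
}

interface GateResult {
  passed: boolean;
  failures: string[]; // names of the checks that failed
}

// Compare one run against a golden tool-call sequence and a budget.
// Assumption: "trajectory" here means an exact, ordered match of tool names.
function checkRun(run: AgentRun, golden: string[], budget: Budget): GateResult {
  const failures: string[] = [];

  const sameTools =
    run.toolCalls.length === golden.length &&
    run.toolCalls.every((name, i) => name === golden[i]);
  if (!sameTools) failures.push("trajectory");

  if (run.costUsd > budget.maxCostUsd) failures.push("cost");
  if (run.latencyMs > budget.maxLatencyMs) failures.push("latency");

  return { passed: failures.length === 0, failures };
}
```

A run that repeats the golden tool sequence within both budgets passes; a run that skips a tool and overspends fails with `trajectory` and `cost` listed in `failures`. Returning the list of failed checks rather than a bare boolean keeps a regression report readable when several budgets are breached at once.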
- Latest release: 1d ago
- Releases: 2
- Known CVEs: 0
- First release: May 04, 2026
- License: MIT
Repository
- Stars: 0
- Forks: 0
- Open issues: 1
- Language: TypeScript
- agent-evaluation
- agentic-ai
- ai-agents
- cost-tracking
- latency
- llm-as-a-judge
- llm-as-judge
- llm-eval
Security score
No OpenSSF Scorecard available for this repository.
Insights
Activity
- Total releases: 2
- Last 12 months: 2
- Cadence: ~35 days
- Dependencies: 3
[Chart: releases per month, last 12 months]
Release mix
- patch: 1 (of 2 releases)
Dependencies
Depends on (0.1.1)
- @reaatech/agent-eval-harness-suite 0.1.1
- @reaatech/agent-eval-harness-types 0.1.0
- yaml ^2.8.4
Used by
2
Releases
| Version | Released | |
|---|---|---|
| 0.1.1 (patch) | | |
| 0.1.0 (initial) | | |