Sign in Sign up
npm

@reaatech/agent-eval-harness-gate

End-to-end agent evaluation — trajectory eval, tool-use correctness, cost-per-task, latency budgets, regression suites with golden trajectories, LLM-as-judge with calibration. For full agent runs, not just classifiers.

Latest release
1d ago
Releases
2
Known CVEs
0
First release
May 04, 2026
License
MIT
View on Npm
Repository

Source

reaatech/agent-eval-harness
Stars
0
Forks
0
Open issues
1
Language
TypeScript
  • agent-evaluation
  • agentic-ai
  • ai-agents
  • cost-tracking
  • latency
  • llm-as-a-judge
  • llm-as-judge
  • llm-eval

Security score

No OpenSSF Scorecard available for this repository.

Packages from this repo

Insights

Activity

Total releases
2
Last 12 months
2
Cadence
~35 days
Dependencies
3

Releases per month

last 12 months

Release mix

  • patch 1
2 releases
Releases
Version Released
0.1.1 patch
0.1.0 initial