@reaatech/agent-eval-harness-observability
End-to-end agent evaluation — trajectory eval, tool-use correctness, cost-per-task, latency budgets, regression suites with golden trajectories, LLM-as-judge with calibration. For full agent runs, not just classifiers.
- Latest release: 1d ago
- Releases: 2
- Known CVEs: 0
- First release: May 04, 2026
- License: MIT
Repository
- Stars: 0
- Forks: 0
- Open issues: 1
- Language: TypeScript
- agent-evaluation
- agentic-ai
- ai-agents
- cost-tracking
- latency
- llm-as-a-judge
- llm-as-judge
- llm-eval
Security score
No OpenSSF Scorecard available for this repository.
Insights
Activity
- Total releases: 2
- Last 12 months: 2
- Cadence: ~35 days
- Dependencies: 11
Releases per month (last 12 months): chart not reproduced
Release mix
- patch: 1 of 2 releases
Dependencies
Depends on (version 0.1.1)
- @opentelemetry/api ~1.8.0
- @opentelemetry/core ^2.7.0
- @opentelemetry/exporter-trace-otlp-http ^0.51.0
- @opentelemetry/exporter-zipkin ^2.7.1
- @opentelemetry/resources ^2.7.1
- @opentelemetry/sdk-metrics ^2.7.1
- @opentelemetry/sdk-node ^0.51.0
- @opentelemetry/sdk-trace-node ^2.7.1
- @opentelemetry/semantic-conventions ^1.24.0
- pino ^10.3.1
Showing 1–10 of 11