agent-evaluator
Production-ready evaluation framework for AI agents — 58 metrics (25 native + 33 Harness Config) across 7 evaluation gates: goal achievement, behavioral integrity, reliability, performance, security, multi-agent coordination, and observability
- Latest release
- May 28, 2026
- Releases
- 22
- Known CVEs
- 0
- First release
- Mar 19, 2026
- License
- MIT
Repository
Source
- Stars
- —
- Forks
- —
- Open issues
- —
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
No other tracked packages from this repository.
Insights
Activity
- Total releases
- 22
- Last 12 months
- 22
- Cadence
- ~daily
- Dependencies
- 41
Releases per month
last 12 monthsRelease mix
- minor 4
- patch 17
22
releases
Dependencies
Depends on
0.9.4-
anthropic <1.0.0,>=0.20.0
-
arize-phoenix >=15.4.0
-
autogen-agentchat <1.0.0,>=0.4.0
-
autogen-core <1.0.0,>=0.4.0
-
build >=1.0.0
-
crewai <2.0.0,>=1.0.0
-
datasets <6.0.0,>=4.0.0
-
deepeval <4.0.0,>=3.0.0
-
dspy-ai >=2.0.0
-
fastapi <1.0.0,>=0.110.0
1–10 of 41
Used by
Nothing tracked depends on this yet.
Releases
| Version | Released | |
|---|---|---|
0.9.4
patch
| ||
0.9.3
patch
| ||
0.9.2
patch
| ||
0.9.1
patch
| ||
0.9.0
minor
| ||
0.8.5
patch
| ||
0.8.4
patch
| ||
0.8.1
patch
| ||
0.8.0
minor
| ||
0.7.9
patch
| ||
0.7.8
patch
| ||
0.7.7
patch
| ||
0.7.4
patch
| ||
0.7.0
minor
| ||
0.6.7
patch
| ||
0.6.6
patch
| ||
0.6.0
minor
| ||
0.5.8
patch
| ||
0.5.7
patch
| ||
0.5.6
patch
|
1–20 of 22