agent-evaluator

Production-ready evaluation framework for AI agents: 58 metrics (25 native + 33 Harness Config) across 7 evaluation gates (goal achievement, behavioral integrity, reliability, performance, security, multi-agent coordination, and observability).

Latest release: May 28, 2026
Releases: 22
Known CVEs: 0
First release: Mar 19, 2026
License: MIT

View on PyPI

Repository

Source: bullpeng72/agent-evaluator

Security score

No OpenSSF Scorecard available for this repository.

Packages from this repo

No other tracked packages from this repository.

Insights

Activity

Total releases: 22
Last 12 months: 22
Cadence: ~daily
Dependencies: 41

Release mix

minor 4
patch 17

22 releases

Dependencies

Depends on (version 0.9.4)

anthropic <1.0.0,>=0.20.0
arize-phoenix >=15.4.0
autogen-agentchat <1.0.0,>=0.4.0
autogen-core <1.0.0,>=0.4.0
build >=1.0.0
crewai <2.0.0,>=1.0.0
datasets <6.0.0,>=4.0.0
deepeval <4.0.0,>=3.0.0
dspy-ai >=2.0.0
fastapi <1.0.0,>=0.110.0

1–10 of 41
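
The version constraints listed above, such as `<1.0.0,>=0.20.0`, are comma-separated clauses that must all hold at once. The following is a minimal sketch of how such a constraint can be checked. It assumes plain numeric versions only and is not a full PEP 440 implementation; the helper names `parse_version` and `satisfies` are hypothetical, and the candidate versions in the usage lines are made-up examples, not versions known to be installed anywhere.

```python
import operator

# Comparison operators supported by this simplified sketch (not full PEP 440).
_OPS = {
    ">=": operator.ge,
    "<=": operator.le,
    "==": operator.eq,
    ">": operator.gt,
    "<": operator.lt,
}


def parse_version(text):
    """Turn '0.20.0' into (0, 20, 0); only plain numeric versions are handled."""
    parts = [int(part) for part in text.strip().split(".")]
    # Pad short versions such as '1.0' so tuple comparison behaves as expected.
    parts += [0] * (3 - len(parts))
    return tuple(parts)


def satisfies(version, spec):
    """Return True if `version` meets every comma-separated clause in `spec`."""
    candidate = parse_version(version)
    for clause in spec.split(","):
        clause = clause.strip()
        # Try two-character operators before one-character ones.
        for symbol in (">=", "<=", "==", ">", "<"):
            if clause.startswith(symbol):
                bound = parse_version(clause[len(symbol):])
                if not _OPS[symbol](candidate, bound):
                    return False
                break
        else:
            raise ValueError(f"unsupported clause: {clause!r}")
    return True


# Hypothetical candidate versions checked against the listed `anthropic` constraint.
print(satisfies("0.34.2", "<1.0.0,>=0.20.0"))
print(satisfies("1.0.0", "<1.0.0,>=0.20.0"))
```

In real tooling, a maintained parser such as the `packaging` library is the safer choice, since it also handles pre-releases, wildcards, and the other specifier forms this sketch ignores.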

Used by

Nothing tracked depends on this yet.

Releases

Version	Released
`0.9.4` patch	May 28, 2026
`0.9.3` patch	May 27, 2026
`0.9.2` patch	May 15, 2026
`0.9.1` patch	Apr 27, 2026
`0.9.0` minor	Apr 27, 2026
`0.8.5` patch	Apr 23, 2026
`0.8.4` patch	Apr 22, 2026
`0.8.1` patch	Apr 15, 2026
`0.8.0` minor	Apr 13, 2026
`0.7.9` patch	Apr 11, 2026
`0.7.8` patch	Apr 11, 2026
`0.7.7` patch	Apr 11, 2026
`0.7.4` patch	Apr 08, 2026
`0.7.0` minor	Apr 01, 2026
`0.6.7` patch	Mar 31, 2026
`0.6.6` patch	Mar 31, 2026
`0.6.0` minor	Mar 22, 2026
`0.5.8` patch	Mar 20, 2026
`0.5.7` patch	Mar 20, 2026
`0.5.6` patch	Mar 20, 2026

1–20 of 22
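
The patch and minor labels in the table above appear consistent with a simple rule on the version number: a non-zero last component is a patch release, otherwise a non-zero middle component is a minor release. The sketch below applies that rule to the 20 listed versions. The rule and the `bump_kind` helper are assumptions inferred from the table, not the index's documented classification method.

```python
from collections import Counter

# The 20 versions shown in the table above (the remaining 2 of 22 are not listed).
LISTED = [
    "0.9.4", "0.9.3", "0.9.2", "0.9.1", "0.9.0",
    "0.8.5", "0.8.4", "0.8.1", "0.8.0",
    "0.7.9", "0.7.8", "0.7.7", "0.7.4", "0.7.0",
    "0.6.7", "0.6.6", "0.6.0",
    "0.5.8", "0.5.7", "0.5.6",
]


def bump_kind(version):
    """Classify a MAJOR.MINOR.PATCH version by its lowest non-zero component."""
    major, minor, patch = (int(part) for part in version.split("."))
    if patch != 0:
        return "patch"
    if minor != 0:
        return "minor"
    return "major"


counts = Counter(bump_kind(version) for version in LISTED)
print(counts["minor"], counts["patch"])  # prints "4 16"
```

The 4 minor releases match the "minor 4" figure in the release mix. The listed patch releases total 16, one short of "patch 17", which is expected because two of the 22 releases fall outside the rows shown.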
