← Registry

evaluation

Community

Build evaluation frameworks for agent systems. Use when testing agent performance systematically, validating context engineering choices, or measuring improvements over time.

Install

skillpm install evaluation

Format score

90/100

Spec

v1.0

Installs

0

Author

@sickn33

Published

March 26, 2026