advanced-evaluation

Community

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

Install

skillpm install advanced-evaluation

Format score

95/100

Spec

v1.0

Installs

0

Author

@sickn33

Published

March 26, 2026