Evaluations (Evals)

Evaluations (Evals) are benchmarked test suites used to track model behavior, regressions, and improvements against defined criteria.

Related terms

Related terms