Functions for sampling objects for audit, recording human judgments, and comparing system evaluations to reviewer assessments.
Functions for sampling objects for audit, recording human judgments, and comparing system evaluations to reviewer assessments.