Interface: Evaluator<Input, Output, Expected>
An evaluator is a collection of functions that can be used to evaluate a model. It consists of:
data, a function that returns a list of inputs, expected outputs, and metadatatask, a function that takes an input and returns an outputscores, a set of functions that take an input, output, and expected value and return a scoremetadata, optional additional metadata for the eval definition, such as experiment name.
Type parameters
| Name |
|---|
Input |
Output |
Expected |
Properties
data
• data: EvalData<Input, Expected>
metadata
• Optional metadata: EvalMetadata
scores
• scores: EvalScorer<Input, Output, Expected>[]
task
• task: EvalTask<Input, Output>