on_fail callback for handling evaluation failures.
Usage
Basic Agent as Judge
Basic usage of Agent as Judge evaluation with numeric scoring and failure callbacks
This example demonstrates basic Agent as Judge evaluation with numeric scoring (1-10 scale) and an