⚙️

Eval Architecture

Eval harness design, CI/CD gates, LLM-as-judge calibration

GitHub Actions, Promptfoo, Weave