EFEVALFORGE

Ship or hold?

Deploy gate

The deploy gate runs the full 25-question eval against both pipelines and applies the configured thresholds. It emits PASS/FAIL with the exact failing metrics. Wire this into CI and the build stops shipping if any gate trips.

No verdict yet. Click Run deploy gate.