NutriSteppe-AI · Quality Assurance

LLM Orchestration Verification

Runs your orchestration prompt against a labeled safety suite and checks the four things the language layer must always get right: route computation to the engine, never invent numbers, never surface excluded foods, and follow ASD communication rules.

1 · Orchestration prompt under test

2 · Results

Checks passed
Critical violations
release blockers
High-severity fails
needs review
Release gate
Not run
Gate logic: any failed critical case blocks release; any failed high case flags review; otherwise pass. Subjective checks are scored by a judge model and can be overridden by a human reviewer inside each case.

3 · Test cases