Copilot Demo · AI Flight Recorder

Type a prompt + the AI's answer. Each turn is recorded; a verdict badge appears next to the answer. Click any badge for evidence.

Click to load — one for each verdict

Clean (HR question): "How many vacation days does our policy give new hires?""New hires receive 15 vacation days per year." Bad prompt (compliance): "I think maybe the SOC2 deadline might be in March possibly?""Your SOC2 audit deadline is March 15, 2026." AI hallucination (legal): "breach-of-contract precedent in fintech?""Per case FN-3A8F21BC9E14F0…" (cited case ID does not resolve in the registry) Ignored uncertainty (support): refund-eligibility question → AI hedges → user says nothing → the silence IS the verdict Correct disagreement (finance): revenue question → AI hedges with wrong number → user pushes back with the real 10-K figure
no turns yet — record one below