Sprint 1 (Weeks 1–2) — Drift Monitoring and Real-Time Eval
Build the live drift monitoring infrastructure. The eval harness moves from periodic batch evaluation to streaming evaluation against production traffic. Alerts are wired to the on-call rotation, and alert thresholds are tuned using the MVP eval results.
Output: live drift monitoring running against a stream of test patient encounters, with alert rules tuned to the model's observed baseline behavior.
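
Illustrative sketch only: the plan does not name a drift metric, window size, or threshold, so everything below is an assumption. It uses the Population Stability Index (PSI) over a sliding window of model scores in [0, 1], compared against a fixed baseline sample (for example, scores collected during the MVP eval). The names `psi` and `DriftMonitor`, the window size, and the 0.2 threshold are hypothetical placeholders; the real threshold would come out of the tuning work described above.

```python
import math
from collections import deque


def psi(baseline, current, bins=10, eps=1e-6):
    """Population Stability Index between two samples of scores in [0, 1]."""
    def proportions(values):
        counts = [0] * bins
        for v in values:
            # Clamp into [0, bins - 1] so that 1.0 and stray values still land in a bin.
            idx = max(0, min(int(v * bins), bins - 1))
            counts[idx] += 1
        total = len(values)
        # eps keeps empty bins from producing log(0) or division by zero.
        return [max(c / total, eps) for c in counts]

    p = proportions(baseline)
    q = proportions(current)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


class DriftMonitor:
    """Streaming drift check: one score in, one (psi, alert) decision out."""

    def __init__(self, baseline, window_size=200, threshold=0.2):
        # threshold=0.2 is a placeholder, not a tuned value.
        self.baseline = list(baseline)
        self.window = deque(maxlen=window_size)
        self.threshold = threshold

    def observe(self, score):
        """Record a score; return None until the window is full, then (psi, alert)."""
        self.window.append(score)
        if len(self.window) < self.window.maxlen:
            return None
        value = psi(self.baseline, self.window)
        return value, value > self.threshold
```

In use, each scored encounter would call `observe`, and a result with `alert` set to `True` would be forwarded to whatever paging integration the on-call rotation uses. Recomputing PSI on every observation is simple but not cheap; a production version might evaluate every N events instead.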
