Zubnet AI学习Wiki › Drift Detection
基础设施

Drift Detection

Data Drift, Model Drift, Concept Drift
监控随时间推移可能降低性能的数据分布或模型行为的变化。Data drift:输入数据变化(客户人口统计变化、出现新产品类别)。Concept drift:输入和正确输出之间的关系变化(什么构成垃圾邮件在演变)。Model drift:模型预测逐渐变得不那么准确,尽管模型本身没变。

为什么重要

模型在历史数据上训练,但世界一直在变。2024 年训练的欺诈检测模型会错过 2025 年的新欺诈模式。在疫情前行为上训练的推荐系统会在疫情后做糟糕建议。漂移检测在这些退化变昂贵之前抓到它们 — 提醒你模型需要重训或更新。

Deep Dive

Data drift detection: compare the statistical distribution of current inputs to the training data distribution. If features shift significantly (using tests like KS test, PSI, or Jensen-Shannon divergence), the model may be operating outside its training distribution. Example: a credit scoring model trained on applicants aged 25–55 starts receiving applications from 18-year-olds — a population it's never seen.

Concept Drift

Concept drift is harder to detect because the inputs look the same but the correct outputs change. During COVID, "normal" purchase patterns shifted dramatically — buying 100 rolls of toilet paper went from "probable fraud" to "Tuesday." The model's predictions became wrong not because the model degraded, but because reality changed. Detecting concept drift requires comparing predictions to ground truth, which often arrives with a delay.

For LLMs

LLM drift manifests differently: user query patterns shift (new topics emerge), provider model updates change behavior (API model versions change silently), and the world changes (outdated training data). Monitoring strategies include: tracking output quality scores over time, detecting shifts in topic distribution of queries, alerting on increases in user-reported issues, and periodically re-evaluating on a fixed benchmark to detect provider-side changes.

相关概念

← 所有术语
ESC