Drift Detection: Definition & Meaning — AI Wiki

監控隨時間推移可能降低性能的資料分佈或模型行為的變化。Data drift:輸入資料變化(客戶人口統計變化、出現新產品類別)。Concept drift:輸入和正確輸出之間的關係變化(什麼構成垃圾郵件在演變)。Model drift:模型預測逐漸變得不那麼準確,儘管模型本身沒變。

為什麼重要

模型在歷史資料上訓練,但世界一直在變。2024 年訓練的詐欺偵測模型會錯過 2025 年的新詐欺模式。在疫情前行為上訓練的推薦系統會在疫情後做糟糕建議。漂移偵測在這些退化變昂貴之前抓到它們 — 提醒你模型需要重訓或更新。

Deep Dive

Data drift detection: compare the statistical distribution of current inputs to the training data distribution. If features shift significantly (using tests like KS test, PSI, or Jensen-Shannon divergence), the model may be operating outside its training distribution. Example: a credit scoring model trained on applicants aged 25–55 starts receiving applications from 18-year-olds — a population it's never seen.

Concept Drift

Concept drift is harder to detect because the inputs look the same but the correct outputs change. During COVID, "normal" purchase patterns shifted dramatically — buying 100 rolls of toilet paper went from "probable fraud" to "Tuesday." The model's predictions became wrong not because the model degraded, but because reality changed. Detecting concept drift requires comparing predictions to ground truth, which often arrives with a delay.

For LLMs

LLM drift manifests differently: user query patterns shift (new topics emerge), provider model updates change behavior (API model versions change silently), and the world changes (outdated training data). Monitoring strategies include: tracking output quality scores over time, detecting shifts in topic distribution of queries, alerting on increases in user-reported issues, and periodically re-evaluating on a fixed benchmark to detect provider-side changes.

Drift Detection

為什麼重要

Deep Dive

Concept Drift

For LLMs

相關概念