Zubnet AI學習Wiki › Model Card
Safety

Model Card

Model Documentation, Data Sheet
描述機器學習模型預期用途、性能特徵、訓練資料、限制和倫理考量的標準化文件。Mitchell 等人(2019)引入,model card 旨在增加透明度,幫助使用者對一個模型是否適合自己的用例做出明智決定。

為什麼重要

Model card 是 AI 的營養標籤。沒有它,你是在盲用模型 — 你不知道它用什麼資料訓練、它在什麼上表現好或差、它可能對哪些群體不利。當 AI 監管增加(歐盟 AI 法案要求文件),model card 從最佳實踐變成法律要求。

Deep Dive

A model card typically includes: model details (architecture, version, date), intended use (what the model is designed for and what it shouldn't be used for), training data (description of the training dataset, including any known biases), performance metrics (broken down by relevant subgroups), limitations (known failure modes, edge cases), and ethical considerations (potential harms, mitigation strategies).

In Practice

Hugging Face popularized model cards by requiring them for all models on their Hub. Quality varies widely — some are detailed technical documents, others are perfunctory placeholders. The best model cards include per-group performance breakdowns (does the model work equally well for different languages, demographics, or domains?), concrete examples of failure cases, and honest assessments of limitations rather than marketing language.

Data Cards and System Cards

The concept extends beyond models: data cards document datasets (collection methodology, annotation process, known biases), and system cards document entire AI systems (model + post-processing + guardrails + deployment context). Anthropic publishes system cards for Claude releases. These broader documents capture information that model cards alone miss — a model might be safe in isolation but dangerous when deployed with certain tool-use capabilities or without content filters.

相關概念

← 所有術語
← Model Model Collapse →