Zubnet AI学习Wiki › Model Card
Safety

Model Card

Model Documentation, Data Sheet
描述机器学习模型预期用途、性能特征、训练数据、限制和伦理考量的标准化文档。Mitchell 等人(2019)引入,model card 旨在增加透明度,帮助用户对一个模型是否适合自己的用例做出明智决定。

为什么重要

Model card 是 AI 的营养标签。没有它,你是在盲用模型 — 你不知道它用什么数据训练、它在什么上表现好或差、它可能对哪些群体不利。当 AI 监管增加(欧盟 AI 法案要求文档),model card 从最佳实践变成法律要求。

Deep Dive

A model card typically includes: model details (architecture, version, date), intended use (what the model is designed for and what it shouldn't be used for), training data (description of the training dataset, including any known biases), performance metrics (broken down by relevant subgroups), limitations (known failure modes, edge cases), and ethical considerations (potential harms, mitigation strategies).

In Practice

Hugging Face popularized model cards by requiring them for all models on their Hub. Quality varies widely — some are detailed technical documents, others are perfunctory placeholders. The best model cards include per-group performance breakdowns (does the model work equally well for different languages, demographics, or domains?), concrete examples of failure cases, and honest assessments of limitations rather than marketing language.

Data Cards and System Cards

The concept extends beyond models: data cards document datasets (collection methodology, annotation process, known biases), and system cards document entire AI systems (model + post-processing + guardrails + deployment context). Anthropic publishes system cards for Claude releases. These broader documents capture information that model cards alone miss — a model might be safe in isolation but dangerous when deployed with certain tool-use capabilities or without content filters.

相关概念

← 所有术语
← Model Model Collapse →