Model Registry: Definition & Meaning — AI Wiki

对整个生命周期里训练好的机器学习模型做版本管理、跟踪、治理的中心化系统。像包 registry(npm、PyPI)但是用于 ML 模型:每个模型版本都和它的元数据一起存(训练数据、超参数、性能指标、血统),让你能复现结果、比较版本、部署特定模型到生产。

为什么重要

没有 model registry,ML 开发就变混乱:哪个版本的模型在生产?它用什么数据训练的?上次更新是什么时候?谁训练的?model registry 回答所有这些问题,为可复现、可审计、可靠的 ML 部署提供基础。对任何在生产中跑模型的团队,它是必需的基础设施。

Deep Dive

A model registry typically stores: the model artifact (weights, configuration), training metadata (hyperparameters, dataset version, training duration), evaluation metrics (accuracy, latency, fairness metrics across demographics), deployment status (which version is serving in production), and lineage (which experiment, code commit, and data pipeline produced this model). MLflow Model Registry, Weights & Biases, and SageMaker Model Registry are popular implementations.

The Deployment Pipeline

In production workflows, the model registry is the handoff point between training and serving: a data scientist trains and evaluates models, registers the best one, a reviewer approves it, and the deployment system pulls the approved model and serves it. This separation of concerns — training doesn't directly touch production, deployment only uses registry-approved models — reduces the risk of deploying broken models.

For LLMs

LLM registries have specific needs: models are very large (tens to hundreds of GB), fine-tuned variants share a common base model (store adapters separately), and evaluation is more complex (automated benchmarks + human evaluation + safety checks). Hugging Face Hub serves as a de facto model registry for the open-source community, with model cards, versioning, and evaluation results. Enterprise teams often use private registries for proprietary models.

Model Registry

为什么重要

Deep Dive

The Deployment Pipeline

For LLMs

相关概念