Model Registry: Definition & Meaning — AI Wiki

對整個生命週期裡訓練好的機器學習模型做版本管理、追蹤、治理的中心化系統。像套件 registry(npm、PyPI)但是用於 ML 模型:每個模型版本都和它的元資料一起存(訓練資料、超參數、性能指標、血統),讓你能重現結果、比較版本、部署特定模型到生產。

為什麼重要

沒有 model registry,ML 開發就變混亂:哪個版本的模型在生產?它用什麼資料訓練的?上次更新是什麼時候?誰訓練的?model registry 回答所有這些問題,為可重現、可稽核、可靠的 ML 部署提供基礎。對任何在生產中跑模型的團隊,它是必需的基礎設施。

Deep Dive

A model registry typically stores: the model artifact (weights, configuration), training metadata (hyperparameters, dataset version, training duration), evaluation metrics (accuracy, latency, fairness metrics across demographics), deployment status (which version is serving in production), and lineage (which experiment, code commit, and data pipeline produced this model). MLflow Model Registry, Weights & Biases, and SageMaker Model Registry are popular implementations.

The Deployment Pipeline

In production workflows, the model registry is the handoff point between training and serving: a data scientist trains and evaluates models, registers the best one, a reviewer approves it, and the deployment system pulls the approved model and serves it. This separation of concerns — training doesn't directly touch production, deployment only uses registry-approved models — reduces the risk of deploying broken models.

For LLMs

LLM registries have specific needs: models are very large (tens to hundreds of GB), fine-tuned variants share a common base model (store adapters separately), and evaluation is more complex (automated benchmarks + human evaluation + safety checks). Hugging Face Hub serves as a de facto model registry for the open-source community, with model cards, versioning, and evaluation results. Enterprise teams often use private registries for proprietary models.

Model Registry

為什麼重要

Deep Dive

The Deployment Pipeline

For LLMs

相關概念