A model registry typically stores: the model artifact (weights, configuration), training metadata (hyperparameters, dataset version, training duration), evaluation metrics (accuracy, latency, fairness metrics across demographics), deployment status (which version is serving in production), and lineage (which experiment, code commit, and data pipeline produced this model). MLflow Model Registry, Weights & Biases, and SageMaker Model Registry are popular implementations.
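The fields above can be sketched as a minimal registry entry. This is an illustrative data shape, not the schema of MLflow, W&B, or SageMaker; all names and values are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class ModelVersion:
    """One version of a registered model; fields mirror what registries track."""
    name: str
    version: int
    artifact_uri: str        # model artifact: weights + configuration
    hyperparameters: dict    # training metadata
    dataset_version: str
    metrics: dict            # evaluation: accuracy, latency, fairness
    stage: str = "none"      # deployment status: none / staging / production
    lineage: dict = field(default_factory=dict)  # experiment, commit, pipeline

# Hypothetical entry for a registered model
entry = ModelVersion(
    name="churn-classifier",
    version=3,
    artifact_uri="s3://models/churn/3",
    hyperparameters={"lr": 1e-3, "epochs": 10},
    dataset_version="2024-05-01",
    metrics={"accuracy": 0.91, "p95_latency_ms": 12.0},
    lineage={"commit": "a1b2c3d", "experiment": "exp-42"},
)
print(entry.stage)  # → none
```

Keeping metrics and lineage as structured fields (rather than free text) is what lets a registry answer questions like "which dataset produced the model currently in production?"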
In production workflows, the model registry is the handoff point between training and serving: a data scientist trains and evaluates models, registers the best one, a reviewer approves it, and the deployment system pulls the approved model and serves it. This separation of concerns — training doesn't directly touch production, deployment only uses registry-approved models — reduces the risk of deploying broken models.
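The handoff can be made concrete as a small state machine: a version starts as pending review, a reviewer approves it, and only then will the deployment path hand out its artifact. This is a toy sketch of the separation of concerns, not any real registry's API.

```python
class ModelRegistry:
    """Toy registry enforcing the handoff: deployment only sees approved versions."""

    def __init__(self):
        self._versions = {}  # (name, version) -> record

    def register(self, name, version, artifact_uri):
        # Training side: register a candidate; it is NOT yet deployable.
        self._versions[(name, version)] = {
            "uri": artifact_uri,
            "status": "pending_review",
        }

    def approve(self, name, version):
        # Reviewer side: promote after evaluation sign-off.
        self._versions[(name, version)]["status"] = "approved"

    def pull_for_deployment(self, name, version):
        # Serving side: refuse anything that has not passed review.
        record = self._versions[(name, version)]
        if record["status"] != "approved":
            raise PermissionError(f"{name} v{version} is not approved")
        return record["uri"]

registry = ModelRegistry()
registry.register("churn-classifier", 3, "s3://models/churn/3")
try:
    registry.pull_for_deployment("churn-classifier", 3)
except PermissionError as e:
    print(e)  # rejected: still pending review
registry.approve("churn-classifier", 3)
print(registry.pull_for_deployment("churn-classifier", 3))
```

The key design choice is that `pull_for_deployment` is the only path to an artifact URI, so an unreviewed model cannot reach production even by accident.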
LLM registries have specific needs: models are very large (tens to hundreds of GB), fine-tuned variants share a common base model (so registries store small adapters separately from the shared base weights), and evaluation is more complex (automated benchmarks + human evaluation + safety checks). Hugging Face Hub serves as a de facto model registry for the open-source community, with model cards, versioning, and evaluation results. Enterprise teams often use private registries for proprietary models.
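The adapter-sharing idea can be sketched as follows: each fine-tuned variant records a pointer to the shared base weights plus its own small adapter, so the multi-hundred-GB base is stored once. Names, URIs, and eval fields here are hypothetical.

```python
class LLMRegistry:
    """Sketch: variants store an adapter + a reference to shared base weights,
    rather than duplicating the full base model per fine-tune."""

    def __init__(self):
        self.base_models = {}  # base name -> weights URI (stored once)
        self.variants = {}     # variant name -> base ref, adapter, evals

    def register_base(self, name, weights_uri):
        self.base_models[name] = weights_uri

    def register_variant(self, name, base, adapter_uri, evals=None):
        # evals can combine automated benchmarks, human ratings, safety checks
        self.variants[name] = {
            "base": base,
            "adapter_uri": adapter_uri,
            "evals": evals or {},
        }

    def resolve(self, variant):
        """Return what serving needs: (base weights URI, adapter URI)."""
        v = self.variants[variant]
        return self.base_models[v["base"]], v["adapter_uri"]

reg = LLMRegistry()
reg.register_base("base-8b", "s3://models/base-8b")
reg.register_variant(
    "support-bot",
    base="base-8b",
    adapter_uri="s3://adapters/support-v1",
    evals={"benchmark_score": 0.66, "safety_review": "passed"},
)
print(reg.resolve("support-bot"))
```

Ten fine-tunes of the same base then cost ten adapters (typically MBs each) plus one copy of the base weights, and `resolve` reassembles the pair at serving time.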