Zubnet AI学习Wiki › Databricks
公司

Databricks

Mosaic ML, DBRX, Unity Catalog
一个数据和 AI 平台,提供统一的分析、数据工程和机器学习能力。Databricks 在 2023 年收购了 Mosaic ML 以加入 LLM 训练能力,并发布了自己的开源 LLM DBRX。平台建在 Apache Spark 上,为从数据准备到模型服务的完整 ML 生命周期提供托管基础设施。

为什么重要

Databricks 是企业数据和 AI 相遇的地方。大多数公司的 AI 野心都从“我们需要搞清楚自己的数据”开始,Databricks 往往是在一个地方处理数据工程、特征工程、模型训练、服务的平台。他们收购 Mosaic ML(以高效 LLM 训练闻名)标志着数据平台和 AI 平台正在收敛。

Deep Dive

Databricks' ML stack includes: MLflow (the most popular open-source ML experiment tracking tool, created by Databricks), Unity Catalog (data governance and model registry), Mosaic ML's training infrastructure (used to train DBRX), and model serving endpoints. The platform handles the full workflow from raw data in a lakehouse to a deployed model, which is its key differentiator from point solutions.

DBRX

DBRX is Databricks' open-weight LLM, using a Mixture of Experts architecture (132B total, 36B active). It was competitive with Llama 2 70B and Mixtral 8x7B at release. More than the model itself, DBRX demonstrated Databricks' ability to train frontier-scale models in-house, validating their Mosaic ML acquisition and positioning them as a credible AI lab alongside their platform business.

相关概念

In The News

Databricks' $5B war chest fuels AI security play with dual acquisitions
Mar 24, 2026
← 所有术语
ESC