Company

Databricks

Mosaic ML, DBRX, Unity Catalog
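A data and AI platform that provides unified analytics, data engineering, and machine learning capabilities. Databricks acquired Mosaic ML in 2023 to add LLM training capabilities, and released its own open-weight LLM, DBRX, in March 2024. The platform is built on Apache Spark and provides managed infrastructure for the full ML lifecycle, from data preparation to model serving.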

Why It Matters

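Databricks is where enterprise data and AI meet. Most companies' AI ambitions start with "we need to get our data in order," and Databricks is often the platform that handles data engineering, feature engineering, model training, and serving in one place. Its acquisition of Mosaic ML (known for efficient LLM training) signals that data platforms and AI platforms are converging.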

Deep Dive

Databricks' ML stack includes MLflow (a widely used open-source tool for ML experiment tracking, created by Databricks), Unity Catalog (data governance and model registry), Mosaic ML's training infrastructure (used to train DBRX), and model serving endpoints. The platform covers the full workflow from raw data in a lakehouse to a deployed model, which is what sets it apart from point solutions that address only one of those stages.

DBRX

DBRX is Databricks' open-weight LLM. It uses a Mixture of Experts architecture with 132B total parameters, of which 36B are active for any given token. At its release in March 2024 it was competitive with Llama 2 70B and Mixtral 8x7B. More than the model itself, DBRX demonstrated Databricks' ability to train large-scale models in-house, validating the Mosaic ML acquisition and positioning the company as a credible AI lab alongside its platform business.
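The gap between total and active parameters comes from how a Mixture of Experts layer routes each token to only a few experts. The following is a minimal sketch of top-k routing in plain Python, not DBRX's actual implementation. The toy experts, the random router scores, and the names `route_top_k` and `moe_forward` are hypothetical, and the 16-expert, 4-active configuration is an assumption taken from Databricks' public description of DBRX rather than from this entry.

```python
import math
import random

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are the softmax probabilities of the chosen experts,
    renormalized so that they sum to 1.
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

def moe_forward(x, experts, router_logits, k):
    """Weighted sum of the outputs of only the k selected experts."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))

# Figures from the entry: 132B total parameters, 36B active per token.
TOTAL_PARAMS_B = 132
ACTIVE_PARAMS_B = 36
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Assumed configuration (not stated in this entry): 16 experts, 4 active.
N_EXPERTS = 16
TOP_K = 4

# Toy experts: expert i simply scales its input by (i + 1).
experts = [lambda x, s=i + 1: s * x for i in range(N_EXPERTS)]

# Stand-in router scores; a real router computes these from the token.
rng = random.Random(0)
logits = [rng.gauss(0, 1) for _ in range(N_EXPERTS)]

chosen = route_top_k(logits, TOP_K)
y = moe_forward(2.0, experts, logits, TOP_K)

print(f"active fraction of parameters: {active_fraction:.1%}")
print(f"experts used for this token: {[i for i, _ in chosen]}")
print(f"layer output: {y:.3f}")
```

Only the selected experts run for a given token, so compute per token tracks the active parameter count rather than the total. The active fraction (36 of 132, about 27%) is somewhat higher than 4/16 would suggest, which is consistent with some parameters, such as attention layers and embeddings, being shared by every token.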
