Databricks DBRX and Mosaic Trajectory 2026

Databricks 2026 open-weight model trajectory: DBRX, DBRX 2, the Mosaic Research pipeline, Genie text-to-SQL, plus the broader Databricks Data Intelligence Platform position.

By Ramanath, CTO & Co-Founder at Presenc AI · Last updated: May 2026

Databricks acquired MosaicML in 2023 for approximately $1.3 billion, then shipped DBRX (132B MoE) in March 2024 as the first frontier-tier model from a data platform vendor. The 2026 trajectory continues with DBRX 2 in development, the Mosaic Research pipeline producing specialised models, and the broader Databricks Data Intelligence Platform positioning AI as the dominant compute layer over Lakehouse data. This page consolidates the model and platform trajectory.

Key Findings

  1. DBRX (a 132B-parameter Mixture-of-Experts model with 36B active parameters, released March 2024) was the first frontier-tier model released by a data platform vendor, showing that such a vendor, given strong training infrastructure, can produce competitive base models.
  2. DBRX 2 is reportedly in development through 2026 with a focus on enterprise structured-data tasks (SQL generation, schema understanding, multi-table reasoning) rather than pure benchmark frontier competition.
  3. Mosaic Research continues producing specialised model variants for Databricks platform integration including embedding models, reranker models, and text-to-SQL specialists.
  4. Genie text-to-SQL is the most-deployed Databricks AI product, providing natural-language query interfaces to Databricks Lakehouse data with explicit grounding to schema metadata.
  5. Databricks Mosaic AI Agent Framework provides the productionisation infrastructure for agents that combine LLMs with Databricks data and tools, and it is widely deployed across enterprise customers.
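
To make the total-versus-active distinction in finding 1 concrete, here is a minimal Python sketch. The two parameter counts come from the text above. Everything else is an assumption for illustration: the function names `active_fraction` and `route_top_k` are hypothetical, the router scores are made up, and the 16-expert, top-4 configuration reflects how Databricks has described DBRX publicly rather than anything stated on this page. It is a toy model of top-k routing, not DBRX's actual implementation.

```python
import math

TOTAL_PARAMS_B = 132   # total parameters in billions, from the text above
ACTIVE_PARAMS_B = 36   # parameters active per token in billions, from the text above


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's weights that participate in processing one token."""
    return active_b / total_b


def route_top_k(router_scores: list[float], k: int) -> dict[int, float]:
    """Toy top-k router: keep the k highest-scoring experts and
    softmax-normalise their scores into mixing weights that sum to 1."""
    top = sorted(range(len(router_scores)),
                 key=lambda i: router_scores[i], reverse=True)[:k]
    peak = max(router_scores[i] for i in top)  # subtract the max for numerical stability
    exps = {i: math.exp(router_scores[i] - peak) for i in top}
    norm = sum(exps.values())
    return {i: e / norm for i, e in exps.items()}


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"active share per token: {share:.1%}")

    # Made-up router scores for 16 experts (assumed count); pick 4 (assumed k).
    scores = [0.1 * i for i in range(16)]
    weights = route_top_k(scores, k=4)
    print("selected experts:", sorted(weights))
```

The active share works out to roughly 27% of the weights per token. That is a little above 4/16 because, in a typical MoE transformer, non-expert weights such as attention layers run for every token regardless of routing.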

Databricks AI Model and Product Family (May 2026)

Product | Status | License
DBRX | 132B MoE base + instruct, March 2024 | Databricks Open Model License
DBRX 2 | In development, expected late 2026 | Anticipated Databricks Open Model License
Genie (text-to-SQL) | GA in Databricks platform | Platform-only
Mosaic AI Agent Framework | GA in Databricks platform | Platform-only
Mosaic AI Vector Search | GA | Platform-only
Databricks Embedding Models | Various | Apache 2.0 (selected models)
Databricks Mosaic AI Model Serving | GA | Platform-only
MPT-7B / MPT-30B (legacy Mosaic) | Open release | Apache 2.0 (base models); fine-tuned variants under other licences, including CC-BY-SA-3.0

DBRX Benchmark Performance (2024 baseline)

Benchmark | DBRX Instruct | Mixtral 8x22B | Llama 3.1 70B
MMLU | ~73.7 | ~77.8 | ~83.6
GSM8K | ~72.8 | ~88.4 | ~95.1
HumanEval | ~70.1 | ~75.0 | ~80.5
BIG-Bench | ~67.4 | ~64.3 | ~71.0

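DBRX shipped in March 2024 and, by Databricks' own reported results, outperformed the open models available at launch, such as Llama 2 70B and Mixtral 8x7B. The comparison models in the table were released in the following months (Mixtral 8x22B in April 2024, Llama 3.1 70B in July 2024), and DBRX Instruct trails them on every listed benchmark except BIG-Bench against Mixtral 8x22B. By 2026 standards the model is materially behind the frontier; the strategic value sits in the Databricks platform integration rather than standalone benchmarks.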
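
The size of those gaps can be read straight off the table. The short Python sketch below does only that arithmetic: the scores are copied from the table above (all approximate, as the "~" indicates), and the `SCORES` structure and `gaps` function are hypothetical names introduced for illustration.

```python
# Approximate scores copied from the benchmark table above.
SCORES = {
    "MMLU":      {"DBRX Instruct": 73.7, "Mixtral 8x22B": 77.8, "Llama 3.1 70B": 83.6},
    "GSM8K":     {"DBRX Instruct": 72.8, "Mixtral 8x22B": 88.4, "Llama 3.1 70B": 95.1},
    "HumanEval": {"DBRX Instruct": 70.1, "Mixtral 8x22B": 75.0, "Llama 3.1 70B": 80.5},
    "BIG-Bench": {"DBRX Instruct": 67.4, "Mixtral 8x22B": 64.3, "Llama 3.1 70B": 71.0},
}


def gaps(baseline: str = "DBRX Instruct") -> dict[str, dict[str, float]]:
    """Points by which each comparison model leads (+) or trails (-) the baseline."""
    result = {}
    for bench, row in SCORES.items():
        result[bench] = {
            model: round(score - row[baseline], 1)
            for model, score in row.items()
            if model != baseline
        }
    return result


if __name__ == "__main__":
    for bench, row in gaps().items():
        print(bench, row)
```

On these figures the largest gap is on GSM8K and the only benchmark where DBRX Instruct leads a comparison model is BIG-Bench against Mixtral 8x22B.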

Databricks Data Intelligence Platform Position

Layer | Databricks Component
Data ingestion and storage | Delta Lake, Unity Catalog
ETL and transformation | Spark, Delta Live Tables
Data warehouse SQL | Databricks SQL Warehouse
Vector and embedding | Mosaic AI Vector Search
Model training and finetuning | Mosaic AI Training
Model serving | Mosaic AI Model Serving
Agent and tool framework | Mosaic AI Agent Framework
Natural-language query | Genie
Governance and audit | Unity Catalog, MLflow

Strategic Context

Three patterns shape Databricks' 2026 AI strategy. First, the data-platform-with-AI thesis: Databricks is positioning AI as the dominant compute layer over Lakehouse data, with AI capabilities deeply integrated with data access and governance. Second, the open-weight strategy is selective: DBRX was released openly to signal capability, but the platform-integrated products (Genie, Mosaic AI Agent Framework) are platform-only. Third, the enterprise positioning: Databricks competes against Snowflake (with Cortex AI) and against the hyperscalers' AI offerings (Vertex AI, SageMaker, Azure AI) by emphasising a unified data-and-AI platform.

Brand Visibility Implications

Databricks is a major enterprise AI procurement category. AI assistant queries such as "Databricks vs Snowflake AI", "DBRX model", and "text-to-SQL AI" drive procurement-research traffic. Brands selling data-platform integrations, AI agent frameworks, vector databases, and enterprise text-to-SQL therefore have a large AI-mediated discovery surface in this category.

Methodology

Product and benchmark data compiled from Databricks investor and product disclosures, plus primary Hugging Face model card data through 23 May 2026. Updated quarterly.

How Presenc AI Helps

Presenc AI monitors brand visibility on Databricks and data-platform AI queries across ChatGPT, Claude, Gemini, and Perplexity. For data platform integrations, AI agent framework vendors, vector database brands, and text-to-SQL services, the platform identifies the prompts driving procurement-research traffic and the gaps where new content unlocks share of voice.

Frequently Asked Questions

What is DBRX?
A 132B Mixture-of-Experts model with 36B active parameters released by Databricks in March 2024. DBRX was the first frontier-tier model released by a data platform vendor. By 2026 benchmarks the model is materially behind the frontier, but the strategic value sits in the Databricks platform integration.

Has DBRX 2 been released?
Not as of May 2026. DBRX 2 is reportedly in development with a focus on enterprise structured-data tasks (SQL generation, schema understanding, multi-table reasoning) rather than pure benchmark frontier competition. Release is expected in late 2026.

What is Genie?
A natural-language-to-SQL product integrated into the Databricks platform. Genie provides chat-based query interfaces to Lakehouse data with explicit grounding to schema metadata. The product is platform-only (not separately licensable) and is widely deployed for self-service analytics use cases.

How does Databricks AI compare with Snowflake Cortex?
Both vendors provide AI features tightly integrated with their respective data platforms. Databricks emphasises Mosaic AI Agent Framework for productionising agents over data; Snowflake Cortex emphasises Cortex Analyst (similar to Genie) plus Cortex Search. Competitive outcomes depend more on customers' existing data-platform commitments than on feature-by-feature AI comparisons.

How are DBRX and the Databricks AI products licensed?
DBRX is released under the Databricks Open Model License, which permits commercial use with some restrictions. The platform-integrated products (Genie, Mosaic AI Agent Framework, Mosaic AI Vector Search) are platform-only and require a Databricks subscription.
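
As a rough illustration of what "explicit grounding to schema metadata" can mean for a text-to-SQL product, the Python sketch below builds a prompt from table and column names and then checks that a generated query only references known tables. This is a generic pattern under stated assumptions, not Genie's implementation or any Databricks API: the `SCHEMA` contents, `build_grounded_prompt`, and `unknown_tables` are all hypothetical, and the regex check is deliberately simplistic.

```python
import re

# Hypothetical schema metadata; in practice this would come from a data catalog.
SCHEMA = {
    "orders": ["order_id", "customer_id", "order_date", "amount"],
    "customers": ["customer_id", "region"],
}


def build_grounded_prompt(question: str, schema: dict[str, list[str]]) -> str:
    """Embed real table and column names in the prompt so the model is
    steered towards identifiers that actually exist."""
    lines = [f"- {table}({', '.join(cols)})" for table, cols in schema.items()]
    return (
        "Answer with a single SQL query using only these tables:\n"
        + "\n".join(lines)
        + f"\nQuestion: {question}\nSQL:"
    )


def unknown_tables(sql: str, schema: dict[str, list[str]]) -> set[str]:
    """Return table names that follow FROM or JOIN but are absent from the schema.
    A naive regex check; it ignores subqueries, CTEs, and qualified names."""
    referenced = re.findall(
        r"\b(?:from|join)\s+([A-Za-z_][A-Za-z0-9_]*)", sql, flags=re.IGNORECASE
    )
    return {t for t in referenced if t.lower() not in schema}


if __name__ == "__main__":
    print(build_grounded_prompt("Total sales by region?", SCHEMA))
    # A query against a table that is not in the schema is flagged.
    print(unknown_tables("SELECT * FROM invoices", SCHEMA))
```

The two halves mirror the two places grounding usually applies: before generation (the model sees the real schema) and after generation (the output is validated against it before execution).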
