DBRX
Developers: Mosaic ML and Databricks team
Release: March 27, 2024
License: Databricks Open Model License[1][2]
Website: https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Repository: https://github.com/databricks/dbrx

DBRX is a large language model (LLM) developed by Mosaic under its parent company Databricks, released on March 27, 2024 under the Databricks Open Model License.[3][4][5] It is a mixture-of-experts transformer model with 132 billion parameters in total, of which 36 billion (4 out of 16 experts) are active for each token.[6] The model was released in two versions: a base foundation model and an instruction-tuned variant.[7]
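The "4 out of 16 experts" figure can be illustrated with a minimal top-k routing sketch. This is not DBRX's actual implementation: the `softmax` and `route_top_k` helpers, the random router logits, and the renormalization of the selected weights are all assumptions made for illustration. Only the expert counts and parameter totals come from the text above.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k):
    """Hypothetical router: keep the k highest-probability experts
    and renormalize their weights so they sum to 1."""
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}


# Figures taken from the article text.
NUM_EXPERTS = 16
ACTIVE_EXPERTS = 4
TOTAL_PARAMS_B = 132
ACTIVE_PARAMS_B = 36

# Random logits stand in for a learned router's output for one token.
random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
weights = route_top_k(logits, k=ACTIVE_EXPERTS)

# Share of parameters used per token, and the number of distinct
# 4-expert subsets a token could be routed to.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
expert_subsets = math.comb(NUM_EXPERTS, ACTIVE_EXPERTS)

print(f"selected experts: {sorted(weights)}")
print(f"active fraction:  {active_fraction:.1%}")
print(f"possible subsets: {expert_subsets}")
```

The active share (about 27%) is slightly higher than 4/16 = 25%, presumably because non-expert parameters such as attention layers and embeddings are used for every token regardless of routing.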

At the time of its release, DBRX outperformed prominent models such as Meta's Llama 2, Mistral AI's Mixtral, and xAI's Grok-1 on several benchmarks covering language understanding, programming ability, and mathematics.[6][8][9]

It was trained for 2.5 months[9], reportedly on 3,072 Nvidia H100s connected by InfiniBand with 3.2 terabytes per second of bandwidth, for a training cost of US$10M.[3][non-primary source needed]
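As a rough back-of-the-envelope check, the reported figures imply an order of magnitude for total GPU-hours and cost per GPU-hour. The sketch below assumes 30-day months, that all 3,072 GPUs ran for the whole period, and that the US$10M covers GPU time only. None of these assumptions is stated in the sources, so the result is indicative at best.

```python
# Figures reported in the article text.
NUM_GPUS = 3072
TRAINING_MONTHS = 2.5
TRAINING_COST_USD = 10_000_000

# Assumption (not from the sources): a month is 30 days and every GPU
# was in use around the clock for the full training period.
DAYS_PER_MONTH = 30
HOURS_PER_DAY = 24

gpu_hours = NUM_GPUS * TRAINING_MONTHS * DAYS_PER_MONTH * HOURS_PER_DAY
cost_per_gpu_hour = TRAINING_COST_USD / gpu_hours

print(f"GPU-hours:         {gpu_hours:,.0f}")
print(f"USD per GPU-hour:  {cost_per_gpu_hour:.2f}")
```

Under these assumptions the run comes to roughly 5.5 million GPU-hours, or about US$1.81 per GPU-hour.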

References

  1. ^ "Databricks Open Model License". Databricks. March 27, 2024. Retrieved August 6, 2025.
  2. ^ "Databricks Open Model Acceptable Use Policy". Databricks. March 27, 2024. Retrieved August 6, 2025.
  3. ^ a b "Introducing DBRX: A New State-of-the-Art Open LLM". Databricks. March 27, 2024. Retrieved March 28, 2024.
  4. ^ Avidon, Eric (March 27, 2024). "New Databricks open source LLM targets custom development". TechTarget. Retrieved March 28, 2024.
  5. ^ Ghoshal, Anirban (March 27, 2024). "Databricks' open-source DBRX LLM beats Llama 2, Mixtral, and Grok". InfoWorld. Retrieved March 28, 2024.
  6. ^ a b "A New Open Source LLM, DBRX Claims to be the Most Powerful – Here are the Scores". GIZMOCHINA. March 28, 2024.
  7. ^ Wiggers, Kyle (March 27, 2024). "Databricks spent $10M on new DBRX generative AI model". TechCrunch. Retrieved March 29, 2024.
  8. ^ "Data and AI company DataBrix has launched a general-purpose large language model (LLM) DBRX that out." Maeil Business Newspaper. March 28, 2024. Retrieved March 28, 2024.
  9. ^ a b Knight, Will (March 27, 2024). "Inside the Creation of the World's Most Powerful Open Source AI Model". Wired. ISSN 1059-1028. Retrieved March 28, 2024.
