MPT-7B: the first open-source model explicitly built for commercial use
In one sentence MosaicML launches MPT-7B under Apache 2.0 with a 65,000-token context window via ALiBi, the first open model explicitly designed for unrestricted commercial deployment.
Before MPT-7B, open-source models like LLaMA carried an important restriction: you couldn't use them to make money. They were built for research, not commercial products.
MosaicML changed the rules by releasing MPT-7B under the Apache 2.0 license — the same used by vast amounts of commercial open-source software. You can build a product on top, sell it, integrate it into a business service.
There was also an important technical innovation: MPT-7B used a technique called ALiBi that let the model read very long texts — up to 65,000 words, compared to the usual 4,000. For anyone working with documents, contracts, or system logs, that capability is enormous. It's no coincidence that three months after launch, MosaicML was acquired by Databricks for 1.3 billion dollars.
Companies
MosaicML
Tools
—
Tags
Sources