MiniMax M3 Officially Open-Sourced with Native Multimodal Support for One Million Contexts

According to monitoring by Dongcha Beating, the domestic large model company MiniMax has officially open-sourced the weights of its native multimodal mixture of experts (MoE) model MiniMax M3 on Hugging Face. MiniMax M3 has a total parameter count of 428 billion, with 23 billion parameters activated per token, and natively supports one million ultra-long contexts. To reduce deployment memory costs, the development team has simultaneously released an MXFP8 quantized version, compatible with mainstream inference frameworks such as SGLang, vLLM, and Transformers. In terms of multimodal design, MiniMax M3 conducts joint training of text, images, and videos during the pre-training phase to achieve native semantic fusion, rather than aligning multimodal data in the post-training phase. The model operates in two inference modes: the Thinking mode for complex logic and tool orchestration, and the Non-thinking mode for low-latency dialogue and code generation. The underlying kernel supporting one million ultra-long contexts is the lightweight attention kernel library MiniMax Sparse Attention (MSA), which is also open-sourced. Official data shows that MSA employs a grouped query attention (GQA) chunk retrieval mechanism, achieving over 9 times pre-fill acceleration and 15 times decoding speedup in tests with one million tokens on the NVIDIA Blackwell (SM100) architecture, while significantly reducing inference costs.

Recently Searched

Hot Coins

Trending

Daily Must-Read

Welcome Back

Join CoinTime

Sign in with email

Sign up with email

Check your inbox

MiniMax M3 Officially Open-Sourced with Native Multimodal Support for One Million Contexts

All Comments

Recommended for you

BTC Falls Below $66,000

ETH Falls Below $1800

CLARITY Act Proposes $150 Million Funding to Combat Digital Asset Crimes

Trump: Strait of Hormuz Will Fully Resume Navigation by Friday

SK Hynix Responds to Rumors of 100 Trillion Won Shareholder Return Plan

Xiaohongshu's Valuation Reached $50 Billion in Private Secondary Market Trading

DeepSeek's First Round of Financing May Be Finalized with Liang Wenfeng Investing Approximately 20 Billion Yuan

HYPE Surges Over 13% Today, Breaking Previous High at $75.8

European Parliament Votes to Approve EU-US Trade Agreement Legislation

Robinhood Announces 10% Layoff, Anticipates Approximately $28 Million in Restructuring Costs

Daily Must-Read

SuperStrike: As AI Takes Over Financial Decision-Making, A New Era of Wealth Creation Begins

The largest IPO in history! Panoramic analysis of SpaceX's IPO: trillion dollar valuation, track change, and Musk's trillion dollar net worth

The 90 trillion yuan track is officially legalized: the United States opens up perpetual contracts on the blockchain, and encrypted finance completely rewrites the global trading landscape

PayStill Enters the Pure PAYS Era: A Value Aggregator Embarking on a New Growth Cycle

Bitwise deep review: Cryptocurrency market faces three major structural changes, reverse layout window opens as bear market ends

Goldman Sachs makes a heavy judgment: AI supply chain boom spillover, MLCC opens a new cycle of ultra long quantity price until 2030

Popular Activities

RaveDAO at Terra Solis by Tomorrowland: A Female-Led Techno Night Where Web3 Culture Converges

Popular Tags

Share