MiniMax M3: Open-Weight Coding at 1/10 the Cost (2026)
June 9, 2026
MiniMax M3 is an open-weight coding model whose Sparse Attention runs 1M-token context at 1/20th the compute. Its benchmark scores beat GPT-5.5's, with caveats.
Four Chinese labs shipped open-weight coding models in 18 days. Inside the benchmarks, prices, and architectures reshaping agentic coding economics in 2026.
In 17 days, GLM-5.1, Kimi K2.6, and DeepSeek V4 shipped frontier-tier open-weight coding LLMs at a fraction of Western prices. Inside the April 2026 wave.