Kimi K2.7-Code: Open Weights, First-Party Numbers (2026)
June 13, 2026
Moonshot's Kimi K2.7-Code is a 1T open-weight coding model with cheap API pricing, but every launch benchmark is first-party. Here's what's actually verified.
Moonshot's Kimi K2.7-Code is a 1T open-weight coding model with cheap API pricing, but every launch benchmark is first-party. Here's what's actually verified.
In 17 days, GLM-5.1, Kimi K2.6, and DeepSeek V4 shipped frontier-tier open-weight coding LLMs at a fraction of Western prices. Inside the April 2026 wave.
Moonshot open-sourced FlashKDA, a CUTLASS CUDA kernel for Kimi Delta Attention. Drop-in for flash-linear-attention with up to 2.22x prefill speedup on H20 GPUs.
Moonshot AI's Kimi K2.6 (April 20, 2026): a 1T-parameter open-weight MoE that orchestrates 300 sub-agents and tops GPT-5.4 on SWE-Bench Pro at 58.6%.