GLM-5.1: The Open-Source Model That Beat GPT-5.4
April 19, 2026
GLM-5.1 (April 2026): Z.ai's 754B open-weight model scored 58.4% on SWE-bench Pro, beating GPT-5.4 and Claude Opus 4.6 on real coding benchmarks.