Grok 5 vs Gemini 3.5: Video separates hype from released models as of mid-2026
π The letter grade, factuality score, and political-lean rating for this report are part of CladFacts Premium. The full report below is free to read.
Topics in This Edition
Summary
The video examines claims that xAI's unreleased Grok 5 has won the AGI race against Google's Gemini 3.5 family. It contrasts Gemini 3.5 Flash, released in May 2026 with agentic and coding strengths, against Grok 5 still in training on the Colossus cluster. It covers Grok 4.3 as the current released xAI model, benchmarks from Artificial Analysis, pricing, capabilities like video understanding, and the multi-lab competitive landscape including OpenAI and Anthropic.
Editorial Assessment
The broadcast holds up well as a measured corrective to viral hype, correctly emphasizing the asymmetry between a partially released Gemini family and an unreleased Grok 5. It draws appropriately on independent benchmarks rather than company claims and provides useful context on AGI definitions and incremental progress. Minor limitations include reliance on reported (not always primary) parameter counts and prediction-market probabilities without deeper sourcing on every infrastructure figure. Viewers gain tools to assess future releases but may miss the very latest post-recording updates on Gemini 3.5 Pro rollout.
Key Moments
Grok 5 has not been released as of mid-June 2026; Q1 and Q2 2026 targets missed.
Multiple sources confirm no release, with Q2 window closing and Polymarket odds around 33% by June 30.
Gemini 3.5 Flash released May 2026 at I/O; emphasizes agentic tasks and outperforms prior Pro on coding/agentic benchmarks.
Google blog and independent reports confirm May 19 release, pricing, 1M context, and benchmark gains on Terminal-Bench and MCP Atlas.
Grok 4.3 scores 53 on Artificial Analysis Intelligence Index; Gemini 3.5 Flash around 55 with nuance on agentic strength.
Artificial Analysis data directly supports Grok 4.3 score; comparisons show Flash competitive with strengths in specific agentic areas.
Colossus cluster in Memphis involves hundreds of thousands of GPUs and massive power draw (hundreds of MW to GW scale).
xAI and reporting confirm 200k+ GPUs, phased expansion, and power infrastructure details.
AGI remains undefined across researchers; progress is incremental across multiple labs.
Transcript accurately reflects ongoing definitional debates and multi-lab leaderboard shifts.
Sources Consulted
- News: Research, Product & Company Updates
- Colossus: The World's Largest AI Supercomputer
- Innovations from Google I/O 26 on Google Cloud
- Artificial Analysis: AI Model & API Providers Analysis
- Colossus (supercomputer)
- Grok 5
- With Gemini 3.5 Flash, Google bets its next AI wave on agents, not chatbots
- Grok 5 Launch Tracker: What AI Builders Should Watch
- The 2026 AI Index Report
- Inside the 100K GPU xAI Colossus Cluster that Supermicro Helped Build for Elon Musk
- Everything Google announced at I/O 2026: Gemini, Search, Android XR, & more
- Models