AI News

Latest updates from the world of artificial intelligence

All Models Products Companies Research Industry

ModelsJun 22, 2026

Sakana AI's Fugu Turns Model Orchestration Into a Product

The Tokyo lab made Fugu generally available on June 22 — a foundation model trained to command a pool of other AIs through one OpenAI-compatible API, pitched as a hedge against single-vendor lock-in.

Sakana AI

ModelsJun 20, 2026

GLM-5.2 Review: The Open Model That Out-Codes GPT-5.5

GLM-5.2 is the strongest open-weight coding model yet: MIT-licensed, 744B MoE, 1M context. It beats GPT-5.5 on SWE-bench Pro and trails Opus 4.8 by a point on FrontierSWE — at a fraction of the cost. Our review, with benchmark charts.

The Decoder

ModelsJun 18, 2026

GPT-5.6: Everything We Know So Far — Rumors, Leaks, and Where It Stands in Testing

OpenAI hasn't announced GPT-5.6, but a vanished Codex log entry, a string of alpha codenames and a pile of leaks sketch a model already in canary testing. Here's what's confirmed, what's rumored, and when it might ship — with the appropriate skepticism.

WaveSpeed Blog

ModelsJun 18, 2026

Xiaomi's MiMo-V2-Pro Cracks the Global Top 10 for AI Reasoning — at a Fraction of the Price

Xiaomi's reasoning model scores 49 on the Artificial Analysis Intelligence Index — #10 worldwide, between Kimi K2.5 and GLM-5 — while running its whole benchmark suite for just $348.

Artificial Analysis

ModelsJun 17, 2026

GPT-5.6 Rumors Reach a Fever Pitch: Prediction Markets Bet on a Late-June Launch, Leaks Claim a 1.5M-Token Window

No OpenAI announcement exists yet, but developer leaks and prediction markets now point to a GPT-5.6 launch in late June — Polymarket prices the June 22–28 window near 83% — with rumored upgrades including a ~1.5M-token context and stronger agentic coding. Treat it all as unconfirmed.

Cryptopolitan

ModelsJun 16, 2026

Zhipu Releases GLM-5.2: a 744B-Parameter, 1M-Token Coding Model Under a Full MIT License

Zhipu (Z.ai) has launched GLM-5.2, a 744B-parameter Mixture-of-Experts model (40B active) with a 1M-token context window, built for agentic coding and released under a permissive MIT license. The catch: no benchmarks were published at launch, so performance claims remain vendor assertions for now.

Z.ai

ModelsJun 14, 2026

Moonshot AI Ships Kimi K2.7-Code, an Open-Weight Coding Model It Says Uses 30% Fewer Reasoning Tokens

The Beijing lab’s fifth K2 release in a year is a coding-first, 1-trillion-parameter open-weight model under a modified MIT license — but independent testers say the headline benchmarks are Moonshot’s own and don’t all hold up.

Crypto Briefing

ModelsJun 12, 2026

Opus 4.8 vs Fable 5, One Prompt Each: We Made Them Build a Game and an Interchange — and They Split 1–1

We pasted identical prompts into Claude Opus 4.8 and Claude Fable 5 — an animated highway interchange in pure SVG, then a one-file Space Invaders — one shot each, no retries. Both games are playable inside the article. The split decision reveals two very different temperaments.

BitsMinds Lab

ModelsJun 11, 2026

Review: Claude Fable 5 Is the Most Capable Public Model Yet — for a Premium

Anthropic's Claude Fable 5 is the most capable model the public can use today — topping SWE-Bench Pro and excelling at vision and long tasks. It is also the priciest major model and ships with hard safety guardrails. Our first-look verdict.

BitsMinds Reviews

ModelsJun 9, 2026

Anthropic Launches Claude Fable 5: Its Most Powerful Public Model Yet, With the Cyber Edges Sanded Off

Anthropic released Claude Fable 5 — the first publicly available model in its powerful Mythos class — scoring 80.3 on SWE-Bench Pro (vs 69.2 for Opus 4.8), priced at $10/$50 per million tokens, with classifier guardrails that block cyber, bio, and distillation requests and fall back to Opus 4.8.

Anthropic

ModelsJun 9, 2026

Claude Fable 5: The Internet Is Sure It Ships Today. Anthropic Hasn't Said a Word.

Prediction markets put it near 94%, a tech reporter says it lands June 9, and internal checkpoints named claude-fable-5 have surfaced — but Anthropic has confirmed nothing. Here is what the noise around Claude Fable 5, the rumored public release of its Mythos model, actually amounts to.

Reporting: Alex Heath; Polymarket

ModelsJun 9, 2026

Xiaomi Pushes a 1-Trillion-Parameter Model Past 1,000 Tokens a Second — on Eight Off-the-Shelf GPUs

MiMo-V2.5-Pro-UltraSpeed broke the 1,000 tokens-per-second barrier on a one-trillion-parameter model using a single eight-GPU commodity node — roughly 15x faster than GPT-5.5 or Claude Opus, and without a line of custom silicon.

MarkTechPost

ModelsJun 7, 2026

MiniMax M3 Lands as an Open-Weight, Million-Token Coding Model That Claims to Edge Out GPT-5.5

The Chinese lab says its new open-weight model pairs frontier coding, a 1M-token context window and native multimodality on a sparse-attention architecture that cuts long-context compute 20x — but the weights and technical report are still days away, so every number is company-reported.

The Decoder

ModelsJun 6, 2026

Anthropic's Unreleased 'Claude Oceanus' Reportedly Leaked Through a Chinese Proxy Hours After Reaching Red Teamers

A model identifier, claude-oceanus-v1-p, surfaced in Anthropic's console on June 3 and went to vetted red-team testers — then, within hours, an unknown actor allegedly resold access through a Chinese API proxy. None of it is officially confirmed, but it offers an early look at the next model in the Mythos line.

Cyber Security News

ModelsJun 4, 2026

Reve 2.0 and Ideogram 4 Land the Same Day, Both Betting Image AI's Next Move Is Layout Control

On June 3, two image-AI startups shipped models built on the same wager: that the future of generation is precise layout control, not ever-longer prose prompts. Reve's proprietary 2.0 chases 4K and "images you can touch"; Ideogram's open-weight 4 nails in-image text and bounding-box placement. Both are gunning for OpenAI's GPT Image 2 and Google's Gemini.

Latent Space

ModelsJun 4, 2026

OpenAI Rebuilds GPT-Rosalind on GPT-5.5 and Widens Access — Its Science Model Now Beats the General One on Biology

OpenAI's life-sciences model just got a major upgrade. Rebuilt on GPT-5.5, it now outscores the general model on a new benchmark called LifeSciBench across drug discovery, genomics, and medicinal chemistry — and OpenAI is widening access beyond launch partners like Moderna, Amgen, and Novo Nordisk.

OpenAI

ModelsJun 3, 2026

Microsoft Aims MAI-Thinking-1 Straight at Claude: a 35B Reasoning Model It Says Beats Sonnet 4.6 and Matches Opus 4.6 on Code

At Build 2026, Microsoft’s AI Superintelligence Team unveiled MAI-Thinking-1, its first in-house reasoning model — a sparse Mixture-of-Experts design with 35B active parameters (~1T total) and a 256K context window. Microsoft says human raters prefer it to Claude Sonnet 4.6 and that it matches Opus 4.6 on the SWE-Bench Pro coding benchmark, and it pointedly trained the model from scratch with zero distillation on commercially licensed data — no OpenAI involved.

Microsoft AI / TechTimes

ModelsJun 2, 2026

JetBrains Open-Sources Mellum2, a 12B Mixture-of-Experts Model Built to Be a Fast "Focal" Part, Not a Frontier Rival

JetBrains released Mellum2 under Apache 2.0 on June 2 — a 12B MoE model that activates just 2.5B parameters per token for 2x-faster inference, pitched as a fast specialized component for multi-model AI pipelines.

JetBrains AI Blog

ModelsJun 1, 2026

Microsoft Will Unveil Its Own GitHub Copilot Coding Model at Build — a Direct Shot at Claude Code

According to The Information, Microsoft plans to debut a homegrown coding model for GitHub Copilot at its Build conference on June 2–3, part of a wider suite of in-house MAI models led by Mustafa Suleyman. The push to reduce reliance on OpenAI, Anthropic and Google comes as Copilot has lost ground to Anthropic’s Claude Code among developers.

The Information / TestingCatalog

ModelsMay 29, 2026

Claude Opus 4.8 vs GPT-5.5 vs Gemini 3.1 Ultra: Benchmarks

A BitsMinds analysis. We put each lab’s strongest model through the benchmark portfolio that actually separates frontier systems in 2026 — GPQA Diamond, ARC-AGI-2, AIME, SWE-bench, BFCL and more — then weigh price, context and speed. The short version: Gemini 3.1 Ultra is the sharpest reasoner and the best value, Claude Opus 4.8 owns real coding and agentic work, and GPT-5.5 is the strong all-rounder that everyone already has. Here is the full scorecard.

BitsMinds Analysis

ModelsMay 28, 2026

Anthropic Ships Claude Opus 4.8: Dynamic Workflows Spawn Hundreds of Parallel Subagents, and It’s the “Most Honest” Claude Yet

Anthropic released Claude Opus 4.8 on May 28, 41 days after 4.7 — adding Dynamic Workflows that plan and run hundreds of parallel subagents per session, a user-facing effort dial, a 2.5× faster fast mode at 3× lower cost, and its “most honest” self-review behavior yet, at 4.7’s price.

Anthropic / TechCrunch

ModelsMay 25, 2026

Anthropic's Leaked Forbidden-Strings File Names Opus 4.8, Sonnet 4.8 and a New Tier Above Opus Called Capybara

Inside the 512K-line Claude Code source map that npm shipped on March 31 was a feature literally called Undercover Mode — and inside Undercover Mode was a hard-coded list of names it was supposed to scrub from public builds. That list spells out Opus 4.8, Sonnet 4.8, a never-released Sonnet 4.7 and animal codenames Capybara and Tengu — with Capybara described internally as a new tier sitting above Opus.

WaveSpeed AI / ChaoBro / AI Nexus Daily

ModelsMay 22, 2026

Alibaba's Qwen3.7-Max Lands With a 1M-Token Context, AA Index 56.6, and a 35-Hour Agent Run

Qwen3.7-Max-Preview ships with a 1M-token context, extended-thinking mode, and benchmark gains across CritPt, Humanity's Last Exam, and Terminal-Bench Hard — clearing Gemini 3.5 Flash on the AA Intelligence Index.

Qwen (Alibaba Cloud)

ModelsMay 16, 2026

Google's Gemini 'Omni' Video Model Leaks Days Before I/O 2026, Pointing to a Unified Multimodal System

Test UI strings inside Gemini reveal a new Google video model that lets users remix and edit clips directly in chat, with a likely debut at I/O 2026 on May 19-20.

9to5Google

ModelsMay 12, 2026

Mira Murati's Thinking Machines Unveils TML-Interaction-Small, a Full-Duplex Voice Model That Beats GPT and Gemini

Mira Murati's Thinking Machines Lab released a research preview of TML-Interaction-Small, a 276B mixture-of-experts model that responds in under 0.4 seconds and listens while it speaks — outpacing GPT-realtime-2 and Gemini 3.1 Flash Live on the FD-bench latency test.

TechCrunch

ModelsMay 10, 2026

OpenAI Ships GPT-Realtime-2, Translate, and Whisper, Bringing GPT-5 Reasoning Into Voice Apps

OpenAI rolled out three new voice models on May 7 — a reasoning agent with a 128K context window, a 70-language live translator at $0.034 a minute, and a streaming Whisper that transcribes as you speak.

OpenAI

ModelsMay 9, 2026

Google Ships Gemini 3.1 Flash-Lite to GA at $0.25 per Million Input Tokens

Gemini 3.1 Flash-Lite is now generally available on Vertex AI and Gemini Enterprise, pitched as the cheapest, fastest member of the Gemini 3 family for high-volume agentic, classification, and tool-use workloads.

Google Cloud Blog

ModelsMay 3, 2026

Mistral Medium 3.5 Lands With Cloud Coding Agents and 77.6% on SWE-Bench

Mistral fuses chat, reasoning, and coding into a single 128B dense model and pairs it with Vibe — async cloud-based coding agents that hand back work as pull requests instead of terminal output.

Mistral AI

ModelsApr 30, 2026

DeepSeek V4 Preview Closes Gap With Frontier Models at a Fraction of the Price

Chinese lab DeepSeek released two open-weight V4 preview models with 1M-token context windows, matching closed-source rivals on reasoning while undercutting them on price.

TechCrunch

ModelsApr 29, 2026

NVIDIA Unveils Nemotron 3 Nano Omni: Open Multimodal Model with 9x Throughput

NVIDIA's new 30B-A3B mixture-of-experts model unifies vision, audio, and language into a single open system, delivering up to nine times the throughput of comparable open omni models for AI agents.

NVIDIA Blog

ModelsApr 24, 2026

OpenAI Launches GPT-5.5: Smarter, Faster, and Built for Agentic Work

OpenAI's GPT-5.5, codenamed "Spud," brings 82.7% Terminal-Bench 2.0 performance and autonomous multi-step task execution — a step toward OpenAI's unified AI super app.

TechCrunch

ModelsApr 24, 2026

DeepSeek Releases V4: Open-Source Frontier Model at a Fraction of the Price

DeepSeek's V4-Pro and V4-Flash arrive with 1.6 trillion parameters, a 1M-token context window, and Apache 2.0 licensing — matching near-frontier performance at 21x less cost than Claude Opus 4.7.

Simon Willison

ModelsApr 21, 2026

Google Launches Gemini 3.1 Pro: Tops ARC-AGI-2 Benchmark with 2M Token Context and Agentic Coding

Google's Gemini 3.1 Pro hits a verified 77.1% on the ARC-AGI-2 reasoning benchmark — more than double its predecessor — and arrives with a 2-million-token context window, multi-step agentic coding, and support across Gemini API, Vertex AI, and NotebookLM.

Google

ModelsApr 21, 2026

Meta Launches Muse Spark: Multimodal AI with Parallel Agents, Health Insights, and Shopping Intelligence

Meta Superintelligence Labs unveils Muse Spark, a fast multimodal model that deploys parallel AI subagents, analyzes medical images, generates functional websites, and surfaces social-powered shopping recommendations.

xAI Launches Grok 4.3 Beta: Native Video Input, Document Generation, and Desktop Automation

xAI has launched Grok 4.3 in beta with native video processing, PDF and spreadsheet generation, and tighter Grok Computer integration — behind a $300/month SuperGrok Heavy paywall ahead of a broader May rollout.

TechSifted

ModelsApr 21, 2026

Moonshot AI Releases Kimi K2.6: Open-Source Giant with 1T Parameters and 300-Agent Swarms

Moonshot AI has released Kimi K2.6, a 1-trillion-parameter open-source model capable of coordinating 300 parallel sub-agents across 4,000 steps — available on Hugging Face under a Modified MIT License.

MarkTechPost

ModelsApr 19, 2026

Google Gemma 4: Open Models That Outperform Systems 20x Their Size

Google DeepMind released Gemma 4 on April 2, a family of four open-weight multimodal models under Apache 2.0 that bring frontier-level reasoning and agentic capabilities to phones, edge devices, and developer environments.

Google DeepMind

ModelsApr 19, 2026

Anthropic's Claude Mythos 5: The First 10-Trillion-Parameter AI Model

Anthropic has revealed Claude Mythos 5, the first publicly acknowledged 10-trillion-parameter AI model, engineered for cybersecurity, advanced coding, and deep academic reasoning — but not yet available to the public.

Anthropic / AI Analytics Diaries

ModelsApr 17, 2026

Microsoft Launches Three MAI Models to Rival OpenAI and Google in Speech, Voice, and Image

Microsoft unveils MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 through its Foundry platform, marking the company's boldest push yet to build AI infrastructure independent of its OpenAI partnership.

TechCrunch

ModelsApr 16, 2026

Anthropic Releases Claude Opus 4.7 With Major Coding and Vision Upgrades

Claude Opus 4.7 is now generally available, bringing a 13% coding benchmark improvement, 3x higher image resolution support, and a new xhigh effort setting -- at the same price as Opus 4.6.

Anthropic

ModelsApr 16, 2026

NVIDIA Launches Ising: Open AI Models to Supercharge Quantum Computing

NVIDIA unveils Ising, the world's first family of open-source AI models built to accelerate fault-tolerant quantum computing through AI-powered calibration and error correction.

NVIDIA Newsroom

ModelsApr 14, 2026

OpenAI Launches GPT-6: 40% Capability Jump, 2M Token Context, and Super-App Integration

OpenAI releases GPT-6 with a 40% performance leap, a 2 million token context window, and a unified super-app merging ChatGPT, Codex, and the Atlas browser into a single agent experience.

OpenAI

ModelsApr 13, 2026

Google Gemini 3.1 Ultra Breaks Reasoning Records with 94.3% on GPQA Diamond

Google's flagship Gemini 3.1 Ultra achieves a verified 94.3% on GPQA Diamond and 77.1% on ARC-AGI-2, setting new state-of-the-art benchmarks for complex scientific and logical reasoning.

Google DeepMind

ModelsApr 13, 2026

OpenAI Releases GPT-5.4 Mini and Nano: Purpose-Built for the Subagent Era

OpenAI's new small models are designed to function as parallel subagents inside larger AI workflows — fast, cheap, and capable enough to handle the bulk of agentic work.

OpenAI

ModelsApr 12, 2026

NVIDIA Unveils Nemotron 3: Nano, Super, and Ultra Open Models for Agentic AI

NVIDIA debuted the Nemotron 3 family at GTC 2026, releasing Nano immediately and previewing Super (49B) and Ultra (253B) variants, along with three trillion tokens of pre-training data.

NVIDIA Newsroom

ModelsApr 12, 2026

Meta Releases Llama 5: Open-Source 600B Model Claims Frontier Dominance

Meta CEO Mark Zuckerberg unveiled Llama 5, a 600-billion-parameter open-weight model trained on 500,000 Blackwell GPUs that Meta says outperforms GPT-5 and Gemini on key benchmarks.

FinancialContent / Meta AI

ModelsApr 10, 2026

Google Releases Gemma 4: Open Models That Outperform Rivals 20x Their Size

Google launches Gemma 4, a family of open models ranging from 2B to 31B parameters that deliver unprecedented intelligence-per-parameter, outperforming models 20 times their size on key benchmarks.

Google Blog

ModelsApr 9, 2026

Meta Unveils Muse Spark: First AI Model from $14B Superintelligence Labs

Meta debuts Muse Spark, a proprietary multimodal model built by Alexandr Wang's elite superintelligence team, marking a dramatic shift away from the company's open-source Llama strategy.

TechCrunch

ModelsApr 5, 2026

Anthropic's Claude Mythos 5: A 10-Trillion-Parameter Step Change in AI Capability

Anthropic has begun limited enterprise testing of Claude Mythos 5, a 10-trillion-parameter model that the company calls a 'step change' in AI performance, excelling at cybersecurity, complex coding, and academic reasoning.

Fortune

ModelsApr 5, 2026

Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window

Google's Gemini 3.1 Pro sets a new industry benchmark with a 2 million token context window and 77.1% on ARC-AGI-2, enabling AI to reason over entire codebases and lengthy documents in one pass.

MarkTechPost

ModelsApr 5, 2026

Google Gemini 2.0: The Model That Unifies Everything

Google unveils Gemini 2.0 with advanced multimodal capabilities and deep integration across all Google services.

Google AI Blog

ModelsApr 5, 2026

Anthropic Releases Claude 4: Deeper AI Understanding

Anthropic launches Claude 4 with dramatic improvements in text comprehension, coding, and mathematical reasoning.

Anthropic

ModelsApr 5, 2026

OpenAI Launches GPT-5: The Next Generation of AI

OpenAI has announced GPT-5, its most advanced model yet, with enhanced reasoning capabilities and multimodal features.

OpenAI Blog