AI News

Latest updates from the world of artificial intelligence

SAKANA · FUGU One Model to Orchestrate the Rest T THINKER W WORKER V VERIFIER S SYNTHESIZER FUGU BITSMINDS.COM
Models

Sakana AI's Fugu Turns Model Orchestration Into a Product

The Tokyo lab made Fugu generally available on June 22 — a foundation model trained to command a pool of other AIs through one OpenAI-compatible API, pitched as a hedge against single-vendor lock-in.

Sakana AI

Read more →
Z GLM-5.2 by Z.ai · MODEL REVIEW 62.1 SWE-bench Pro #1 open model · beats GPT-5.5 74.4 FrontierSWE ≈ Opus 4.8 (75.4) Open-weight · MIT · 744B MoE · 1M context BITSMINDS.COM
Models

GLM-5.2 Review: The Open Model That Out-Codes GPT-5.5

GLM-5.2 is the strongest open-weight coding model yet: MIT-licensed, 744B MoE, 1M context. It beats GPT-5.5 on SWE-bench Pro and trails Opus 4.8 by a point on FrontierSWE — at a fraction of the cost. Our review, with benchmark charts.

The Decoder

Read more →
GPT-5.6 WHAT WE KNOW SO FAR — STATUS: UNCONFIRMED NOW Pre-train Canary Red-team Early access Launch CODEX ROUTING LOG ▸ route: gpt-5.6 · seen May 13 ▸ route: gpt-5.5 · then reverted Rumored · 1.5M context Rumored · ~1/3 cheaper vs Fable 5 Odds 83% · Jun 22–28 BITSMINDS.COM
Models

GPT-5.6: Everything We Know So Far — Rumors, Leaks, and Where It Stands in Testing

OpenAI hasn't announced GPT-5.6, but a vanished Codex log entry, a string of alpha codenames and a pile of leaks sketch a model already in canary testing. Here's what's confirmed, what's rumored, and when it might ship — with the appropriate skepticism.

WaveSpeed Blog

Read more →
MiMo-V2-Pro Xiaomi · API-only reasoning model 49 INTELLIGENCE INDEX 1M-token context #10 worldwide $1 / $3 per 1M tokens MI BITSMINDS.COM
Models

Xiaomi's MiMo-V2-Pro Cracks the Global Top 10 for AI Reasoning — at a Fraction of the Price

Xiaomi's reasoning model scores 49 on the Artificial Analysis Intelligence Index — #10 worldwide, between Kimi K2.5 and GLM-5 — while running its whole benchmark suite for just $348.

Artificial Analysis

Read more →
MODEL WATCH · OPENAI · LEAKS, UNCONFIRMED JUN 17 GPT-5.6 looks days away — if the leaks hold. Prediction markets put a late-June launch near 83% — but OpenAI has confirmed nothing. 83% LAUNCH ODDS Polymarket · June 22–28 window RUMORED SPECS Context window up to ~1.5M tokens Stronger agentic coding, a leaker claims Pricing rumored near a third of Fable 5 BITSMINDS.COM Source: Polymarket · Cryptopolitan · leaks
Models

GPT-5.6 Rumors Reach a Fever Pitch: Prediction Markets Bet on a Late-June Launch, Leaks Claim a 1.5M-Token Window

No OpenAI announcement exists yet, but developer leaks and prediction markets now point to a GPT-5.6 launch in late June — Polymarket prices the June 22–28 window near 83% — with rumored upgrades including a ~1.5M-token context and stronger agentic coding. Treat it all as unconfirmed.

Cryptopolitan

Read more →
ZHIPU / Z.AI · OPEN MODEL JUN 16 GLM-5.2 goes open under MIT. A 744B-parameter MoE built for long-horizon agentic coding. 744B TOTAL · 40B ACTIVE (MoE) 1M CONTEXT TOKENS 131K MAX OUTPUT TOKENS MIT OPEN-WEIGHTS LICENSE No benchmarks published at launch — performance is a vendor claim. glm-5.2[1m] · 28.5T training tokens · agentic coding BITSMINDS.COM Source: Z.ai
Models

Zhipu Releases GLM-5.2: a 744B-Parameter, 1M-Token Coding Model Under a Full MIT License

Zhipu (Z.ai) has launched GLM-5.2, a 744B-parameter Mixture-of-Experts model (40B active) with a 1M-token context window, built for agentic coding and released under a permissive MIT license. The catch: no benchmarks were published at launch, so performance claims remain vendor assertions for now.

Z.ai

Read more →
Moonshot AI Ships Kimi K2.7-Code, an Open-Weight Coding Model It Says Uses 30% Fewer Reasoning Tokens
Models

Moonshot AI Ships Kimi K2.7-Code, an Open-Weight Coding Model It Says Uses 30% Fewer Reasoning Tokens

The Beijing lab’s fifth K2 release in a year is a coding-first, 1-trillion-parameter open-weight model under a modified MIT license — but independent testers say the headline benchmarks are Moonshot’s own and don’t all hold up.

Crypto Briefing

Read more →
BITSMINDS.COM
Models

Opus 4.8 vs Fable 5, One Prompt Each: We Made Them Build a Game and an Interchange — and They Split 1–1

We pasted identical prompts into Claude Opus 4.8 and Claude Fable 5 — an animated highway interchange in pure SVG, then a one-file Space Invaders — one shot each, no retries. Both games are playable inside the article. The split decision reveals two very different temperaments.

BitsMinds Lab

Read more →
BITSMINDS VERDICT Claude Fable 5 The most capable public model yet — for a premium 4.6 OUT OF 5 ★★★★½ SWE-Bench Pro 80.3 Humanity's Last Exam 59.0 Blueprint-Bench 2 38.6 Fable 5 headline benchmarks — higher is better BITSMINDS.COM
Models

Review: Claude Fable 5 Is the Most Capable Public Model Yet — for a Premium

Anthropic's Claude Fable 5 is the most capable model the public can use today — topping SWE-Bench Pro and excelling at vision and long tasks. It is also the priciest major model and ships with hard safety guardrails. Our first-look verdict.

BitsMinds Reviews

Read more →
CLAUDE FABLE 5MYTHOS-CLASS, NOW PUBLICSWE-BENCH PRO80.3agentic coding · vs 69.2 OpusPRICE / 1M TOKENS$10 · $50input / outputCYBER ATTACK SUCCESS5.4%guardrails on · lower is saferAVAILABLE NOWAPI · CopilotPro · Max · Team · EnterpriseBITSMINDS.COMSource: Anthropic
Models

Anthropic Launches Claude Fable 5: Its Most Powerful Public Model Yet, With the Cyber Edges Sanded Off

Anthropic released Claude Fable 5 — the first publicly available model in its powerful Mythos class — scoring 80.3 on SWE-Bench Pro (vs 69.2 for Opus 4.8), priced at $10/$50 per million tokens, with classifier guardrails that block cyber, bio, and distillation requests and fall back to Opus 4.8.

Anthropic

Read more →
RUMOREDCLAUDEFABLE 5PUBLIC MYTHOS?claude-fable-5 · rumoredUNVERIFIEDMODEL CARD · LEAKEDclaude-fable-5ArchitectureClaude 5Long contextextendedMulti-turnimprovedCyber toolingExploit devUNLOCKINGwith guardrailsBITSMINDS.COMReporting: Alex Heath
Models

Claude Fable 5: The Internet Is Sure It Ships Today. Anthropic Hasn't Said a Word.

Prediction markets put it near 94%, a tech reporter says it lands June 9, and internal checkpoints named claude-fable-5 have surfaced — but Anthropic has confirmed nothing. Here is what the noise around Claude Fable 5, the rumored public release of its Mythos model, actually amounts to.

Reporting: Alex Heath; Polymarket

Read more →
Xiaomi Pushes a 1-Trillion-Parameter Model Past 1,000 Tokens a Second — on Eight Off-the-Shelf GPUs
Models

Xiaomi Pushes a 1-Trillion-Parameter Model Past 1,000 Tokens a Second — on Eight Off-the-Shelf GPUs

MiMo-V2.5-Pro-UltraSpeed broke the 1,000 tokens-per-second barrier on a one-trillion-parameter model using a single eight-GPU commodity node — roughly 15x faster than GPT-5.5 or Claude Opus, and without a line of custom silicon.

MarkTechPost

Read more →
OPEN MODELS · CODING · 1M CONTEXTMINIMAX M3 · JUNE 1MiniMax M3Open-weight · native multimodal · MSA attention~1/20 compute at 1M ctx · 9× faster prefill · 15× decodeBENCHMARK SCORECARD · MINIMAX-REPORTEDSWE-Bench Pro59%Terminal-Bench 2.166%SWE-fficiency34.8%BrowseComp83.5BROWSECOMPBeats Opus 4.783.5 vs 79.3Weights + technical report promised within 10 days · scores not yet independently verifiedBITSMINDS.COMSource: The Decoder · MiniMax
Models

MiniMax M3 Lands as an Open-Weight, Million-Token Coding Model That Claims to Edge Out GPT-5.5

The Chinese lab says its new open-weight model pairs frontier coding, a 1M-token context window and native multimodality on a sparse-attention architecture that cuts long-context compute 20x — but the weights and technical report are still days away, so every number is company-reported.

The Decoder

Read more →
REPORTEDLY LEAKEDCLAUDEOCEANUSANTHROPIC · MYTHOS LINEclaude-oceanus-v1-p · surfaced June 3 · unconfirmedBITSMINDS.COMSource: Cyber Security News
Models

Anthropic's Unreleased 'Claude Oceanus' Reportedly Leaked Through a Chinese Proxy Hours After Reaching Red Teamers

A model identifier, claude-oceanus-v1-p, surfaced in Anthropic's console on June 3 and went to vetted red-team testers — then, within hours, an unknown actor allegedly resold access through a Chinese API proxy. None of it is officially confirmed, but it offers an early look at the next model in the Mythos line.

Cyber Security News

Read more →
Aaheadline · in-image textSUBJECTLOGO#hex paletteIMAGE GENERATION · JUNE 3, 2026LAYOUT-NATIVEReve 2.0 + Ideogram 4Layout control is the new promptReve 2.0 — proprietary, 4K, layout editingIdeogram 4 — open-weight, 9.3B, in-image textBoth chase OpenAI GPT Image 2 and Google GeminiBITSMINDS.COMSource: Reve · Ideogram · Latent Space
Models

Reve 2.0 and Ideogram 4 Land the Same Day, Both Betting Image AI's Next Move Is Layout Control

On June 3, two image-AI startups shipped models built on the same wager: that the future of generation is precise layout control, not ever-longer prose prompts. Reve's proprietary 2.0 chases 4K and "images you can touch"; Ideogram's open-weight 4 nails in-image text and bounding-box placement. Both are gunning for OpenAI's GPT Image 2 and Google's Gemini.

Latent Space

Read more →
OPENAI · LIFE SCIENCESJUNE 2026 UPDATEGPT-RosalindRebuilt on GPT-5.5, tuned for biologyDrug discovery · Genomics · Medicinal chemistryOutperforms GPT-5.5 on the new LifeSciBench evalBITSMINDS.COMSource: OpenAI
Models

OpenAI Rebuilds GPT-Rosalind on GPT-5.5 and Widens Access — Its Science Model Now Beats the General One on Biology

OpenAI's life-sciences model just got a major upgrade. Rebuilt on GPT-5.5, it now outscores the general model on a new benchmark called LifeSciBench across drug discovery, genomics, and medicinal chemistry — and OpenAI is widening access beyond launch partners like Moderna, Amgen, and Novo Nordisk.

OpenAI

Read more →
BUILD 2026 · MICROSOFT TAKES AIM AT CLAUDE LOCKED ON MICROSOFT AI MAI-Thinking-1 FIRST IN-HOUSE REASONING MODEL 35B active · ~1T MoE · 256K ctx trained with zero distillation Claude ENTERPRISE CODING DEFAULT SWE-Bench Pro ≈ 53% — level with Opus 4.6 BITSMINDS.COM Microsoft's claims, pending independent benchmarks
Models

Microsoft Aims MAI-Thinking-1 Straight at Claude: a 35B Reasoning Model It Says Beats Sonnet 4.6 and Matches Opus 4.6 on Code

At Build 2026, Microsoft’s AI Superintelligence Team unveiled MAI-Thinking-1, its first in-house reasoning model — a sparse Mixture-of-Experts design with 35B active parameters (~1T total) and a 256K context window. Microsoft says human raters prefer it to Claude Sonnet 4.6 and that it matches Opus 4.6 on the SWE-Bench Pro coding benchmark, and it pointedly trained the model from scratch with zero distillation on commercially licensed data — no OpenAI involved.

Microsoft AI / TechTimes

Read more →
OPEN SOURCE · MIXTURE-OF-EXPERTS · APACHE 2.0JETBRAINS MELLUM2 · JUN 2Mellum212B total parameters · ~2.5B active per tokenMoE · 131K context · 2x faster inference6 variants · Base · Instruct · Thinking (RLVR)THE FOCAL-MODEL THESISFast, specialized parts orchestrated by frontier modelsROUTER · 8 OF 64 EXPERTS FIREOnly ~21% of parameters fire per tokenBITSMINDS.COMSource: JetBrains AI Blog · Hugging Face
Models

JetBrains Open-Sources Mellum2, a 12B Mixture-of-Experts Model Built to Be a Fast "Focal" Part, Not a Frontier Rival

JetBrains released Mellum2 under Apache 2.0 on June 2 — a 12B MoE model that activates just 2.5B parameters per token for 2x-faster inference, pitched as a fast specialized component for multi-model AI pipelines.

JetBrains AI Blog

Read more →
Microsoft Will Unveil Its Own GitHub Copilot Coding Model at Build — a Direct Shot at Claude Code
Models

Microsoft Will Unveil Its Own GitHub Copilot Coding Model at Build — a Direct Shot at Claude Code

According to The Information, Microsoft plans to debut a homegrown coding model for GitHub Copilot at its Build conference on June 2–3, part of a wider suite of in-house MAI models led by Mustafa Suleyman. The push to reduce reliance on OpenAI, Anthropic and Google comes as Copilot has lost ground to Anthropic’s Claude Code among developers.

The Information / TestingCatalog

Read more →
FRONTIER MODEL SHOWDOWN · WHO WINS? Three labs. Three strongest models. One fight. ANTHROPIC Claude Opus 4.8 AUTONOMY OPENAI GPT-5.5 “Spud” AGENTS GOOGLE Gemini 3.1 Ultra REASONING VS VS BITSMINDS.COM BitsMinds original analysis
Models

Claude Opus 4.8 vs GPT-5.5 vs Gemini 3.1 Ultra: Benchmarks

A BitsMinds analysis. We put each lab’s strongest model through the benchmark portfolio that actually separates frontier systems in 2026 — GPQA Diamond, ARC-AGI-2, AIME, SWE-bench, BFCL and more — then weigh price, context and speed. The short version: Gemini 3.1 Ultra is the sharpest reasoner and the best value, Claude Opus 4.8 owns real coding and agentic work, and GPT-5.5 is the strong all-rounder that everyone already has. Here is the full scorecard.

BitsMinds Analysis

Read more →
MODEL RELEASE · AGENTIC AI · ANTHROPICCLAUDE OPUS 4.8 · MAY 28Opus 4.8Anthropic’s most honest model yetSame price as 4.7 · $5 / $25 per M tokensFast mode · 2.5× speed · 3× cheaperEffort dial · plan-and-verify subagentsDYNAMIC WORKFLOW · PARALLEL SUBAGENTSPLANVERIFY ✓running 100s in parallel · one sessionBITSMINDS.COMSource: Anthropic · TechCrunch · Axios
Models

Anthropic Ships Claude Opus 4.8: Dynamic Workflows Spawn Hundreds of Parallel Subagents, and It’s the “Most Honest” Claude Yet

Anthropic released Claude Opus 4.8 on May 28, 41 days after 4.7 — adding Dynamic Workflows that plan and run hundreds of parallel subagents per session, a user-facing effort dial, a 2.5× faster fast mode at 3× lower cost, and its “most honest” self-review behavior yet, at 4.7’s price.

Anthropic / TechCrunch

Read more →
ANTHROPIC ® INTRODUCING OPUS 4.8 LEAK FORBIDDEN-STRINGS FILE · NPM SOURCE MAP · MARCH 31, 2026 BITSMINDS Source: WaveSpeed AI · decodethefuture.org
Models

Anthropic's Leaked Forbidden-Strings File Names Opus 4.8, Sonnet 4.8 and a New Tier Above Opus Called Capybara

Inside the 512K-line Claude Code source map that npm shipped on March 31 was a feature literally called Undercover Mode — and inside Undercover Mode was a hard-coded list of names it was supposed to scrub from public builds. That list spells out Opus 4.8, Sonnet 4.8, a never-released Sonnet 4.7 and animal codenames Capybara and Tengu — with Capybara described internally as a new tier sitting above Opus.

WaveSpeed AI / ChaoBro / AI Nexus Daily

Read more →
Alibaba's Qwen3.7-Max Lands With a 1M-Token Context, AA Index 56.6, and a 35-Hour Agent Run
Models

Alibaba's Qwen3.7-Max Lands With a 1M-Token Context, AA Index 56.6, and a 35-Hour Agent Run

Qwen3.7-Max-Preview ships with a 1M-token context, extended-thinking mode, and benchmark gains across CritPt, Humanity's Last Exam, and Terminal-Bench Hard — clearing Gemini 3.5 Flash on the AA Intelligence Index.

Qwen (Alibaba Cloud)

Read more →
Google's Gemini 'Omni' Video Model Leaks Days Before I/O 2026, Pointing to a Unified Multimodal System
Models

Google's Gemini 'Omni' Video Model Leaks Days Before I/O 2026, Pointing to a Unified Multimodal System

Test UI strings inside Gemini reveal a new Google video model that lets users remix and edit clips directly in chat, with a likely debut at I/O 2026 on May 19-20.

9to5Google

Read more →
Mira Murati's Thinking Machines Unveils TML-Interaction-Small, a Full-Duplex Voice Model That Beats GPT and Gemini
Models

Mira Murati's Thinking Machines Unveils TML-Interaction-Small, a Full-Duplex Voice Model That Beats GPT and Gemini

Mira Murati's Thinking Machines Lab released a research preview of TML-Interaction-Small, a 276B mixture-of-experts model that responds in under 0.4 seconds and listens while it speaks — outpacing GPT-realtime-2 and Gemini 3.1 Flash Live on the FD-bench latency test.

TechCrunch

Read more →
OpenAI Ships GPT-Realtime-2, Translate, and Whisper, Bringing GPT-5 Reasoning Into Voice Apps
Models

OpenAI Ships GPT-Realtime-2, Translate, and Whisper, Bringing GPT-5 Reasoning Into Voice Apps

OpenAI rolled out three new voice models on May 7 — a reasoning agent with a 128K context window, a 70-language live translator at $0.034 a minute, and a streaming Whisper that transcribes as you speak.

OpenAI

Read more →
Google Ships Gemini 3.1 Flash-Lite to GA at $0.25 per Million Input Tokens
Models

Google Ships Gemini 3.1 Flash-Lite to GA at $0.25 per Million Input Tokens

Gemini 3.1 Flash-Lite is now generally available on Vertex AI and Gemini Enterprise, pitched as the cheapest, fastest member of the Gemini 3 family for high-volume agentic, classification, and tool-use workloads.

Google Cloud Blog

Read more →
Mistral Medium 3.5 Lands With Cloud Coding Agents and 77.6% on SWE-Bench
Models

Mistral Medium 3.5 Lands With Cloud Coding Agents and 77.6% on SWE-Bench

Mistral fuses chat, reasoning, and coding into a single 128B dense model and pairs it with Vibe — async cloud-based coding agents that hand back work as pull requests instead of terminal output.

Mistral AI

Read more →
DeepSeek V4 Preview Closes Gap With Frontier Models at a Fraction of the Price
Models

DeepSeek V4 Preview Closes Gap With Frontier Models at a Fraction of the Price

Chinese lab DeepSeek released two open-weight V4 preview models with 1M-token context windows, matching closed-source rivals on reasoning while undercutting them on price.

TechCrunch

Read more →
NVIDIA Unveils Nemotron 3 Nano Omni: Open Multimodal Model with 9x Throughput
Models

NVIDIA Unveils Nemotron 3 Nano Omni: Open Multimodal Model with 9x Throughput

NVIDIA's new 30B-A3B mixture-of-experts model unifies vision, audio, and language into a single open system, delivering up to nine times the throughput of comparable open omni models for AI agents.

NVIDIA Blog

Read more →
OpenAI Launches GPT-5.5: Smarter, Faster, and Built for Agentic Work
Models

OpenAI Launches GPT-5.5: Smarter, Faster, and Built for Agentic Work

OpenAI's GPT-5.5, codenamed "Spud," brings 82.7% Terminal-Bench 2.0 performance and autonomous multi-step task execution — a step toward OpenAI's unified AI super app.

TechCrunch

Read more →
DeepSeek Releases V4: Open-Source Frontier Model at a Fraction of the Price
Models

DeepSeek Releases V4: Open-Source Frontier Model at a Fraction of the Price

DeepSeek's V4-Pro and V4-Flash arrive with 1.6 trillion parameters, a 1M-token context window, and Apache 2.0 licensing — matching near-frontier performance at 21x less cost than Claude Opus 4.7.

Simon Willison

Read more →
Google Launches Gemini 3.1 Pro: Tops ARC-AGI-2 Benchmark with 2M Token Context and Agentic Coding
Models

Google Launches Gemini 3.1 Pro: Tops ARC-AGI-2 Benchmark with 2M Token Context and Agentic Coding

Google's Gemini 3.1 Pro hits a verified 77.1% on the ARC-AGI-2 reasoning benchmark — more than double its predecessor — and arrives with a 2-million-token context window, multi-step agentic coding, and support across Gemini API, Vertex AI, and NotebookLM.

Google

Read more →
Meta Launches Muse Spark: Multimodal AI with Parallel Agents, Health Insights, and Shopping Intelligence
Models

Meta Launches Muse Spark: Multimodal AI with Parallel Agents, Health Insights, and Shopping Intelligence

Meta Superintelligence Labs unveils Muse Spark, a fast multimodal model that deploys parallel AI subagents, analyzes medical images, generates functional websites, and surfaces social-powered shopping recommendations.

Meta

Read more →
xAI Launches Grok 4.3 Beta: Native Video Input, Document Generation, and Desktop Automation
Models

xAI Launches Grok 4.3 Beta: Native Video Input, Document Generation, and Desktop Automation

xAI has launched Grok 4.3 in beta with native video processing, PDF and spreadsheet generation, and tighter Grok Computer integration — behind a $300/month SuperGrok Heavy paywall ahead of a broader May rollout.

TechSifted

Read more →
Moonshot AI Releases Kimi K2.6: Open-Source Giant with 1T Parameters and 300-Agent Swarms
Models

Moonshot AI Releases Kimi K2.6: Open-Source Giant with 1T Parameters and 300-Agent Swarms

Moonshot AI has released Kimi K2.6, a 1-trillion-parameter open-source model capable of coordinating 300 parallel sub-agents across 4,000 steps — available on Hugging Face under a Modified MIT License.

MarkTechPost

Read more →
Google Gemma 4: Open Models That Outperform Systems 20x Their Size
Models

Google Gemma 4: Open Models That Outperform Systems 20x Their Size

Google DeepMind released Gemma 4 on April 2, a family of four open-weight multimodal models under Apache 2.0 that bring frontier-level reasoning and agentic capabilities to phones, edge devices, and developer environments.

Google DeepMind

Read more →
Anthropic's Claude Mythos 5: The First 10-Trillion-Parameter AI Model
Models

Anthropic's Claude Mythos 5: The First 10-Trillion-Parameter AI Model

Anthropic has revealed Claude Mythos 5, the first publicly acknowledged 10-trillion-parameter AI model, engineered for cybersecurity, advanced coding, and deep academic reasoning — but not yet available to the public.

Anthropic / AI Analytics Diaries

Read more →
Microsoft Launches Three MAI Models to Rival OpenAI and Google in Speech, Voice, and Image
Models

Microsoft Launches Three MAI Models to Rival OpenAI and Google in Speech, Voice, and Image

Microsoft unveils MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 through its Foundry platform, marking the company's boldest push yet to build AI infrastructure independent of its OpenAI partnership.

TechCrunch

Read more →
Anthropic Releases Claude Opus 4.7 With Major Coding and Vision Upgrades
Models

Anthropic Releases Claude Opus 4.7 With Major Coding and Vision Upgrades

Claude Opus 4.7 is now generally available, bringing a 13% coding benchmark improvement, 3x higher image resolution support, and a new xhigh effort setting -- at the same price as Opus 4.6.

Anthropic

Read more →
NVIDIA Launches Ising: Open AI Models to Supercharge Quantum Computing
Models

NVIDIA Launches Ising: Open AI Models to Supercharge Quantum Computing

NVIDIA unveils Ising, the world's first family of open-source AI models built to accelerate fault-tolerant quantum computing through AI-powered calibration and error correction.

NVIDIA Newsroom

Read more →
OpenAI Launches GPT-6: 40% Capability Jump, 2M Token Context, and Super-App Integration
Models

OpenAI Launches GPT-6: 40% Capability Jump, 2M Token Context, and Super-App Integration

OpenAI releases GPT-6 with a 40% performance leap, a 2 million token context window, and a unified super-app merging ChatGPT, Codex, and the Atlas browser into a single agent experience.

OpenAI

Read more →
Google Gemini 3.1 Ultra Breaks Reasoning Records with 94.3% on GPQA Diamond
Models

Google Gemini 3.1 Ultra Breaks Reasoning Records with 94.3% on GPQA Diamond

Google's flagship Gemini 3.1 Ultra achieves a verified 94.3% on GPQA Diamond and 77.1% on ARC-AGI-2, setting new state-of-the-art benchmarks for complex scientific and logical reasoning.

Google DeepMind

Read more →
OpenAI Releases GPT-5.4 Mini and Nano: Purpose-Built for the Subagent Era
Models

OpenAI Releases GPT-5.4 Mini and Nano: Purpose-Built for the Subagent Era

OpenAI's new small models are designed to function as parallel subagents inside larger AI workflows — fast, cheap, and capable enough to handle the bulk of agentic work.

OpenAI

Read more →
NVIDIA Unveils Nemotron 3: Nano, Super, and Ultra Open Models for Agentic AI
Models

NVIDIA Unveils Nemotron 3: Nano, Super, and Ultra Open Models for Agentic AI

NVIDIA debuted the Nemotron 3 family at GTC 2026, releasing Nano immediately and previewing Super (49B) and Ultra (253B) variants, along with three trillion tokens of pre-training data.

NVIDIA Newsroom

Read more →
Meta Releases Llama 5: Open-Source 600B Model Claims Frontier Dominance
Models

Meta Releases Llama 5: Open-Source 600B Model Claims Frontier Dominance

Meta CEO Mark Zuckerberg unveiled Llama 5, a 600-billion-parameter open-weight model trained on 500,000 Blackwell GPUs that Meta says outperforms GPT-5 and Gemini on key benchmarks.

FinancialContent / Meta AI

Read more →
Google Releases Gemma 4: Open Models That Outperform Rivals 20x Their Size
Models

Google Releases Gemma 4: Open Models That Outperform Rivals 20x Their Size

Google launches Gemma 4, a family of open models ranging from 2B to 31B parameters that deliver unprecedented intelligence-per-parameter, outperforming models 20 times their size on key benchmarks.

Google Blog

Read more →
Meta Unveils Muse Spark: First AI Model from $14B Superintelligence Labs
Models

Meta Unveils Muse Spark: First AI Model from $14B Superintelligence Labs

Meta debuts Muse Spark, a proprietary multimodal model built by Alexandr Wang's elite superintelligence team, marking a dramatic shift away from the company's open-source Llama strategy.

TechCrunch

Read more →
Anthropic's Claude Mythos 5: A 10-Trillion-Parameter Step Change in AI Capability
Models

Anthropic's Claude Mythos 5: A 10-Trillion-Parameter Step Change in AI Capability

Anthropic has begun limited enterprise testing of Claude Mythos 5, a 10-trillion-parameter model that the company calls a 'step change' in AI performance, excelling at cybersecurity, complex coding, and academic reasoning.

Fortune

Read more →
Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window
Models

Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window

Google's Gemini 3.1 Pro sets a new industry benchmark with a 2 million token context window and 77.1% on ARC-AGI-2, enabling AI to reason over entire codebases and lengthy documents in one pass.

MarkTechPost

Read more →
Google Gemini 2.0: The Model That Unifies Everything
Models

Google Gemini 2.0: The Model That Unifies Everything

Google unveils Gemini 2.0 with advanced multimodal capabilities and deep integration across all Google services.

Google AI Blog

Read more →
Anthropic Releases Claude 4: Deeper AI Understanding
Models

Anthropic Releases Claude 4: Deeper AI Understanding

Anthropic launches Claude 4 with dramatic improvements in text comprehension, coding, and mathematical reasoning.

Anthropic

Read more →
OpenAI Launches GPT-5: The Next Generation of AI
Models

OpenAI Launches GPT-5: The Next Generation of AI

OpenAI has announced GPT-5, its most advanced model yet, with enhanced reasoning capabilities and multimodal features.

OpenAI Blog

Read more →