AI News

Latest updates from the world of artificial intelligence

Mistral Medium 3.5 Lands With Cloud Coding Agents and 77.6% on SWE-Bench
Models

Mistral Medium 3.5 Lands With Cloud Coding Agents and 77.6% on SWE-Bench

Mistral fuses chat, reasoning, and coding into a single 128B dense model and pairs it with Vibe — async cloud-based coding agents that hand back work as pull requests instead of terminal output.

Mistral AI

Read more →
DeepSeek V4 Preview Closes Gap With Frontier Models at a Fraction of the Price
Models

DeepSeek V4 Preview Closes Gap With Frontier Models at a Fraction of the Price

Chinese lab DeepSeek released two open-weight V4 preview models with 1M-token context windows, matching closed-source rivals on reasoning while undercutting them on price.

TechCrunch

Read more →
NVIDIA Unveils Nemotron 3 Nano Omni: Open Multimodal Model with 9x Throughput
Models

NVIDIA Unveils Nemotron 3 Nano Omni: Open Multimodal Model with 9x Throughput

NVIDIA's new 30B-A3B mixture-of-experts model unifies vision, audio, and language into a single open system, delivering up to nine times the throughput of comparable open omni models for AI agents.

NVIDIA Blog

Read more →
OpenAI Launches GPT-5.5: Smarter, Faster, and Built for Agentic Work
Models

OpenAI Launches GPT-5.5: Smarter, Faster, and Built for Agentic Work

OpenAI's GPT-5.5, codenamed "Spud," brings 82.7% Terminal-Bench 2.0 performance and autonomous multi-step task execution — a step toward OpenAI's unified AI super app.

TechCrunch

Read more →
DeepSeek Releases V4: Open-Source Frontier Model at a Fraction of the Price
Models

DeepSeek Releases V4: Open-Source Frontier Model at a Fraction of the Price

DeepSeek's V4-Pro and V4-Flash arrive with 1.6 trillion parameters, a 1M-token context window, and Apache 2.0 licensing — matching near-frontier performance at 21x less cost than Claude Opus 4.7.

Simon Willison

Read more →
Google Launches Gemini 3.1 Pro: Tops ARC-AGI-2 Benchmark with 2M Token Context and Agentic Coding
Models

Google Launches Gemini 3.1 Pro: Tops ARC-AGI-2 Benchmark with 2M Token Context and Agentic Coding

Google's Gemini 3.1 Pro hits a verified 77.1% on the ARC-AGI-2 reasoning benchmark — more than double its predecessor — and arrives with a 2-million-token context window, multi-step agentic coding, and support across Gemini API, Vertex AI, and NotebookLM.

Google

Read more →
Meta Launches Muse Spark: Multimodal AI with Parallel Agents, Health Insights, and Shopping Intelligence
Models

Meta Launches Muse Spark: Multimodal AI with Parallel Agents, Health Insights, and Shopping Intelligence

Meta Superintelligence Labs unveils Muse Spark, a fast multimodal model that deploys parallel AI subagents, analyzes medical images, generates functional websites, and surfaces social-powered shopping recommendations.

Meta

Read more →
xAI Launches Grok 4.3 Beta: Native Video Input, Document Generation, and Desktop Automation
Models

xAI Launches Grok 4.3 Beta: Native Video Input, Document Generation, and Desktop Automation

xAI has launched Grok 4.3 in beta with native video processing, PDF and spreadsheet generation, and tighter Grok Computer integration — behind a $300/month SuperGrok Heavy paywall ahead of a broader May rollout.

TechSifted

Read more →
Moonshot AI Releases Kimi K2.6: Open-Source Giant with 1T Parameters and 300-Agent Swarms
Models

Moonshot AI Releases Kimi K2.6: Open-Source Giant with 1T Parameters and 300-Agent Swarms

Moonshot AI has released Kimi K2.6, a 1-trillion-parameter open-source model capable of coordinating 300 parallel sub-agents across 4,000 steps — available on Hugging Face under a Modified MIT License.

MarkTechPost

Read more →
Google Gemma 4: Open Models That Outperform Systems 20x Their Size
Models

Google Gemma 4: Open Models That Outperform Systems 20x Their Size

Google DeepMind released Gemma 4 on April 2, a family of four open-weight multimodal models under Apache 2.0 that bring frontier-level reasoning and agentic capabilities to phones, edge devices, and developer environments.

Google DeepMind

Read more →
Anthropic's Claude Mythos 5: The First 10-Trillion-Parameter AI Model
Models

Anthropic's Claude Mythos 5: The First 10-Trillion-Parameter AI Model

Anthropic has revealed Claude Mythos 5, the first publicly acknowledged 10-trillion-parameter AI model, engineered for cybersecurity, advanced coding, and deep academic reasoning — but not yet available to the public.

Anthropic / AI Analytics Diaries

Read more →
Microsoft Launches Three MAI Models to Rival OpenAI and Google in Speech, Voice, and Image
Models

Microsoft Launches Three MAI Models to Rival OpenAI and Google in Speech, Voice, and Image

Microsoft unveils MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 through its Foundry platform, marking the company's boldest push yet to build AI infrastructure independent of its OpenAI partnership.

TechCrunch

Read more →
Anthropic Releases Claude Opus 4.7 With Major Coding and Vision Upgrades
Models

Anthropic Releases Claude Opus 4.7 With Major Coding and Vision Upgrades

Claude Opus 4.7 is now generally available, bringing a 13% coding benchmark improvement, 3x higher image resolution support, and a new xhigh effort setting -- at the same price as Opus 4.6.

Anthropic

Read more →
NVIDIA Launches Ising: Open AI Models to Supercharge Quantum Computing
Models

NVIDIA Launches Ising: Open AI Models to Supercharge Quantum Computing

NVIDIA unveils Ising, the world's first family of open-source AI models built to accelerate fault-tolerant quantum computing through AI-powered calibration and error correction.

NVIDIA Newsroom

Read more →
OpenAI Launches GPT-6: 40% Capability Jump, 2M Token Context, and Super-App Integration
Models

OpenAI Launches GPT-6: 40% Capability Jump, 2M Token Context, and Super-App Integration

OpenAI releases GPT-6 with a 40% performance leap, a 2 million token context window, and a unified super-app merging ChatGPT, Codex, and the Atlas browser into a single agent experience.

OpenAI

Read more →
Google Gemini 3.1 Ultra Breaks Reasoning Records with 94.3% on GPQA Diamond
Models

Google Gemini 3.1 Ultra Breaks Reasoning Records with 94.3% on GPQA Diamond

Google's flagship Gemini 3.1 Ultra achieves a verified 94.3% on GPQA Diamond and 77.1% on ARC-AGI-2, setting new state-of-the-art benchmarks for complex scientific and logical reasoning.

Google DeepMind

Read more →
OpenAI Releases GPT-5.4 Mini and Nano: Purpose-Built for the Subagent Era
Models

OpenAI Releases GPT-5.4 Mini and Nano: Purpose-Built for the Subagent Era

OpenAI's new small models are designed to function as parallel subagents inside larger AI workflows — fast, cheap, and capable enough to handle the bulk of agentic work.

OpenAI

Read more →
NVIDIA Unveils Nemotron 3: Nano, Super, and Ultra Open Models for Agentic AI
Models

NVIDIA Unveils Nemotron 3: Nano, Super, and Ultra Open Models for Agentic AI

NVIDIA debuted the Nemotron 3 family at GTC 2026, releasing Nano immediately and previewing Super (49B) and Ultra (253B) variants, along with three trillion tokens of pre-training data.

NVIDIA Newsroom

Read more →
Meta Releases Llama 5: Open-Source 600B Model Claims Frontier Dominance
Models

Meta Releases Llama 5: Open-Source 600B Model Claims Frontier Dominance

Meta CEO Mark Zuckerberg unveiled Llama 5, a 600-billion-parameter open-weight model trained on 500,000 Blackwell GPUs that Meta says outperforms GPT-5 and Gemini on key benchmarks.

FinancialContent / Meta AI

Read more →
Google Releases Gemma 4: Open Models That Outperform Rivals 20x Their Size
Models

Google Releases Gemma 4: Open Models That Outperform Rivals 20x Their Size

Google launches Gemma 4, a family of open models ranging from 2B to 31B parameters that deliver unprecedented intelligence-per-parameter, outperforming models 20 times their size on key benchmarks.

Google Blog

Read more →
Meta Unveils Muse Spark: First AI Model from $14B Superintelligence Labs
Models

Meta Unveils Muse Spark: First AI Model from $14B Superintelligence Labs

Meta debuts Muse Spark, a proprietary multimodal model built by Alexandr Wang's elite superintelligence team, marking a dramatic shift away from the company's open-source Llama strategy.

TechCrunch

Read more →
Anthropic's Claude Mythos 5: A 10-Trillion-Parameter Step Change in AI Capability
Models

Anthropic's Claude Mythos 5: A 10-Trillion-Parameter Step Change in AI Capability

Anthropic has begun limited enterprise testing of Claude Mythos 5, a 10-trillion-parameter model that the company calls a 'step change' in AI performance, excelling at cybersecurity, complex coding, and academic reasoning.

Fortune

Read more →
Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window
Models

Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window

Google's Gemini 3.1 Pro sets a new industry benchmark with a 2 million token context window and 77.1% on ARC-AGI-2, enabling AI to reason over entire codebases and lengthy documents in one pass.

MarkTechPost

Read more →
Google Gemini 2.0: The Model That Unifies Everything
Models

Google Gemini 2.0: The Model That Unifies Everything

Google unveils Gemini 2.0 with advanced multimodal capabilities and deep integration across all Google services.

Google AI Blog

Read more →
Anthropic Releases Claude 4: Deeper AI Understanding
Models

Anthropic Releases Claude 4: Deeper AI Understanding

Anthropic launches Claude 4 with dramatic improvements in text comprehension, coding, and mathematical reasoning.

Anthropic

Read more →
OpenAI Launches GPT-5: The Next Generation of AI
Models

OpenAI Launches GPT-5: The Next Generation of AI

OpenAI has announced GPT-5, its most advanced model yet, with enhanced reasoning capabilities and multimodal features.

OpenAI Blog

Read more →