2026

69 entries

June 24, 2026 High

Cursor 0.45: background agents autonomously fix bugs and open PRs from GitHub issues

Cursor 0.45 launches background agents running in cloud VMs that autonomously read GitHub issues, fix bugs, write tests, and open pull requests with no human in the loop.

AI Coding CursorBackground AgentsAutonomous Coding

June 23, 2026 Medium

OpenAI releases o4-mini-high: frontier reasoning at 60% lower cost than o3

OpenAI releases o4-mini-high, a reasoning model matching o3 on key benchmarks like SWE-bench and AIME while costing 60% less, with extended thinking up to 32K tokens and tool use during reasoning chains.

Foundation Models ReasoningOpenAICost Efficiency

June 20, 2026 High

NVIDIA GB300 Blackwell Ultra Launches: 288 GB HBM3e, NVLink 5 at 1.8 TB/s

NVIDIA begins shipping the GB300 Blackwell Ultra GPU featuring 288 GB HBM3e per chip, NVLink 5 at 1.8 TB/s, and double the FP8 throughput of the B200, dramatically lowering inference costs for frontier AI models.

AI Infrastructure GPUNVIDIABlackwell Ultra

June 19, 2026 High

Anthropic releases Memory API GA for Claude: structured persistent storage for agents across sessions

Anthropic has made its Memory API generally available, providing structured persistent storage for Claude agents across sessions with project-scoped memory, user-scoped memory, and semantic search over stored facts.

Agents Memory APIPersistent StorageAgentic AI

June 18, 2026 Medium

xAI releases Grok 3.5: real-time web access, 200K context, DeepSearch 2, and Aurora image gen

xAI launches Grok 3.5 featuring real-time web access, a 200K token context window, multi-step DeepSearch 2 research chains, and the Aurora image generator built in. Available on X Premium+ and via API, positioning against GPT-5.5 and Claude Opus 4.8.

Foundation Models GrokxAIDeepSearch

June 16, 2026 Medium

Mistral AI releases Mistral Medium 3: 22B parameters, best-in-class cost-performance for European AI

Mistral AI launches Medium 3, a 22-billion-parameter model delivering leading cost-to-performance for code, function calling, and JSON mode workloads, available on La Plateforme and as open weights.

Foundation Models Mistral AIOpen WeightsCost Efficiency

June 13, 2026 High

Alibaba releases Qwen 3.5: open-weight models from 7B to 235B MoE with 128K context

Alibaba drops Qwen 3.5 with four dense variants (7B, 14B, 32B, 72B) and a 235B MoE model under Apache 2.0, offering top-tier multilingual performance and GPT-5.5-level results at a fraction of the cost.

Open Source Models QwenOpen WeightsMultilingual

June 12, 2026 High

Microsoft Build 2026: Copilot becomes the agentic OS layer for Windows

Microsoft announces Copilot++ as a native agentic layer in Windows, GitHub Copilot Workspace reaches GA, Azure AI Foundry 2.0 launches, and Phi-4.5 is released as open source.

Enterprise AI Microsoft BuildCopilotGitHub Copilot

June 11, 2026 High

EU AI Act GPAI Compliance Deadline: OpenAI, Google, Anthropic and Meta File Transparency Reports

June 11, 2026 marks the first real enforcement milestone of the EU AI Act for GPAI model providers: major AI companies must register on the EU AI database and publish transparency reports, with fines up to 3% of global annual turnover for non-compliance.

AI Security EU AI ActGPAICompliance

June 10, 2026 Landmark

Meta releases Llama 4.1: Scout, Maverick, and Behemoth MoE models under Apache 2.0

Meta launches Llama 4.1 in three MoE variants — Scout (edge), Maverick (mid-tier with 10M context), and Behemoth (frontier) — all natively multimodal and freely available under Apache 2.0.

Open Source Models LlamaMetaMoE

June 9, 2026 High

OpenAI Codex 2.0: dedicated autonomous coding agent in ChatGPT and API

OpenAI releases Codex 2.0 as a fully autonomous coding agent inside ChatGPT and via API, capable of completing entire repository tasks — reading files, running tests, and opening pull requests — inside isolated cloud sandboxes.

AI Coding CodexCoding AgentAutonomous Coding

June 6, 2026 High

Apple WWDC 2026: Apple Intelligence 2.0 with 4B parameter on-device models

Apple unveils Apple Intelligence 2.0 at WWDC 2026: on-device models upgraded to 4B parameters on A18 Pro, an autonomous multi-step Siri, Visual Intelligence 2 with real-time scene understanding, and on-device image generation — all with expanded privacy guarantees.

Enterprise AI Apple IntelligenceOn-Device AISiri

June 5, 2026 High

Google I/O 2026: Gemini Ultra 3, Project Astra goes live on Pixel, 2M context with real-time grounding, Veo 3.2, Imagen 4

At Google I/O 2026, Google DeepMind unveiled Gemini Ultra 3 with a 2M-token context window and real-time web grounding, Project Astra now live on Pixel devices, Veo 3.2 and Imagen 4 for creative generation, and a broader rollout of NotebookLM Plus and Android AI mode.

Foundation Models GeminiGoogleMultimodal

June 4, 2026 Landmark

Anthropic releases Claude Fable 5: a new model family beyond the 4.x line

Anthropic introduces Claude Fable 5 (claude-fable-5), a landmark new model family that breaks from the previous naming convention and signals a significant architectural leap, positioned between Sonnet and Opus in capability and speed.

Foundation Models AnthropicClaudeFable

June 3, 2026 Medium

Google releases Veo 3.2: 4K video generation at 60fps with native lip-sync and integrated audio

Google DeepMind launches Veo 3.2, advancing AI video generation to 4K 60fps with native lip-sync, multi-character scene consistency, and integrated audio generation, available via Gemini API and Vertex AI.

Image & Video Gen Video GenerationGoogleGemini

June 2, 2026 Landmark

Anthropic releases Claude Opus 4.8: the most powerful Claude model to date

Anthropic launches Claude Opus 4.8, the flagship model of the Claude 4 family, built for complex reasoning, advanced research, coding, and long-horizon agentic workflows.

Foundation Models ClaudeAnthropicReasoning

May 21, 2026 High

OpenAI releases Sora 2: 1080p 60fps, synchronized audio-video, and API-first for creative professionals

OpenAI launches Sora 2 with 1080p 60fps output, 2-minute clips, native audio-video synchronized generation, and integrated inpainting/outpainting. Initially API-only, the model is repositioned as a creative professional tool following the shutdown of the consumer Sora app in April 2026.

Image & Video Gen Sora 2Video GenerationOpenAI

May 18, 2026 Medium

Realtime voice AI: sub-second latency and multilingual become the norm

Realtime voice APIs from OpenAI, Google and ElevenLabs converge on < 500ms latency, fluent multilingual, natural prosody. Phone as an agentic channel becomes practical.

Voice & Audio VoiceRealtimeSpeech

May 15, 2026 High

Boston Dynamics Atlas Electric: manipulation foundation model trained on 10 million robot hours

Boston Dynamics releases a new manipulation foundation model for Atlas Electric, trained on 10 million hours of robot experience, enabling zero-shot grasping of novel objects in unstructured environments. Now deployed at Hyundai factories.

Robotics Boston DynamicsAtlas ElectricFoundation Model

May 13, 2026 Medium

Mistral releases Devstral Small: 7B coding model for agentic tasks on consumer GPU

Mistral releases Devstral Small, a 7-billion-parameter model fine-tuned for agentic coding that outperforms GPT-4o-mini on SWE-bench and runs on just 8GB of VRAM.

Local AI Coding ModelLocal LLMAgentic AI

May 12, 2026 High

MCP at 18 months: the server ecosystem hits critical mass

Eighteen months after launch (November 2024), Model Context Protocol consolidates: thousands of public servers, confirmed cross-vendor adoption, first stable official registry.

Agents MCPModel Context ProtocolAnthropic

May 9, 2026 High

Google releases Gemini 3.1 Pro with native video understanding

Gemini 3.1 Pro analyzes videos up to one hour long frame-by-frame, extracts events, and answers questions about video content. It powers YouTube AI summaries and Google Search video clips, with a 2M token context window that natively includes video frames.

Multimodal AI Video UnderstandingGeminiLong Context

May 6, 2026 High

AMD MI350 Instinct: 288GB HBM3e and 1.5 PFLOPS FP8 challenge NVIDIA at datacenter scale

AMD launches the MI350 Instinct GPU with 288GB HBM3e memory, double the bandwidth of MI300X, and 1.5 PFLOPS FP8 performance, paired with ROCm 7.0 featuring significantly improved PyTorch compatibility.

AI Infrastructure GPUAMDHBM3e

May 3, 2026 High

ServiceNow Now AI Agents GA: autonomous IT service management at scale

ServiceNow releases Now AI Agents to general availability, enabling autonomous end-to-end handling of L1/L2 tickets without human routing, claiming 65% ticket deflection across enterprise deployments.

Enterprise AI ITSMAutonomous AgentsTicket Deflection

April 30, 2026 Medium

Usable 2-bit quantization: frontier reasoning models drop below 32GB RAM

New quantization techniques (high-quality 2-bit / 3-bit extensions) let frontier-sized reasoning models run on workstations with 32-64GB unified RAM.

Local AI Local AIQuantizationOllama

April 26, 2026 Medium

OpenAI shuts down the Sora app: consumer AI video can't sustain the math

OpenAI shuts down the Sora app on April 26, 2026; the Sora 2 API will be turned off September 24. Operating costs estimated around $1M/day, compute shifting to ChatGPT/GPT-5.5 and core enterprise.

Image & Video Gen OpenAISoraVideo Generation

April 24, 2026 Landmark

DeepSeek V4 Preview: 1.6T parameters, 1M context, open weight in two sizes

DeepSeek releases V4 Preview as open source: V4-Pro (1.6T total, 49B active) and V4-Flash (284B total, 13B active). Native 1M-token context, hybrid CSA+HCA attention cutting KV cache by 90%.

Open Source Models DeepSeekOpen SourceMoE

April 23, 2026 Landmark

GPT-5.5: OpenAI shifts ChatGPT toward an "agent runtime" paradigm

OpenAI releases GPT-5.5, GPT-5.5 Thinking, and GPT-5.5 Pro: designed as an "agent runtime" for persistent multi-step workflows. 23% more factually correct vs GPT-5.4. File Library, side-by-side shopping, improved image gen.

Foundation Models OpenAIGPT-5.5ChatGPT

April 22, 2026 High

EU AI Act: 100-day countdown to the high-risk system rules

Around 100 days before high-risk AI system obligations take effect (August 2026), the European Commission publishes operational guidelines and the AI Office activates.

AI Security EU AI ActRegulationCompliance

April 21, 2026 High

Deep Research and Deep Research Max: Google's autonomous research agents with MCP

Google ships two research agents on the Gemini API: Deep Research (fast) and Deep Research Max (deep + slow, 93.3% on DeepSearchQA). MCP support for private data, native visualizations via Nano Banana 2.

Agents GoogleGeminiDeep Research

April 20, 2026 High

Figure AI releases Figure 02 autonomy update: fully autonomous warehouse picking without human teleoperation

Figure AI's Figure 02 humanoid robot now performs warehouse picking tasks fully autonomously using an LLM backend for natural language task assignment, achieving 95% task success rate in a BMW pilot with 100 deployed units.

Robotics Autonomous RobotsWarehouse AutomationLLM Integration

April 17, 2026 Medium

Cerebras CS-3 Wafer-Scale Engine: 4 trillion transistors, 44GB SRAM, Llama 4 Maverick at 1500 tokens/sec

Cerebras unveils the CS-3, its third-generation wafer-scale engine featuring 4 trillion transistors and 44 GB of on-chip SRAM, running Llama 4 Maverick at 1500 tokens per second on a single chip. First commercial deployment is live in the UAE AI cloud.

AI Infrastructure CerebrasWafer-ScaleInference

April 14, 2026 High

SAP AI Foundation 2026: Autonomous AI Agents Embedded Across ERP Workflows

SAP embeds autonomous AI agents into invoice processing, inventory forecasting, and HR onboarding via SAP AI Foundation, upgrading Joule to a full agentic assistant with potential impact on 300 million users.

Enterprise AI SAPERPAI Agents

April 13, 2026 High

Claude in Word, Excel, and PowerPoint: Anthropic completes its Office 365 invasion

With the April 2026 release of Claude for Word, Anthropic completes its native AI integration into Office 365. Cross-app shared context, pivots/charts in Excel, slide editing in PowerPoint, contracts in Word.

Enterprise AI AnthropicClaudeMicrosoft 365

April 10, 2026 Medium

OpenAI upgrades gpt-image-1: accurate text, photorealistic portraits, and inpainting API

OpenAI enhances native image generation in GPT-4o with gpt-image-1: accurate text rendering, photorealistic portraits, consistent character across images, and inpainting via API. Displaces DALL-E 3 as the primary image generation backend.

Multimodal AI

April 8, 2026 High

Robotics foundation model: a new step toward the "GPT of manipulation"

A robotics lab (Physical Intelligence or peer) publishes a new multi-embodiment foundation model for general manipulation, trained on cross-robot datasets.

Robotics RoboticsFoundation ModelPhysical Intelligence

April 7, 2026 Landmark

Claude Mythos Preview: a model that finds zero-days at industrial speed, and Project Glasswing

Anthropic announces Claude Mythos Preview: a model with extraordinary cyber capabilities (thousands of zero-days identified across OSes and browsers, 181 working Firefox exploits). Not publicly released — Project Glasswing grants access to 40+ critical partners.

AI Security AnthropicMythosCybersecurity

April 3, 2026 High

Microsoft releases Phi-4.5: 14B parameter SLM with best-in-class reasoning, runs on 8GB VRAM

Microsoft releases Phi-4.5, a 14-billion parameter model that outperforms much larger models on reasoning and coding benchmarks, runs on a laptop GPU with 8GB VRAM, and is freely available under Apache 2.0.

Local AI Phi-4.5SLMReasoning

April 2, 2026 High

Cursor 3: the IDE becomes a control room for parallel agents

Anysphere ships Cursor 3 (codename Glass): a new Agents Window with parallel agents across local, worktrees, cloud, and remote SSH. Built for developers who orchestrate agents rather than write every line.

AI Coding CursorAnysphereCoding Agent

March 26, 2026 High

OpenAI consolidates its agent platform: Operator and ChatGPT Agent merged

OpenAI reorganizes Operator (January 2025) and ChatGPT Agent (July 2025) into a unified platform, with refreshed SDK and new async multi-task execution modes.

Agents OpenAIAgentsChatGPT

March 25, 2026 High

Salesforce Agentforce 3.0: AI agents autonomously handle full sales cycles in CRM

Salesforce launches Agentforce 3.0, replacing Einstein GPT with autonomous AI agents capable of managing lead qualification, email drafting, meeting scheduling, and follow-up across 150,000 enterprise customers.

Enterprise AI SalesforceAgentforceCRM

March 19, 2026 Landmark

Tesla Optimus Gen 3: 45-DOF hands and FSD neural stack adapted for manipulation

Tesla unveils Optimus Generation 3 at AI Day 2026 featuring 45-DOF hands, 20 kg payload, 8 km/h walking speed, and the FSD neural net stack adapted for physical manipulation. Tesla targets 1,000 units deployed in its own factories by Q4 2026.

Robotics TeslaOptimusHumanoid Robot

March 18, 2026 Medium

Claude 4: vision capabilities upgrade with PDF analysis up to 1000 pages

Anthropic enhances Claude 4 visual capabilities: advanced chart and document understanding, PDF analysis up to 1000 pages, 3D object reasoning from 2D images, and multimodal context mixing.

Multimodal AI

March 16, 2026 High

Mistral Small 4: three models (reasoning + vision + coding) fused into one open weight

Mistral releases Small 4, unifying Magistral (reasoning), Pixtral (multimodal vision), and Devstral (agentic coding) into a single open-weight model, simplifying the deployment stack.

Open Source Models MistralOpen SourceMultimodal

March 12, 2026 Medium

Groq launches GroqCloud 2.0: LPU Gen3, 2000 tokens/sec, and Frankfurt European data center

Groq releases GroqCloud 2.0 with third-generation LPU chips delivering 2000 tokens/sec on Llama 4.1 Maverick, adds Function Calling GA, JSON mode, streaming tool use, batch pricing, and opens a European data center in Frankfurt.

AI Infrastructure GroqLPUInference

March 11, 2026 High

NVIDIA GTC 2026: Huang keynote and the Rubin roadmap for the next cycle

At GTC 2026 NVIDIA confirms its annual cadence: details on Rubin (Blackwell's successor), new rack-scale configurations, updated software stack for training and inference.

AI Infrastructure NVIDIAGTCRubin

March 5, 2026 Medium

Ollama 0.9: concurrent model serving, multi-GPU split, and REST API v2 for local AI

Ollama 0.9 delivers simultaneous multi-model loading, inter-request KV-cache persistence, automatic multi-GPU layer splitting, and a new streaming JSON REST API v2.

Local AI OllamaLocal LLMGPU

February 26, 2026 High

GitHub Copilot Coding Agent: model picker, self-review, and built-in security scanning

GitHub upgrades the Copilot agent: per-task model picker, self-review before opening PRs, code/secret/dependency scanning in-workflow, custom agents in .github/agents/, and CLI handoff. Copilot CLI hits GA the same day.

AI Coding GitHubCopilotCoding Agent

February 26, 2026 Medium

Nano Banana 2: Google rebuilds its viral image model around consistency and text

Google releases Nano Banana 2 (aka Gemini 3.1 Flash Image): much better text rendering, consistency for up to 5 characters and 14 objects, default for image generation in Gemini app, Flow, Lens, and Search AI Mode.

Image & Video Gen GoogleGeminiNano Banana 2

February 25, 2026 Medium

Mistral relaunches with a new open-weight reasoning flagship

Mistral AI announces a new flagship with extended reasoning, open weights for the research variant. Europe's answer to DeepSeek R2 and the US reasoning models.

Open Source Models MistralFranceEurope

February 19, 2026 High

Gemini 3.1 Pro: Google's first '0.1' bump and the ARC-AGI-2 leap

Google releases Gemini 3.1 Pro: 77.1% on ARC-AGI-2 (more than double Gemini 3 Pro), 80.6% SWE-Bench Verified, 94.3% GPQA Diamond. Same price as 3 Pro: $2/M input.

Foundation Models GoogleDeepMindGemini

February 17, 2026 High

Claude Sonnet 4.6: the 'middle' model that beats Opus 4.5 in coding

Anthropic releases Sonnet 4.6: 79.6% on SWE-bench Verified, 72.5% on OSWorld-Verified (on par with Opus 4.6), better prompt-injection resistance. Pricing unchanged at $3/$15.

AI Coding AnthropicClaudeSonnet 4.6

February 15, 2026 High

ElevenLabs launches Studio Enterprise: voice cloning with consent verification and 200+ languages

ElevenLabs launches Studio Enterprise with 30-second voice cloning with consent verification, dubbing API with lip-sync, real-time voice agent SDK, and GDPR-compliant EU hosting. 200+ languages.

Voice & Audio

February 12, 2026 Medium

Google releases Imagen 3.5: Google's best text-to-image model

Google DeepMind releases Imagen 3.5 with photorealistic output, accurate text rendering in images, and SynthID watermarking on by default. Integrated in Gemini, Workspace, and Vertex AI.

Multimodal AI

February 11, 2026 High

Claude Sonnet 4.7: more reliable agents and longer task duration

Anthropic updates Sonnet to 4.7: focused on agent reliability over long tasks, better tool use, tighter integration with Claude Code and the Claude Agent SDK.

AI Coding AnthropicClaudeSonnet 4.7

February 5, 2026 Landmark

Claude Opus 4.6: 1M context, agent teams, and leadership on Terminal-Bench 2.0

Anthropic releases Opus 4.6: first Opus with 1M-token context in beta, agent teams in Claude Code, leadership on Terminal-Bench 2.0 and Humanity's Last Exam. Pricing unchanged at $5/$25.

Foundation Models AnthropicClaudeOpus 4.6

February 4, 2026 Medium

Mistral Voxtral Transcribe 2: open-source speech-to-text that runs on a laptop

Mistral releases Voxtral Transcribe 2: two open-source STT models (Batch + Realtime, 4B params) with latency configurable down to 200ms, Apache 2.0, 13 languages.

Voice & Audio MistralVoxtralASR

January 28, 2026 High

DeepSeek R2: the Chinese lab relaunches its open-weight reasoning model

DeepSeek ships R2, successor to R1: more efficient step-by-step reasoning, open weights, contained training cost. Fresh pressure on closed reasoning models.

Open Source Models DeepSeekOpen SourceReasoning

January 25, 2026 Medium

Whisper v3 Turbo: real-time local transcription on consumer GPU

Whisper v3 Turbo reaches widespread adoption: 8x faster than v3-large at the same accuracy, runs real-time on consumer GPUs. Integrated in Ollama and LM Studio, enables local transcription pipelines for businesses.

Voice & Audio

January 23, 2026 High

OpenAI Operator GA: the first commercial autonomous web agent

OpenAI launches Operator in GA across 30+ countries: an agent that browses the web, fills forms, books appointments, and shops online autonomously on behalf of the user.

Agents

January 20, 2026 High

Microsoft Copilot Studio: agent marketplace with 1800+ ready-made solutions

Microsoft launches the agent marketplace for Copilot Studio with over 1800 pre-built agents, autonomous multi-agent orchestration, and enterprise governance with DLP policies.

Enterprise AI

January 16, 2026 High

DeepSeek releases Janus Pro: one model to understand and generate images

Janus Pro is a unified 7B-parameter multimodal model that both understands images and generates them from text, outperforming DALL-E 3 and Stable Diffusion 3 on the GenEval benchmark. Fully open source and runs locally.

Multimodal AI DeepSeekMultimodalImage Generation

January 14, 2026 High

Gemini 3 Pro and Flash: Google relaunches the frontier challenge

Google DeepMind announces Gemini 3 with Pro and Flash variants: improved reasoning, native long context, deeper integration into Workspace and Android.

Foundation Models GoogleDeepMindGemini

January 13, 2026 Medium

Veo 3.1 and Veo 3.1 Lite: Google takes AI video to 1080p/4K vertical and "ingredients-to-video"

Google releases Veo 3.1 and Veo 3.1 Lite: video generation with "ingredients" (multiple reference images for character/scene consistency), 1080p/4K output, vertical format for Shorts. Veo 3.1 Lite is the cost-effective variant.

Image & Video Gen GoogleDeepMindVeo

January 12, 2026 High

Claude Cowork: Anthropic's desktop agent for non-technical knowledge workers

Anthropic ships Cowork as a research preview: a desktop agent with sandboxed shell and local file access, aimed at people who don't live in the terminal the way Claude Code users do.

Agents AnthropicClaudeCowork

January 10, 2026 Landmark

Alibaba releases Qwen2.5-VL 72B: best open-source multimodal model beats GPT-4o on key benchmarks

Alibaba releases Qwen2.5-VL 72B under Apache 2.0, surpassing GPT-4o on multiple multimodal benchmarks with support for documents, charts, 20+ minute videos, multilingual OCR, and GUI agent actions.

Multimodal AI QwenAlibabaOpen Source

January 9, 2026 Landmark

NVIDIA Project DIGITS: a 1 PFLOP personal AI supercomputer for $3,000

Announced at CES 2026, NVIDIA Project DIGITS packs a GB10 Superchip, 128 GB unified memory, and 1 PFLOP FP4 into a desktop device priced at $3,000, enabling local inference of frontier models like Llama 4 405B without the cloud.

Local AI NVIDIAProject DIGITSGB10 Superchip

January 7, 2026 High

OpenAI releases o3-mini: advanced reasoning at a fraction of the cost

OpenAI releases o3-mini on January 7 2026, the smallest and cheapest model in the o3 reasoning family, featuring three configurable effort levels and benchmark performance that surpasses o1 at significantly lower cost.

Foundation Models Reasoning ModelsOpenAIExtended Thinking

January 6, 2026 High

CES 2026: On-Device AI Takes Over with AI PCs, Home Robots, and NVIDIA Project DIGITS

CES 2026 in Las Vegas was the first edition entirely dominated by on-device AI, showcasing second-generation Copilot+ PCs, NVIDIA's Project DIGITS personal AI supercomputer, AI-powered TVs, and autonomous home robots from LG and Samsung.

Local AI Copilot+On-Device AIAI PC