← Reading paths

◦

Reading path

Frontend developer integrating AI into UIs

Streaming chat, voice UI, multimodal, Copilot in editor, MCP in the browser.

You are a frontend or fullstack developer who wants to go beyond simple API wrappers: you want to build interfaces that converse, listen, see and act. This path follows the releases that redefined what "intelligent UI" means — from editor autocompletion to agents that use the browser on behalf of the user.

01

Why it matters to you

The first AI native in the editor that becomes a daily tool for millions of developers: understanding how Copilot context works helps you use it better and design similar tools.

June 29, 2021 High AI Coding

GitHub Copilot: autocomplete grows up

GitHub and OpenAI launch a technical preview of an assistant that suggests entire lines and functions right in the editor, based on a GPT-3-derived model trained on public code.
02

Why it matters to you

The chat UI becomes the dominant AI interaction paradigm: every frontend developer should know the moment users stopped reading and started conversing.

November 30, 2022 Landmark Foundation Models

ChatGPT: AI lands in everyone's browser

OpenAI launches ChatGPT, a free conversational interface on GPT-3.5 aligned via RLHF. It crosses one million users in five days.
03

Why it matters to you

Voice and image input land in ChatGPT: the signal that multimodal UIs are no longer research — users expect them and you need to design for them today.

September 25, 2023 High Multimodal AI

ChatGPT can see, hear, and speak: voice + vision in mobile app

ChatGPT Plus on iOS/Android gets voice conversations (5 synthetic voices) and image input (GPT-4V). From text chat to a full conversational assistant.
04

Why it matters to you

Omnimodal model with sub-300ms voice latency: finally unlocks fluid voice UIs in the browser without a separate speech-to-text/text-to-speech pipeline.

May 13, 2024 High Multimodal AI

GPT-4o: text, voice and images in a single model

OpenAI unveils GPT-4o (omni), a single model that natively handles text, audio, and images with ~320 ms voice latency and GPT-4-class text quality — free for ChatGPT free users.
05

Why it matters to you

The standard that lets your frontend connect to local tools (file system, browser, database) via a uniform protocol: the foundation for agentic UIs in the browser.

November 25, 2024 High AI Infrastructure

Model Context Protocol: the open standard to connect LLMs and data

Anthropic open-sources the Model Context Protocol (MCP), a JSON-RPC standard that lets AI assistants talk to tools, file systems, databases, and SaaS without per-model ad-hoc integrations.
06

Why it matters to you

An AI agent that writes, tests and refactors code autonomously inside the editor: the reference benchmark for understanding where assisted developer experience is heading.

February 24, 2025 Landmark AI Coding

Claude Code: the coding agent lands in the terminal

Anthropic ships Claude Code alongside Claude 3.7 Sonnet: a CLI that reads the codebase, edits files, runs commands, runs tests, makes commits — the 'agent in terminal' pattern goes mainstream.
07

Why it matters to you

OpenAI Realtime APIs in general availability: finally a stable standard on which to build bidirectional voice UIs in production, without experimental middleware.

April 9, 2025 High Voice & Audio

OpenAI Realtime API GA: production-ready voice-to-voice over WebRTC

OpenAI promotes the Realtime API to GA: low-latency voice-in/voice-out (~300ms), tool calling, function calling, native WebRTC. Opens the production voice-app era with a single end-to-end API.