← Reading paths

◦

Reading path

Backend developer integrating LLMs

API calls, function calling, streaming, agents and MCP: the milestones that matter.

You are a backend developer who wants to bring AI into your applications in a robust way, not as a demo. This path takes you from foundational primitives (function calling, streaming) to the latest protocol standards (MCP, autonomous agents), keeping an eye on releases that changed what is actually buildable today.

01

Why it matters to you

The primitive that turns an LLM from a chatbot into a callable software component: without function calling there is no serious backend integration.

June 13, 2023 High AI Infrastructure

Function calling: GPT learns to speak JSON

OpenAI adds 'function calling' to the API: the model returns structured JSON conforming to a schema, enabling reliable tool integrations without fragile prompt engineering.
02

Why it matters to you

The framework that standardized how you build an LLM chain in Python: understanding its abstractions explains why every later framework either copied or rejected them.

October 25, 2022 Landmark Agents

LangChain: the framework for LLM applications is born

Harrison Chase releases LangChain, an open-source Python library to chain LLMs with prompt templates, memory, tools and external data sources. It will become the default stack of the first LLM apps.
03

Why it matters to you

Introduces the Assistants API, JSON mode, and 128k context windows: all three reshape how you design a production AI backend.

November 6, 2023 High Foundation Models

OpenAI DevDay: GPT-4 Turbo, GPTs, Assistants API in one hour

At OpenAI's first developer conference: GPT-4 Turbo (128K context, lower prices), GPTs (shareable custom ChatGPTs), Assistants API (managed agents). Product + dev pivot.
04

Why it matters to you

The first model family with tiered intelligence for cost, speed and quality: the moment when multi-model routing becomes a real architectural choice.

March 4, 2024 Landmark Foundation Models

Claude 3 (Opus, Sonnet, Haiku): Anthropic surpasses GPT-4

Anthropic ships the Claude 3 family in three sizes. Opus, the flagship, beats GPT-4 on MMLU, HumanEval, MATH. Native multimodal vision. For the first time GPT-4 is no longer the outright leader.
05

Why it matters to you

The open standard that lets your backend expose tools to any compatible agent: adopting it early means not rewriting the integration six months later.

November 25, 2024 High AI Infrastructure

Model Context Protocol: the open standard to connect LLMs and data

Anthropic open-sources the Model Context Protocol (MCP), a JSON-RPC standard that lets AI assistants talk to tools, file systems, databases, and SaaS without per-model ad-hoc integrations.
06

Why it matters to you

Hugging Face's minimalist code-first agent framework: useful to understand how to orchestrate multi-step tasks without a heavy library.

January 15, 2025 Medium Agents

Hugging Face smolagents: agents that write code instead of JSON

Hugging Face releases smolagents, a ~1000-line minimal library for LLM agents. Pushes the 'code agents' paradigm: the agent writes Python snippets instead of JSON tool calls.
07

Why it matters to you

The reference model for complex backend tasks in 2026: long context and advanced tool use redefine what you can delegate to an agent in production.

February 5, 2026 Landmark Foundation Models

Claude Opus 4.6: 1M context, agent teams, and leadership on Terminal-Bench 2.0

Anthropic releases Opus 4.6: first Opus with 1M-token context in beta, agent teams in Claude Code, leadership on Terminal-Bench 2.0 and Humanity's Last Exam. Pricing unchanged at $5/$25.