Realtime voice AI: sub-second latency and multilingual become the norm
Realtime voice APIs from OpenAI, Google and ElevenLabs converge on < 500ms latency, fluent multilingual, natural prosody. Phone as an agentic channel becomes practical.
A technical AI diary — from 2020 to today
Not a news site. A personal, curated archive of the AI breakthroughs that actually changed something for people who work with software and systems.
★ My picks
Not just news: these are entries with a real practical effect on how I work as IT / sysadmin / dev. Annotated with what I changed after.
This entire archive was built with Claude Code: 608 events, 121 terms, Docker deploys, regression fixes — all through one CLI I run from my terminal. It changed what 'personal project' means for me: I now ship things that used to need a full weekend.
MCP is why I stopped writing custom integrations: my sysadmin scripts now talk directly to Claude through MCP servers, and I reuse the same tools across IDE, terminal, and internal dashboards.
The day after the beta I had Claude do a full ticket-opening pass on our internal tool — fields filled, screenshot attached, all through the API. From then on the question 'what to automate?' became 'what NOT to automate?'.
Ollama runs on my home mini-PC: 16GB RAM, no GPU, and I have a private always-on AI for Q&A on company docs. The day I ran `ollama pull llama3` I realized local AI was no longer a nerd toy.
This is the moment I changed how I work: ChatGPT replaced Stack Overflow for 90% of my questions, cut my average time writing scripts and documentation by 40%, and forced me to rethink how I explain what a sysadmin does to non-technical people.