DeepSeek-V3: China releases a shockingly cheap open frontier model
DeepSeek publishes V3, MoE 671B (37B active), competitive with GPT-4o and Claude 3.5 Sonnet. Training: 2.788M H800 GPU-hours, claimed cost $5.6M. Changes the 'frontier = billions' narrative.