top of page

S3/E1: One Good AI Model and Everyone Forgets How Panic Works

  • brotherskeleton
  • Jan 30, 2025
  • 1 min read

DeepSeek dropped R1 on January 20th, trained it for a reported six million dollars, and by the 27th had wiped nearly six hundred billion from Nvidia's market cap in a single day. Both reactions - gaah the 'America is finished' crowd and the gahh 'it's a Chinese spy app' crowd - are dead wrong in interesting ways. Today we're actually looking at the model: what it does, how mixture-of-experts architecture works, why the cost claims are real but also complicated, and what it actually means when a smaller team produces a competitive result on constrained hardware. DeepSeek is a genuine engineering achievement. It is not Skynet. Calm down.

 
 
 

Recent Posts

See All

Comments


bottom of page