More

    An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, Nvidia impact, AGI, and more (Ben Thompson/Stratechery)

    Ben Thompson / Stratechery:
    An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, Nvidia impact, AGI, and more  —  It’s Monday, January 27.  Why haven’t you written about DeepSeek yet?  —  I did!  I wrote about R1 last Tuesday.

    Latest articles

    Related articles

    Leave a reply

    Please enter your comment!
    Please enter your name here