• ffhein@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    16 days ago

    Yea… it’s not quite the same thing to actually run DeepSeek R1, a 671B model, and for example DeepSeek-R1-Distill-Qwen-1.5B