• Irdial@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    46
    ·
    6 hours ago

    Every time I see a headline that contains the word “slams,” I want to slam my head on the table

  • Viri4thus@feddit.org
    link
    fedilink
    English
    arrow-up
    54
    arrow-down
    1
    ·
    14 hours ago

    Every one who bought the 7900xtx laughing their arse off running 20GiB models with MUCH better performance than a 4080/4080Super lol

    • TBi@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      2 hours ago

      I’m an idiot that waited. Saw a sapphire nitro 7900xtx on sale for €900 but didn’t get it holding out for the 5800. Now those are €1400 if you can find one and the 7900xtx is out of stock.

      Have a 3080ti though so I’m not too bad off, just annoyed.

      • Viri4thus@feddit.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 hours ago

        Don’t feel bad, neither AMD or NVIDIA (or Intel for that matter) have produced anything worthy of note in the GPU space since the 1080Ti or 6800XT. Keep your 3080ti, it’ll serve you well for now. Hopefully Morethreads or Intel make something interesting and disrupt the market although it’s unlikely. NV and AMD have the GPU spaced fairly locked with IP (and cash reserves) that would drown any competitor in legalese for a millennium. The 7900xtx is a helluva card because it competes with overpriced NVIDIA hw, in any sane world it would be a 7800 class card and priced accordingly. (like the 5080 is actually a 5070)

    • Naz@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      28
      arrow-down
      1
      ·
      11 hours ago

      I bought my 7900XTX for $800, and have kept absolutely quiet about it.

      Anyone who has asked me: “AMD sucks, CUDA better, buy NVDA stock”.

      The invisible hand of the market is made of invisible delicious meat

    • Eager Eagle@lemmy.world
      link
      fedilink
      English
      arrow-up
      42
      ·
      17 hours ago

      I bet he just wants a card to self host models and not give companies his data, but the amount of vram is indeed ridiculous.

      • Jeena@piefed.jeena.net
        link
        fedilink
        English
        arrow-up
        21
        ·
        17 hours ago

        Exactly, I’m in the same situation now and the 8GB in those cheaper cards don’t even let you run a 13B model. I’m trying to research if I can run a 13B one on a 3060 with 12 GB.

          • levzzz@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            ·
            11 hours ago

            You need a pretty large context window to fit all the reasoning, ollama forces 2048 by default and more uses more memory

          • Viri4thus@feddit.org
            link
            fedilink
            English
            arrow-up
            1
            ·
            13 hours ago

            I also have a 3060, can you detail which framework (sglang, ollama, etc) you are using and how you got that speed? i’m having trouble reaching that level of performance. Thx

            • The Hobbyist@lemmy.zip
              link
              fedilink
              English
              arrow-up
              4
              ·
              edit-2
              6 hours ago

              Ollama, latest version. I have it setup with Open-WebUI (though that shouldn’t matter). The 14B is around 9GB, which easily fits in the 12GB.

              I’m repeating the 28 t/s from memory, but even if I’m wrong it’s easily above 20.

              Specifically, I’m running this model: https://ollama.com/library/deepseek-r1:14b-qwen-distill-q4_K_M

              Edit: I confirmed I do get 27.9 t/s, using default ollama settings.

              • Viri4thus@feddit.org
                link
                fedilink
                English
                arrow-up
                2
                ·
                5 hours ago

                Ty. I’ll try ollama with the Q-4-M quantization. I wouldn’t expect to see a difference between ollama and SGlang.

              • Jeena@piefed.jeena.net
                link
                fedilink
                English
                arrow-up
                2
                ·
                11 hours ago

                Thanks for the additional information, that helped me to decide to get the 3060 12G instead of the 4060 8G. They have almost the same price but from what I gather when it comes to my use cases the 3060 12G seems to fit better even though it is a generation older. The memory bus is wider and it has more VRAM. Both video editing and the smaller LLMs should be working well enough.

        • manicdave@feddit.uk
          link
          fedilink
          English
          arrow-up
          4
          ·
          12 hours ago

          I’m running deepseek-r1:14b on a 12GB rx6700. It just about fits in memory and is pretty fast.

  • deleted@lemmy.world
    link
    fedilink
    English
    arrow-up
    27
    arrow-down
    2
    ·
    16 hours ago

    I legit tried to understand how a lackluster VRAM capacity could spy on us.

  • Gork@lemm.ee
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    1
    ·
    edit-2
    8 hours ago

    How would Snowden get a hold of one of these in Russia? Maybe through an intermediary in Kazakhstan?

    Then again it’s hard finding one here even in the US since they all went out of stock within 5 minutes of being listed.

    • UndercoverUlrikHD@programming.dev
      link
      fedilink
      English
      arrow-up
      39
      ·
      14 hours ago

      According to russian over at r/hardware GPUs have become cheaper in Russia since the ban as they are now being smuggled instead of imported via Europe with all extra cost that implies.

  • John Richard@lemmy.world
    link
    fedilink
    English
    arrow-up
    26
    arrow-down
    4
    ·
    edit-2
    17 hours ago

    The video card monopoly (but also other manufacturers) have been limiting functionality for a long time. It started with them restricting vGPU to enterprise garbage products, which allows Linux users to virtualize their GPU for things like playing games with near-native speeds using Windows on Linux. This is one of the big reasons Windows still has such a large marketshare as the main desktop OS.

    Now they want to restrict people running AI locally so that they get stuck with crap like Copilot-enabled PCs or whatever dumb names they want to come up. These actions are intentional. It is anti-consumer & anti-trust, but don’t expect our government to care or do anything about it.

    • Grandwolf319@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 hours ago

      But that’s assuming there is actual high demand for running big models locally, so far I’ve only seen hobbyists do it.

      I agree with you in theory that they just want more money but idk if they actually think locally run AI is that big of a threat (I hope it is).

    • MudMan@fedia.io
      link
      fedilink
      arrow-up
      13
      arrow-down
      3
      ·
      16 hours ago

      So to put the likelihood of this in perspective, let me just repeat it to see if I understand the claim.

      You’re saying that one of the big reasons of Windows’ market share is how artificially inefficient it is to install Linux, spin up a Virtual Machine, run Windows inside THAT and then run a game?

      That’s the mainstream use case that is propping up Windows adoption in this scenario?

      • John Richard@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        10
        ·
        16 hours ago

        The main thing propping up Windows as the main OS (meaning it is running at the root layer) is exclusive hardware GPU support which is used for gaming & many apps. Otherwise, automating running Windows apps & Windows on Linux would have become much more mainstream.

        • MudMan@fedia.io
          link
          fedilink
          arrow-up
          27
          arrow-down
          2
          ·
          15 hours ago

          This is demonstrably wrong on a scale where it loops around to becoming hard to explain, so that’s a neat trick.

          There are enough people who have never heard of or don’t understand the concept of virtual machines to keep Windows as the biggest mainstream OS several times over. There isn’t a “root layer” in computers as far as normal humans are concerned. They’re computers and then a Windows pops up and that’s how that works.

          At the very most, they understand conversion layers on the basis of having gone from an old Macbook to a new Macbook, and even that is like a tenth of the market (still several times bigger than Linux adoption, though).

          The idea that a mass of people are waiting on the sidelines, chomping at the bit for direct GPU access through an extra layer of software fine tuning to be able to run some brand name Windows app with no Linux version is absurd. Even games are not the problem, as evidenced by that being mostly solved via Proton and not changing much.

          I don’t mind either way, but man, consider what other assumptions you may be making that are wildly off, particularly if they’re on something more important than your hopes for relative OS market share on home computers.

  • MudMan@fedia.io
    link
    fedilink
    arrow-up
    9
    arrow-down
    1
    ·
    17 hours ago

    Wait, did the guy refuse to call AMD’s 9070 by its official name out of spite there at the end? Is this a weird tech The Onion thing?

    • Viri4thus@feddit.org
      link
      fedilink
      English
      arrow-up
      5
      ·
      13 hours ago

      AMD, as usual, misses an opportunity here. The 5xxx series is exactly Fermi again (they even removed hotspot data so reviewers would miss the throttling). AMD could leverage the nostalgia of one of ATI’s best gens and call the cards 9700 and 9700pro. Damn, those were the days. (since 4750 conflicts with current scheme)

      • MudMan@fedia.io
        link
        fedilink
        arrow-up
        4
        arrow-down
        3
        ·
        16 hours ago

        What type of news editor for a hardware review outlet gets that wrong? That’s as weird as the Snowden thing. If you have that job you’ve surely been joking about AMD’s shameful “we just want to use the same name as Nvidia” thing for ages by now. This thing is so surreal.

  • 800XL@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    63
    ·
    15 hours ago

    Shut the fuck up, Snowden.You had everyone behind you until you defected to Russia. There’s no free lunch and you had a lot of info Putin would like to have. Oddly enough things really started getting bad shortly therafter.

    • Danitos@reddthat.com
      link
      fedilink
      English
      arrow-up
      64
      arrow-down
      4
      ·
      edit-2
      7 hours ago

      He was being chased by the US government, and Assange proved that being in an US allied country will still get your arrested/tortured. What other options did Snowden had other than escaping to Russia?

      IMO don’t hate the player, hate the game.

    • LiPoly@lemmynsfw.com
      link
      fedilink
      English
      arrow-up
      30
      arrow-down
      3
      ·
      13 hours ago

      Think of it from Snowdens perspective. You get to choose: either be tortured for the rest of your life, or chill in Russia and pretend Putin is a nice guy. I know what I’d pick.

      • Alphane Moon@lemmy.world
        link
        fedilink
        English
        arrow-up
        17
        arrow-down
        5
        ·
        12 hours ago

        He is not simply pretending Putin is a nice guy, he is clearly collaborating with russian security services. Just look at his comments on internal US politics. And he also was spreading misinformation that russia wasn’t going to invade Ukraine in Feb 2022.

        He might be a hero for many, but if you’re Ukrainian (like I am), he is clearly a piece of shit.

        • Evil_Shrubbery@lemm.ee
          link
          fedilink
          English
          arrow-up
          11
          arrow-down
          2
          ·
          edit-2
          9 hours ago

          Still it’s 100.0% USAs decisions that pushed him to Russia, it’s not like he went there immediately (2013), he was on the run and in shitty conditions for years before he finally had enough and went to Russia (2022).

          USA still wanted to disappear (torture) him, not even allowing him to stay in other NATO or non-NATO countries.

          A USA hero is safer in Russia, and Putin had nothing to do with setting that situation up (safe for not deporting him to USA ofc, which otc lol).

          • Alphane Moon@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            2
            ·
            7 hours ago

            And how does this justify Snowden promoting russian genocidal imperialism in Ukraine?

            He did a good thing, so now he is an untouchable saint?

            • sugar_in_your_tea@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              4
              arrow-down
              1
              ·
              5 hours ago

              I’m guessing that was a deal Russia made with him, he gets to live as long as he’s useful to Russia.

              I don’t think anyone is calling him a saint. He’s a whistleblower whose best option was to defect to Russia, after a period of trying not to do that.

        • LiPoly@lemmynsfw.com
          link
          fedilink
          English
          arrow-up
          7
          arrow-down
          2
          ·
          9 hours ago

          I totally get it, I live next to that big pain in the ass as well. Luckily, he hasn’t invaded us yet, but I feel it’s only a matter of time. And what Snowden does here certainly doesn’t help.

          But he could have just done nothing and lived a very happy life. Instead, he chose to give up his happy life to uncover the NSA scandal, knowing full well that it will absolutely wreck his life.

          Personally, I think he did enough for the greater good there. This isn’t his war, and if he has to post some lies to get a bit of normalcy back in his life, I can understand that. I wish it wasn’t that way, but I can understand it.

          • Alphane Moon@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            2
            ·
            7 hours ago

            The issue is that he is supporting russian genocidal imperialism. His messaging clearly aligns with russian propaganda goals.

            Is it not reasonable for me to consider him my enemy (he directly supports doing harm to me, my family and my fellow citizens)?

            As far I am concerned, I hope Snowden and his family will one day be on the recieving end of russian brutality.

            I don’t buy the logic of “he did one good thing, so it’s fine for him to promote russian genocidal imperialism”.

            If he can’t stay consistent, he should have never got involved in the NSA issue in the first place.

            He clearly enjoys the attention (just look at this post). He could have simply shut up and not worked with russian security services (the russians wouldn’t kill him, they need him alive).

            • sugar_in_your_tea@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              2
              arrow-down
              1
              ·
              edit-2
              5 hours ago

              His messaging clearly aligns with russian propaganda goals.

              Well yeah, he’s in Russia. What’s he supposed to do?

              He did one good thing. Now he’s in Russia, so ignore pretty much anything he days says that could in some way benefit Russia.

              Why do they need him alive? Any information he has is a decade old at this point. He’s only useful to them alive while he has a platform.