• fallowseed@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    5 hours ago

    i read that that the chinese made alterations to the cards, as well-- they dismantled them to access the chips themselves and were able to do more precise micromanagement that cuda doesn’t support, for instance… basically they took the training wheels off and used a more fine-tuned and hands-on approach that gave them some serious advantages

        • froztbyte@awful.systems
          link
          fedilink
          English
          arrow-up
          3
          ·
          4 minutes ago

          okay so that post’s core supposition (“using ptx instead of cuda”) is just fucking wrong and I’m not going to spend time on it, but it links to this tweet which has this:

          DeepSeek customized parts of the GPU’s core computational units, called SMs (Streaming Multiprocessors), to suit their needs. Out of 132 SMs, they allocated 20 exclusively for server-to-server communication tasks instead of computational tasks

          this still reads more like simply tuning allocation than outright scheduler and execution control (which your post alluded to)

          [x] doubt

          • fallowseed@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            2 minutes ago

            well you’re always free to doubt and do your own research-- as i mentioned- it is something i read and between believing what the US tech bros are saying when all their money and hegemony is on the line vs what the chinese have given up for free-use, i am going to go out on a limb and trust the chinese. you’re free to make your own decisions in this regard and kudos for having your own mind.