Snot Flickerman@lemmy.blahaj.zone · 1 month ago

Lol. Lmao even. "DeepSeek R1 reproduced for $30: Berkeley researchers replicate DeepSeek R1 for $30—casting doubt on H100 claims and controversy"

fallowseed@lemmy.world · edit-2 1 month ago

i read that the chinese made alterations to the cards, as well-- they dismantled them to access the chips themselves and were able to do more precise micromanagement that cuda doesn’t support, for instance… basically they took the training wheels off and used a more fine-tuned and hands-on approach that gave them some serious advantages

froztbyte@awful.systems · 1 month ago

got a source for that?

fallowseed@lemmy.world · 1 month ago

just something i read, this isn’t the original source i read, but a quick search gave me: https://www.xatakaon.com/robotics-and-ai/the-secret-to-deepseeks-extreme-efficiency-is-out-it-bypasses-nvidias-cuda-standard

froztbyte@awful.systems · edit-2 1 month ago

okay so that post’s core supposition (“using ptx instead of cuda”) is just ~~fucking wrong~~ fucking weird and I’m not going to spend time on it, but it links to this tweet which has this:

DeepSeek customized parts of the GPU’s core computational units, called SMs (Streaming Multiprocessors), to suit their needs. Out of 132 SMs, they allocated 20 exclusively for server-to-server communication tasks instead of computational tasks

this still reads more like simply tuning allocation than outright scheduler and execution control (which your post alluded to)

[x] doubt

e: original wording because cuda still uses ptx anyway, whereas this post looks like it’s saying “they steered ptx directly”. at first I read the tweet more like “asm vs python” but it doesn’t appear to be what that part meant to convey. still doubting the core hypothesis tho
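
for scale, a back-of-envelope sketch of the split the quoted tweet describes. it assumes the tweet's 132 and 20 figures are accurate (unverified here), and the names are made up for illustration, not anything from DeepSeek's code:

```python
# Figures taken from the quoted tweet; unverified, treated as assumptions.
TOTAL_SMS = 132  # streaming multiprocessors per GPU, per the tweet
COMM_SMS = 20    # SMs said to be reserved for server-to-server communication


def sm_split(total, comm):
    """Return (compute SM count, fraction of SMs reserved for communication)."""
    if not 0 <= comm <= total:
        raise ValueError("comm must be between 0 and total")
    return total - comm, comm / total


compute_sms, comm_fraction = sm_split(TOTAL_SMS, COMM_SMS)
print(f"{compute_sms} SMs left for compute, {comm_fraction:.1%} reserved for communication")
```

that is a resource-allocation ratio, which is consistent with reading the tweet as tuning rather than as taking over scheduling and execution.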

froztbyte@awful.systems · edit-2 1 month ago

sidebar: I definitely wouldn’t be surprised if it comes to this overall being a case of “a shop optimised by tuning, and then it suddenly turns out the entire industry has never tried to tune a thing ever”

because why try hard when the money taps are open and flowing free? velocity over everything! this is the bayfucker way.

skillissuer@discuss.tchncs.de · 1 month ago

ah yes the ultimate american NOBUS - we can throw money at the problem until it disappears

froztbyte@awful.systems · 1 month ago

it might disappear under the gigantic heap of money but gosh darn it we can KEEP HEAPING

froztbyte@awful.systems · 1 month ago

I do sorta get the idea that this is exactly (one of the reasons) why ol’ felon is trying to get his hands on all the funding faucets

fallowseed@lemmy.world · 1 month ago

well you’re always free to doubt and do your own research-- as i mentioned- it is something i read and between believing what the US tech bros are saying when all their money and hegemony is on the line vs what the chinese have given up for free-use, i am going to go out on a limb and trust the chinese. you’re free to make your own decisions in this regard and kudos for having your own mind.

froztbyte@awful.systems · 1 month ago

mine isn’t a “USA v China: Jelly Wrestling Deluxe” comment and you’re not really understanding the point

fallowseed@lemmy.world · 1 month ago

what is your point? i thought i was giving an “explain like i’m 5” answer to a guy asking for one… you came along asking me to show sources… now this?

froztbyte@awful.systems · 1 month ago

the point is that your eli5 is unfounded rumour hearsay bullshit (and thus it’s entirely pointless to spread it), and when given a relatively gentle indication of that, you decided to cosplay an ostrich

pro-tip: if it ain’t something you actually understand something about, probably best to avoid uncritically amplifying shit about it

fallowseed@lemmy.world · 1 month ago

so you’re saying i’m wrong and i’m spreading misinfo… it’s somehow wrong that china got more juice out of the cards by bypassing cuda to better micromanage some aspect of the process?