Experimenters ran overnight tests confirming they have OPEN SOURCE DeepSeek R1 running at 200 tokens per second on a NON-INTERNET-connected Raspberry Pi.
How? I thought you needed huge amounts of VRAM on exorbitantly priced GPUs to run LLMs with decent capability. Are they just running a really small model, or is it hyper-parameterized? Or is the “thinking” process just that effective that it can make up for a weak LLM?
Even though it is the smallest of the distilled models, it still outperforms GPT-4o and Claude 3.5 Sonnet on some math and reasoning benchmarks.
The 7B-parameter model crushes the older models on performance benchmarks, and the 14B model is very competitive with OpenAI o1-mini on many metrics.
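To put numbers on the VRAM question: a distilled model that small doesn't need a GPU at all once it's quantized. Here's a back-of-envelope sketch in Python; the 1.5B parameter count and the quantization levels are assumptions for illustration, not something confirmed in the thread:

def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    # Approximate RAM needed just to hold the quantized weights.
    return n_params * bits_per_weight / 8 / 1e9

params = 1.5e9  # assumed size of the smallest distill (DeepSeek-R1-Distill-Qwen-1.5B)
for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: ~{weight_memory_gb(params, bits):.2f} GB")

That prints roughly 3.00 GB at 16-bit, 1.50 GB at 8-bit, and 0.75 GB at 4-bit. At 4-bit the weights fit comfortably in an 8 GB Raspberry Pi 5's RAM, with room left over for the KV cache and the OS.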
Yeah, sounds like it's their smallest model.
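If anyone wants to try reproducing it, here is a minimal sketch using llama-cpp-python with a quantized GGUF build of the 1.5B distill. The model filename is a placeholder for whichever quantized file you download, and the thread doesn't say which runtime the testers actually used:

from llama_cpp import Llama

llm = Llama(
    model_path="DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M.gguf",  # placeholder filename
    n_ctx=2048,    # modest context window to keep RAM use low
    n_threads=4,   # match the Pi's core count
)

out = llm("Solve step by step: what is 17 * 23?", max_tokens=256)
print(out["choices"][0]["text"])

Everything runs on CPU from ordinary RAM, which is the whole point: no GPU or VRAM involved.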