AITech

Powerful AI on a Budget: DeepSeek’s Distilled R1 Model Needs Just One GPU

11
DeepSeek Mobile

DeepSeek, a Chinese AI lab, has released a new smaller version of its advanced R1 reasoning AI model called DeepSeek-R1-0528-Qwen3-8B. This “distilled” model delivers strong performance while running on much less powerful hardware, making it accessible for wider use.

Built on Alibaba’s Qwen3-8B foundation model, the distilled R1 outperforms Google’s Gemini 2.5 Flash on a tough math challenge called AIME 2025. It also nearly matches Microsoft’s Phi 4 reasoning plus model on another math test known as HMMT.

While smaller models like DeepSeek-R1-0528-Qwen3-8B usually have fewer capabilities than full-sized versions, they require far less computing power.

The full R1 model needs around a dozen GPUs with 80GB of memory, but the distilled model can run on a single GPU with 40GB to 80GB of RAM, such as an Nvidia H100.

DeepSeek created the smaller model by fine-tuning Qwen3-8B using text generated by the larger R1 model. The company describes this model as ideal for academic research and industrial use where smaller AI models are needed.

Importantly, DeepSeek-R1-0528-Qwen3-8B is available under an open MIT license, allowing commercial use without restrictions. Several platforms, including LM Studio, already provide access to the model via API.

Written by
Sazid Kabir

I've loved music and writing all my life. That's why I started this blog. In my spare time, I make music and run this blog for fellow music fans.

Related Articles

Microsoft
TechAI

Microsoft’s Chief Product Officer Reassures Coders Amid Layoffs: AI Is Transforming, Not Replacing, Software Development

Microsoft’s Chief Product Officer, Aparna Chennapragada, recently addressed concerns surrounding the future...

Qualcomm Snapdragon Processor
Tech

Qualcomm Faces Legal Setback as Judge Allows Patent Lawsuit Over Snapdragon Chips to Proceed

Qualcomm is in legal trouble after a judge refused to dismiss a...

intel
Tech

Intel Confirms Arrow Lake-S Refresh CPUs With LGA 1851 & 800-Series Motherboard Compatibility

Intel’s next round of desktop CPUs — the Arrow Lake-S Refresh under...

Nvidia CEO Jensen Huang at CES 2025
Tech

NVIDIA CEO Confirms Huawei’s CloudMatrix Matches Grace Blackwell, Admits They Can’t Be Stopped

NVIDIA CEO Jensen Huang has publicly acknowledged that Huawei’s AI hardware has...