AITech & Science

DeepSeek’s AI Model Costs $1.6 Billion, Not $5.5 Million

92
DeepSeek Mobile

DeepSeek, the Chinese AI startup, recently sparked controversy in the AI industry by claiming that its R1 model was trained for just $5.5 million.

However, a new report by SemiAnalysis suggests the actual costs are significantly higher—around $1.6 billion—and that DeepSeek has access to 50,000 Nvidia GPUs.

The $5.5 Million Claim vs. Reality

DeepSeek’s initial claim about the costs of developing its Mixture-of-Experts model sent Nvidia’s stock plummeting by $600 billion.

The $5.5 million figure seemed too low compared to the vast investments by major AI companies.

According to SemiAnalysis, the true investment includes $944 million in operating costs and over $500 million on GPUs alone.

DeepSeek’s GPU Power

The firm has access to a huge array of Hopper GPUs, including 10,000 H800s and 10,000 H100s, along with H20s for Chinese-specific operations.

These GPUs are shared with High-Flyer, a quantitative hedge fund backing DeepSeek, distributed across multiple locations for trading, research, and training.

Talent and Data Centers

DeepSeek exclusively hires talent from China and offers salaries up to $1.3 million for top candidates.

Unlike other tech giants, DeepSeek runs its own data centers, giving it more control and flexibility for AI development.

Written by
Sazid Kabir

I've loved music and writing all my life. That's why I started this blog. In my spare time, I make music and run this blog for fellow music fans.

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Stay updated with nomusica.com. Add us to your preferred sources to see our latest updates first.

Related Articles

Accenture
Finance & BusinessTech & Science

Accenture Buys Speedtest and Downdetector in $1.2 Billion Mega Deal

Global consulting giant Accenture has agreed to buy the entire Connectivity division...

ChatGPT - OpenAI
AI

ChatGPT Uninstalls Jump 295% After OpenAI’s DoD Deal

Uninstalls of the ChatGPT app in the United States jumped 295% in...

Alibaba Qwen 3.5
AI

Alibaba’s New Qwen 3.5 Model Runs Fully Offline on iPhone 17 Pro

Alibaba Group has released its new Qwen 3.5 small model series, and...

Hack Warning Cyberattack
Tech & Science

47,000 GitHub Repos Hacked by AI Bot That Won’t Stop Bragging

An AI bot called hackerbot-claw is tearing through GitHub right now. It...