AITech & Science

Researchers Develop $20 AI Model to Compete with OpenAI’s o1 Reasoning System”

83
Artificial Intelligence (AI)

In an impressive breakthrough, researchers have successfully created a low-cost alternative to OpenAI’s o1 “reasoning” model, named s1.

This new model was developed by a team from DeepSeek and other AI labs, who aimed to replicate the performance of OpenAI’s o1 using more efficient and cost-effective methods.

By focusing on “test-time scaling” (giving the model more time to reason before answering), the team found success with their approach, which contrasts with the more expensive reinforcement learning methods typically used.

S1 was built using a relatively small dataset—only 1,000 carefully curated questions—and fine-tuned using supervised fine-tuning (SFT), a cheaper method compared to large-scale reinforcement learning.

This approach allowed the model to be trained in under 30 minutes with 16 Nvidia H100 GPUs at a cost of just $20 in compute time.

The researchers employed a unique strategy to enhance the model’s reasoning by incorporating the instruction to “wait” during the reasoning process.

This simple adjustment helped improve the accuracy of the model’s responses. The project utilized a free, off-the-shelf AI model from Alibaba-owned Chinese lab Qwen, which helped keep the costs down.

While s1 shows strong performance on certain AI benchmarks, it demonstrates that distillation can be a practical method for creating competitive AI models without the enormous costs associated with training models like those developed by major tech companies.

However, it’s important to note that this method doesn’t yet push the boundaries of AI innovation beyond current models.

Written by
Sazid Kabir

I've loved music and writing all my life. That's why I started this blog. In my spare time, I make music and run this blog for fellow music fans.

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Stay updated with nomusica.com. Add us to your preferred sources to see our latest updates first.

Related Articles

Accenture
Finance & BusinessTech & Science

Accenture Buys Speedtest and Downdetector in $1.2 Billion Mega Deal

Global consulting giant Accenture has agreed to buy the entire Connectivity division...

ChatGPT - OpenAI
AI

ChatGPT Uninstalls Jump 295% After OpenAI’s DoD Deal

Uninstalls of the ChatGPT app in the United States jumped 295% in...

Alibaba Qwen 3.5
AI

Alibaba’s New Qwen 3.5 Model Runs Fully Offline on iPhone 17 Pro

Alibaba Group has released its new Qwen 3.5 small model series, and...

Hack Warning Cyberattack
Tech & Science

47,000 GitHub Repos Hacked by AI Bot That Won’t Stop Bragging

An AI bot called hackerbot-claw is tearing through GitHub right now. It...