Close Menu
NoMusica.com
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    NoMusica.comNoMusica.com
    • Entertainment
    • Music
      • Music Production
    • Tech
      • AI
      • Electronics & Gadgets
      • Apps & Updates
      • Smartphones
    • Films & Shows
    • Gaming
    • Streaming
    NoMusica.com
    Home»AI

    Grok-3 Outperforms DeepSeek-R1, Matches OpenAI o1 Pro

    February 18, 2025
    xAI - Grok
    Share
    Facebook Twitter LinkedIn Pinterest Email

    xAI, the AI research company founded by Elon Musk, has unveiled its latest model, Grok-3, and early benchmarks suggest it outperforms several competitors.

    The model has reportedly surpassed the 1400 mark on Chatbot Arena, making it one of the most capable AI models available today.

    Grok-3’s Strengths in Reasoning and Research

    One of Grok-3’s standout features is its advanced reasoning (Think) capabilities and a deep research function called DeepSearch. AI researcher Andrej Karpathy, founder of Eureka Labs and a former OpenAI and Tesla researcher, was given early access to the model.

    In a post on X (formerly Twitter), Karpathy shared his experience, stating that Grok-3 successfully handled complex tasks, such as creating a hex grid for Settlers of Catan, a challenge that only OpenAI’s top-tier models like o1 Pro ($200/month) have mastered. In contrast, he noted that models like DeepSeek-R1, Gemini 2.0 Flash Thinking, and Claude failed in this area.

    Karpathy also tested Grok-3’s ability to analyze the technical specifications of AI models, uploading OpenAI’s GPT-2 paper to estimate the required flops for training. He noted that Grok-3, with its reasoning feature, solved the task flawlessly, while OpenAI’s o1 Pro and GPT-4o failed.

    How Grok-3 Compares to Other Leading AI Models

    Karpathy described Grok-3’s performance as being on par with OpenAI’s o1 Pro and superior to DeepSeek-R1. However, he acknowledged that further evaluation is needed to determine its true ranking in the AI race.

    He also tested Grok-3’s DeepSearch capabilities, which are designed to enhance research. While he found them comparable to Perplexity AI’s deep research, he noted that Grok-3 still struggles with hallucinating URLs and lacks accurate citations.

    In one test, the model listed 12 major AI labs but failed to include xAI itself, highlighting some remaining gaps in its research abilities.

    Experts React to Grok-3’s Performance

    After two hours of testing, Karpathy concluded that Grok-3 feels close to the state-of-the-art AI models, calling it “slightly better than DeepSeek-R1 and Gemini 2.0 Flash Thinking.”

    Other AI experts, including Lex Fridman, also praised the model. In his own post on X, he said, “My mind is blown, very impressive model.”

    With xAI aggressively improving Grok-3, the AI landscape is heating up, and competition with OpenAI, Google, and other AI leaders is fiercer than ever.

    Grok AI xAI
    Sazid Kabir
    • Website
    • X (Twitter)
    • Pinterest
    • Instagram
    • LinkedIn

    Founder & Chief Editor, NoMusica.com. Sazid Kabir is a tech writer and music producer covering music, tech, and music production with both analytical and practical experience.

    Keep Reading

    5 Best Free AI Image Generators in 2026: Tested & Compared

    10 Free AI Courses With Certificates for High-Income Skills in 2026

    Best Discord AI Bots in 2026 (Safe, Useful & Verified Tools)

    15 Best AI Apps for Daily Use (2026 Guide)

    10 Free AI Voice Changers for Gamers and Streamers in 2026

    Best Free AI Tools for Content Creators in 2026 (Features, Limits, and Real Use Cases)

    Add A Comment
    Leave A Reply Cancel Reply

    Latest Posts

    XXL 2026 Freshman Class Is Here… But Did They Get It Right?

    June 26, 2026

    People Want To Know Why Cities Are Banning Kanye But Not Netanyahu & Israel’s Genocidal Regime, Epstein Friends

    June 24, 2026

    Drake Fans Go Off On Lil Yachty Over A$AP Rocky Link-Up

    June 22, 2026

    New Pooh Shiesty x GloRilla Track Sparks Gucci Mane Debate Online

    June 22, 2026

    #HimToo: The Man Cassie Ventura exposed to STDs & Public Embarrassment, wants Accountability

    June 20, 2026
    Pages
    • Home
    • Blog
    • About
    • Contact
    • Advertise
    • Cookie Policy
    • Privacy Policy
    Categories
    • AI
    • Tech & Science
    • Films & TV Shows
    • Entertainment
    • Music
    • Streaming
    • Music Production
    Random Reads

    Microsoft Pauses New Windows 11 Features Until 2025

    CashNasty and Tylil Meet LeBron James Thanks to Kai Cenat

    Pusha T Addresses Backlash Over PnB Rock Reference in New Song

    Facebook X (Twitter) Instagram Pinterest
    © 2026 WowPress Digital

    Type above and press Enter to search. Press Esc to cancel.