
ChatGPT Competes with Humans Across 44 Professions

OpenAI’s latest benchmark, GDPval, shows that its ChatGPT models now perform at levels comparable to human experts across 44 professions. The evaluation tested real-world tasks in nine major industries crucial to the U.S. economy, and the benchmark was designed by professionals averaging 14 years of experience.

OpenAI’s newest model, GPT-5, won or tied against human experts 38.8% of the time. Anthropic’s Claude Opus 4.1 scored higher at 47.6%, highlighting growing competition in AI for workplace applications. Experts note, however, that real-world jobs often involve more than the tasks assessed, and that many AI pilot projects still struggle to produce measurable revenue gains because of low-quality output.

In a separate announcement, OpenAI launched Instant Checkout, allowing U.S. users to buy products from Etsy and Shopify stores directly within ChatGPT. Powered by the open-source Agentic Commerce Protocol developed with Stripe, the feature supports payments via credit cards and digital wallets. The launch boosted Etsy and Shopify shares, reflecting market confidence in AI-driven e-commerce.

Meanwhile, Microsoft and Anthropic are expanding their offerings with autonomous AI agents capable of handling more complex professional tasks. Analysts say these developments signal a shift toward AI becoming increasingly integrated into both work and commerce, though limitations remain.

Overall, the advancements demonstrate AI’s growing capabilities in professional and commercial settings, reshaping industries while still facing significant practical and economic challenges.

Written by
Sazid Kabir

