Models
November 23, 2025

DeepSeek and Gemini models outperform ChatGPT in user ratings

A large-scale Prolific study ranked ChatGPT only 8th, behind DeepSeek, Gemini, Mistral, and Grok. Gemini 2.5 Pro and DeepSeek models dominated real user satisfaction and task-quality ratings.

Leading AI models using real user tasks and preference scoring. ChatGPT unexpectedly ranked 8th, trailing behind models from DeepSeek, Mistral, Google, and xAI.

Gemini 2.5 Pro received the highest performance ratings, followed closely by DeepSeek v3 and DeepSeek R1, which users preferred for reasoning depth, consistency, and speed-to-answer.

The results reflect a growing shift in user sentiment: high-performance, lower-cost alternatives are increasingly challenging OpenAI’s dominance. For enterprises evaluating multi-model strategies, this shows the competitive landscape is diversifying rapidly, especially with Chinese and open-weight models gaining traction.

#
DeepSeek

Read Our Content

See All Blogs
AI safety

Anthropic's AI agents just outpaced human researchers in safety tests

Deveshi Dabbawala

April 16, 2026
Read more
Gen AI

Anthropic’s Claude Managed Agents platform accelerates AI agent deployment for teams

Deveshi Dabbawala

April 9, 2026
Read more