Models
November 23, 2025

DeepSeek and Gemini models outperform ChatGPT in user ratings

A large-scale Prolific study ranked ChatGPT only 8th, behind DeepSeek, Gemini, Mistral, and Grok. Gemini 2.5 Pro and DeepSeek models dominated real user satisfaction and task-quality ratings.

Leading AI models using real user tasks and preference scoring. ChatGPT unexpectedly ranked 8th, trailing behind models from DeepSeek, Mistral, Google, and xAI.

Gemini 2.5 Pro received the highest performance ratings, followed closely by DeepSeek v3 and DeepSeek R1, which users preferred for reasoning depth, consistency, and speed-to-answer.

The results reflect a growing shift in user sentiment: high-performance, lower-cost alternatives are increasingly challenging OpenAI’s dominance. For enterprises evaluating multi-model strategies, this shows the competitive landscape is diversifying rapidly, especially with Chinese and open-weight models gaining traction.

#
DeepSeek

Read Our Content

See All Blogs
Gen AI

How OpenAI and Amazon Bedrock are building a next generation AI orchestration platform for enterprise AI

Deveshi Dabbawala

March 5, 2026
Read more
LLM Models

Why LLM benchmarking on leaderboards is not enough for enterprise AI

Deveshi Dabbawala

March 3, 2026
Read more