Models
November 23, 2025

DeepSeek and Gemini models outperform ChatGPT in user ratings

A large-scale Prolific study ranked ChatGPT only 8th, behind DeepSeek, Gemini, Mistral, and Grok. Gemini 2.5 Pro and DeepSeek models dominated real user satisfaction and task-quality ratings.

Leading AI models using real user tasks and preference scoring. ChatGPT unexpectedly ranked 8th, trailing behind models from DeepSeek, Mistral, Google, and xAI.

Gemini 2.5 Pro received the highest performance ratings, followed closely by DeepSeek v3 and DeepSeek R1, which users preferred for reasoning depth, consistency, and speed-to-answer.

The results reflect a growing shift in user sentiment: high-performance, lower-cost alternatives are increasingly challenging OpenAI’s dominance. For enterprises evaluating multi-model strategies, this shows the competitive landscape is diversifying rapidly, especially with Chinese and open-weight models gaining traction.

#
DeepSeek

Read Our Content

See All Blogs
AI system implementation

Rogue Agent Impact Visualizer

Sarankumar S

May 28, 2026
Read more
AI system implementation

Reinforcement learning for LLMs: SDAR's for multi-turn agent training

Deveshi Dabbawala

May 21, 2026
Read more