Deepseek: The Chinese AI Model Shaking ChatGPT, Nvidia, and the AI Industry

Deepseek is a new family of AI models from China. On several benchmarks it matches or outperforms ChatGPT, Gemini, and Claude. Its app became the most downloaded free app in the US, and shares of Nvidia, Microsoft, and Meta dropped as a consequence. Here is why Deepseek is being called a game changer.

What is Deepseek?

In 2023, Liang Wenfeng founded Deepseek, an AI research lab located in Hangzhou. Liang was the head of a hedge fund that relied on AI for financial analytics. He recruited a team of graduates from top-tier Chinese universities such as Tsinghua University and Peking University. Deepseek’s models are open source, and their reported development cost is less than $6 million, in stark contrast to the far larger budgets of competitors such as OpenAI and Meta.

How Does Deepseek Compare to ChatGPT and Others?

Deepseek’s app reached first place in downloads in the US, the UK, and China. It provides unlimited free usage, unlike paid models such as ChatGPT-4 and Claude Sonnet. Google Gemini charges no fee but is based on older models and has usage limits. Deepseek’s R1 model is said to think like a human: it lays out its reasoning before concluding with an answer. Tests indicate that it is on par with the most recent OpenAI technology.
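
To make the "reasoning before the answer" behavior concrete, here is a minimal sketch of how an application might separate the two parts of a response. It assumes the model wraps its reasoning in <think>...</think> tags ahead of the final answer, a format commonly described for the open R1 releases but not confirmed by this article. The function name split_reasoning and the sample text are hypothetical.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think> before the answer.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw_output: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw model response."""
    match = THINK_BLOCK.search(raw_output)
    if match is None:
        # No reasoning block found: treat the whole response as the answer.
        return "", raw_output.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first <think> block is treated as the answer.
    answer = (raw_output[:match.start()] + raw_output[match.end():]).strip()
    return reasoning, answer


# Hypothetical response text, for illustration only.
sample = "<think>17 has no divisors between 2 and 4.</think>Yes, 17 is prime."
reasoning, answer = split_reasoning(sample)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

Keeping the reasoning separate lets an interface show it in a collapsible panel or hide it, while the answer is displayed on its own.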

Why is Deepseek Challenging Nvidia?

Unlike the majority of AI systems, which rely on high-end Nvidia chips, Deepseek can be run on cheaper, lower-end hardware. This is a crucial asset because the US no longer allows exports of advanced chips to China. Deepseek is demonstrating to the world that AI can perform well without top-tier hardware.

What Makes Deepseek Special?

  • Open-Source: Programmers around the world can use DeepSeek-V3 at no cost.
  • Affordable: DeepSeek-V3 was built for under $6 million, as opposed to the billions spent by competitors.
  • Explanatory: It clearly explains its reasoning process.
  • No Payment: Paid competitors have put their systems behind a paywall, whereas Deepseek has not.

How is Deepseek Changing the AI Industry?

The success of Deepseek shook the stock market: shares of Nvidia, Meta, and Microsoft tumbled. Deepseek supports the idea that the AI market can thrive without considerable spending. Analysts have claimed that low-cost Chinese technology can undercut large AI budgets, which poses a threat to the American economy. Deepseek also shows how open-source AI can give rise to new innovations.

Challenges for Deepseek

  • Trust: Users still need to be convinced of its reliability and safety.
  • Competition: Rivals such as OpenAI and Google have amassed substantial funding.
  • Regulations: Governments could hinder AI development by imposing tougher regulations.

The Future of Deepseek

Deepseek seeks to bridge the gap between culture and language in state-of-the-art AI innovation. Its affordable model will attract startups from around the globe. As it expands, big tech companies are expected to have to rethink their pricing strategies.

DeepSeek-V3 Capabilities

| Category | Benchmark (Metric) | DeepSeek V3 | DeepSeek V2.5-0905 | Qwen2.5 72B-Inst | Llama3.1 405B-Inst | Claude-3.5 Sonnet-1022 | GPT-4o 0513 |
|---|---|---|---|---|---|---|---|
| | Architecture | MoE | MoE | Dense | Dense | - | - |
| | # Activated Params | 37B | 21B | 72B | 405B | - | - |
| | # Total Params | 671B | 236B | 72B | 405B | - | - |
| English | MMLU (EM) | 88.5 | 80.6 | 85.3 | 88.6 | 88.3 | 87.2 |
| English | MMLU-Redux (EM) | 89.1 | 80.3 | 85.6 | 86.2 | 88.9 | 88 |
| English | MMLU-Pro (EM) | 75.9 | 66.2 | 71.6 | 73.3 | 78 | 72.6 |
| English | DROP (3-shot F1) | 91.6 | 87.8 | 76.7 | 88.7 | 88.3 | 83.7 |
| English | IF-Eval (Prompt Strict) | 86.1 | 80.6 | 84.1 | 86 | 86.5 | 84.3 |
| English | GPQA-Diamond (Pass@1) | 59.1 | 41.3 | 49 | 51.1 | 65 | 49.9 |
| English | SimpleQA (Correct) | 24.9 | 10.2 | 9.1 | 17.1 | 28.4 | 38.2 |
| English | FRAMES (Acc.) | 73.3 | 65.4 | 69.8 | 70 | 72.5 | 80.5 |
| English | LongBench v2 (Acc.) | 48.7 | 35.4 | 39.4 | 36.1 | 41 | 48.1 |
| Code | HumanEval-Mul (Pass@1) | 82.6 | 77.4 | 77.3 | 77.2 | 81.7 | 80.5 |
| Code | LiveCodeBench (Pass@1-COT) | 40.5 | 29.2 | 31.1 | 28.4 | 36.3 | 33.4 |
| Code | LiveCodeBench (Pass@1) | 37.6 | 28.4 | 28.7 | 30.1 | 32.8 | 34.2 |
| Code | Codeforces (Percentile) | 51.6 | 35.6 | 24.8 | 25.3 | 20.3 | 23.6 |
| Code | SWE Verified (Resolved) | 42 | 22.6 | 23.8 | 24.5 | 50.8 | 38.8 |
| Code | Aider-Edit (Acc.) | 79.7 | 71.6 | 65.4 | 63.9 | 84.2 | 72.9 |
| Code | Aider-Polyglot (Acc.) | 49.6 | 18.2 | 7.6 | 5.8 | 45.3 | 16 |
| Math | AIME 2024 (Pass@1) | 39.2 | 16.7 | 23.3 | 23.3 | 16 | 9.3 |
| Math | MATH-500 (EM) | 90.2 | 74.7 | 80 | 73.8 | 78.3 | 74.6 |
| Math | CNMO 2024 (Pass@1) | 43.2 | 10.8 | 15.9 | 6.8 | 13.1 | 10.8 |
| Chinese | CLUEWSC (EM) | 90.9 | 90.4 | 91.4 | 84.7 | 85.4 | 87.9 |
| Chinese | C-Eval (EM) | 86.5 | 79.5 | 86.1 | 61.5 | 76.7 | 76 |
| Chinese | C-SimpleQA (Correct) | 64.1 | 54.1 | 48.4 | 50.4 | 51.3 | 59.3 |
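
The architecture rows of the table say that the MoE (mixture-of-experts) models activate only part of their parameters, while the dense models activate all of them. The short sketch below works out that share from the parameter counts listed in the table. The helper name activated_share is hypothetical, and the calculation illustrates only the parameter ratio, not actual compute cost or speed.

```python
# Parameter counts in billions, copied from the table above.
MODELS = {
    "DeepSeek V3": {"activated": 37, "total": 671},
    "DeepSeek V2.5": {"activated": 21, "total": 236},
    "Qwen2.5 72B-Inst": {"activated": 72, "total": 72},
    "Llama3.1 405B-Inst": {"activated": 405, "total": 405},
}


def activated_share(activated: float, total: float) -> float:
    """Fraction of a model's parameters that are activated."""
    return activated / total


for name, params in MODELS.items():
    share = activated_share(params["activated"], params["total"])
    print(f"{name}: {share:.1%} of parameters activated")
```

For DeepSeek V3 this comes to roughly 5.5% of its parameters (37B out of 671B), compared with 100% for the dense models.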

Among the open-source models in the table, DeepSeek-V3 posts the highest score on 20 of the 22 benchmarks. It is even competitive with closed-source proprietary models such as OpenAI’s GPT-4o.
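
The comparison among open-source models can be checked directly against the benchmark table. The sketch below copies the four open-model columns from the table and counts the benchmarks on which DeepSeek V3 has the highest score. The variable and function names are hypothetical, and the closed-source columns are left out on purpose.

```python
# Columns follow this order; scores are copied from the table above.
OPEN_MODELS = ("DeepSeek V3", "DeepSeek V2.5", "Qwen2.5 72B-Inst", "Llama3.1 405B-Inst")

SCORES = {
    "MMLU (EM)": (88.5, 80.6, 85.3, 88.6),
    "MMLU-Redux (EM)": (89.1, 80.3, 85.6, 86.2),
    "MMLU-Pro (EM)": (75.9, 66.2, 71.6, 73.3),
    "DROP (3-shot F1)": (91.6, 87.8, 76.7, 88.7),
    "IF-Eval (Prompt Strict)": (86.1, 80.6, 84.1, 86.0),
    "GPQA-Diamond (Pass@1)": (59.1, 41.3, 49.0, 51.1),
    "SimpleQA (Correct)": (24.9, 10.2, 9.1, 17.1),
    "FRAMES (Acc.)": (73.3, 65.4, 69.8, 70.0),
    "LongBench v2 (Acc.)": (48.7, 35.4, 39.4, 36.1),
    "HumanEval-Mul (Pass@1)": (82.6, 77.4, 77.3, 77.2),
    "LiveCodeBench (Pass@1-COT)": (40.5, 29.2, 31.1, 28.4),
    "LiveCodeBench (Pass@1)": (37.6, 28.4, 28.7, 30.1),
    "Codeforces (Percentile)": (51.6, 35.6, 24.8, 25.3),
    "SWE Verified (Resolved)": (42.0, 22.6, 23.8, 24.5),
    "Aider-Edit (Acc.)": (79.7, 71.6, 65.4, 63.9),
    "Aider-Polyglot (Acc.)": (49.6, 18.2, 7.6, 5.8),
    "AIME 2024 (Pass@1)": (39.2, 16.7, 23.3, 23.3),
    "MATH-500 (EM)": (90.2, 74.7, 80.0, 73.8),
    "CNMO 2024 (Pass@1)": (43.2, 10.8, 15.9, 6.8),
    "CLUEWSC (EM)": (90.9, 90.4, 91.4, 84.7),
    "C-Eval (EM)": (86.5, 79.5, 86.1, 61.5),
    "C-SimpleQA (Correct)": (64.1, 54.1, 48.4, 50.4),
}


def leader(row):
    """Return the name of the open model with the highest score in a row."""
    best_index = max(range(len(row)), key=lambda i: row[i])
    return OPEN_MODELS[best_index]


leaders = {benchmark: leader(row) for benchmark, row in SCORES.items()}
v3_wins = [b for b, name in leaders.items() if name == "DeepSeek V3"]
exceptions = {b: name for b, name in leaders.items() if name != "DeepSeek V3"}

print(f"DeepSeek V3 leads {len(v3_wins)} of {len(SCORES)} benchmarks")
for benchmark, name in exceptions.items():
    print(f"  {benchmark}: led by {name}")
```

On these figures, the only benchmarks where another open model edges ahead are MMLU (Llama3.1 405B-Inst) and CLUEWSC (Qwen2.5 72B-Inst), both by less than one point.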

Deepseek makes the case that advanced AI does not have to be expensive or proprietary. It goes head-to-head with the likes of ChatGPT, challenges Nvidia’s dominance, and is shifting the patterns of global technology. The race for AI just got more intense.

Sayan Dutta

I am glad you came over here. So, you want to know a little bit about me. I am a passionate digital marketer, blogger, and engineer. I have knowledge of and experience in search engine optimization, digital analytics, Google algorithms, and many other things.
