The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop

Author: Nora | Updated: Apr 12, 2025

The new chatbot from DeepSeek, which introduces itself with the words, "Hi, I was created so you can ask anything and get an answer that might even surprise you," has made significant waves in the AI industry. Its launch not only captured attention but also contributed to one of NVIDIA's largest stock price drops, showing how strongly DeepSeek has affected the market.

DeepSeek's AI model stands out due to its innovative architecture and training methods. Let's delve into the key technologies that set it apart:

Multi-token Prediction (MTP): Instead of predicting only the next token, the model is trained to predict several upcoming tokens at each position. This gives it a denser training signal and can also speed up text generation, improving both the accuracy and the efficiency of the model.

Mixture of Experts (MoE): DeepSeek V3 uses an architecture with 256 expert sub-networks and activates only eight of them for each token it processes. Because most experts stay idle for any given token, far less computation is needed per token, which speeds up training and improves overall efficiency.

Multi-head Latent Attention (MLA): This attention mechanism compresses the keys and values it stores into a compact latent representation. That reduces the memory needed while generating text, yet still lets the model attend to the most important parts of the input and capture nuanced details.
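The Mixture of Experts routing described above can be illustrated with a minimal sketch. It assumes a generic top-k gate with softmax weights over the selected experts; the article does not describe DeepSeek's actual gating function, so `route_token`, `moe_output`, the random scores, and the toy experts are all hypothetical stand-ins. Only the figures 256 and 8 come from the article.

```python
import math
import random

NUM_EXPERTS = 256  # experts available, as stated in the article
TOP_K = 8          # experts activated per token, as stated in the article


def route_token(scores, top_k=TOP_K):
    """Pick the top_k highest-scoring experts and return (index, weight) pairs.

    Assumption: weights are a softmax over the selected scores only.
    This is a generic gating scheme, not necessarily DeepSeek's.
    """
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:top_k]
    max_score = max(scores[i] for i in top)  # subtract the max for numerical stability
    exps = [math.exp(scores[i] - max_score) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_output(token, experts, scores):
    """Weighted sum of the selected experts' outputs; all other experts are never called."""
    return sum(weight * experts[i](token) for i, weight in route_token(scores))


# Toy setup: random router scores and trivial experts that just scale the input.
rng = random.Random(0)
scores = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
experts = [lambda x, scale=i + 1: x * scale for i in range(NUM_EXPERTS)]

selected = route_token(scores)
print(f"experts used: {len(selected)} of {NUM_EXPERTS}")
print(f"combined output: {moe_output(1.0, experts, scores):.3f}")
```

In this sketch only 8 of the 256 expert functions run for a given token, which is where the compute saving comes from.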

DeepSeek, a prominent Chinese startup, claims to have developed this competitive AI model at a relatively low cost. They assert that training the powerful DeepSeek V3 neural network cost them only $6 million and used just 2048 graphics processors.

However, analysts from SemiAnalysis report that DeepSeek's operations involve a much larger computational infrastructure. They estimate that DeepSeek uses approximately 50,000 NVIDIA Hopper GPUs, including 10,000 H800 units, 10,000 H100s, and additional H20 GPUs, spread across several data centers. These resources are used for AI training, research, and financial modeling. By the same estimate, the company's total investment in servers is around $1.6 billion, and its operational expenses are $944 million.

DeepSeek is a subsidiary of the Chinese hedge fund High-Flyer, which established it as a separate AI-focused division in 2023. Unlike many startups that rely on cloud computing, DeepSeek owns its data centers, giving it complete control over AI model optimization and faster innovation deployment. The company's self-funded status enhances its agility and decision-making speed.

Furthermore, DeepSeek attracts top talent from leading Chinese universities, with some researchers earning over $1.3 million annually. Despite these significant investments, the company's claim of training its latest model for just $6 million seems unrealistic, as this figure only accounts for GPU usage during pre-training and excludes other substantial costs such as research, model refinement, data processing, and infrastructure.
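A rough back-of-the-envelope calculation helps show the scale of the gap between the headline figure and the estimated infrastructure spend. The sketch below uses the dollar amounts and GPU count quoted in this article; the rental rate of $2 per GPU-hour is purely an assumption for illustration, and `implied_training_days` is a hypothetical helper.

```python
TRAINING_COST_USD = 6_000_000      # claimed V3 training cost (from the article)
TRAINING_GPUS = 2048               # GPUs DeepSeek says it used (from the article)
SERVER_CAPEX_USD = 1_600_000_000   # SemiAnalysis server estimate (from the article)

# Assumption, not from the article: an illustrative rental price per GPU-hour.
ASSUMED_RATE_USD_PER_GPU_HOUR = 2.0


def implied_training_days(cost_usd, gpus, rate_usd_per_gpu_hour):
    """Days of continuous training the budget would buy at the assumed rate."""
    gpu_hours = cost_usd / rate_usd_per_gpu_hour
    return gpu_hours / gpus / 24


days = implied_training_days(
    TRAINING_COST_USD, TRAINING_GPUS, ASSUMED_RATE_USD_PER_GPU_HOUR
)
capex_multiple = SERVER_CAPEX_USD / TRAINING_COST_USD

print(f"Implied run length: about {days:.0f} days on {TRAINING_GPUS} GPUs")
print(f"Estimated server spend is about {capex_multiple:.0f}x the claimed training cost")
```

Under that assumed rate, $6 million corresponds to roughly two months of a single pre-training run on 2048 GPUs, while the estimated server investment is several hundred times larger.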

Since its founding, DeepSeek has invested over $500 million in AI development. Its compact structure allows it to implement AI innovations quickly and effectively, unlike larger, more bureaucratic companies.

DeepSeek's example illustrates that a well-funded, independent AI company can compete with industry giants. While the company's success is driven by substantial investments, technical breakthroughs, and a strong team, the notion of a "revolutionary budget" for AI model development may be overstated. Nonetheless, DeepSeek's costs remain significantly lower than those of its competitors: training ChatGPT-4o cost $100 million, compared with DeepSeek's $5 million for R1.
