Viktor Eriksson
Skribent

DeepSeek says new method can train AI more efficiently and cheaply

news brief
Jan 2, 2026 · 1 min

The new research could be a harbinger of the company's next big model release after the R1.

Credit: DeepSeek

Chinese AI company DeepSeek has unveiled a new training method, Manifold-Constrained Hyper-Connections (mHC), which it says makes it possible to train large language models more efficiently and at lower cost, reports the South China Morning Post.

The method is a further development of so-called Hyper-Connections, a technique originally developed by ByteDance in 2024. That technology, in turn, builds on the classic ResNet architecture from Microsoft Research Asia.

DeepSeek says mHC delivers more stable and scalable training without increasing computational cost, thanks to targeted optimizations at the infrastructure level. The researchers have tested the technique on models with up to 27 billion parameters, with positive results.

According to experts cited by the South China Morning Post, the new method could be a foretaste of DeepSeek's next major model release. The company launched its high-profile R1 model around Chinese New Year 2025.