{"id":4112384,"date":"2026-01-02T05:49:08","date_gmt":"2026-01-02T10:49:08","guid":{"rendered":"https:\/\/www.computerworld.com\/article\/4112384\/deepseeks-new-method-can-train-ai-more-efficiently-and-cheaply.html"},"modified":"2026-01-06T14:07:40","modified_gmt":"2026-01-06T19:07:40","slug":"deepseeks-new-method-can-train-ai-more-efficiently-and-cheaply","status":"publish","type":"post","link":"https:\/\/www.computerworld.com\/article\/4112384\/deepseeks-new-method-can-train-ai-more-efficiently-and-cheaply.html","title":{"rendered":"Deepseek says new method can train AI more efficiently and cheaply"},"content":{"rendered":"<div id=\"remove_no_follow\">\n\t\t<div class=\"grid grid--cols-10@md grid--cols-8@lg article-column\">\n\t\t\t\t\t  <div class=\"col-12 col-10@md col-6@lg col-start-3@lg\">\n\t\t\t\t\t\t<div class=\"article-column__content\">\n<section class=\"wp-block-bigbite-multi-title\"><div class=\"container\"><\/div><\/section>\n\n\n\n<p>Chinese AI company Deepseek has unveiled a new training method, Manifold-Constrained Hyper-Connections (mHC), which will make it possible to train large language models more efficiently and at lower cost, reports <a href=\"https:\/\/www.scmp.com\/tech\/big-tech\/article\/3338427\/deepseek-kicks-2026-paper-signalling-push-train-bigger-models-less?module=perpetual_scroll_0&amp;pgtype=article\">the South China Morning Post<\/a>.<\/p>\n\n\n\n<p>The method is a further development of so-called Hyper-Connections, which was originally developed by Bytedance in 2024. That technology, in turn, builds on the classic ResNet architecture from Microsoft Research Asia.<\/p>\n\n\n\n<p>Deepseek says mHC provides more stable and scalable training without increasing computational costs, thanks to specific optimizations at the infrastructure level. 
The researchers tested the technology on models with up to 27 billion parameters and report positive results.<\/p>\n\n\n\n<p>According to experts cited by the South China Morning Post, the new method could foreshadow the next big model release from Deepseek. The company launched its high-profile R1 model around Chinese New Year 2025.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper youtube-video\">\n<iframe loading=\"lazy\" title=\"Ransomware Guilty Pleas, Cheaper AI, Meta Deal | Ep. 31\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/MSnzw1s3TnY?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n<\/div><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Chinese AI company Deepseek has unveiled a new training method, Manifold-Constrained Hyper-Connections (mHC), which will make it possible to train large language models more efficiently and at lower cost, reports the South China Morning Post. The method is a further development of so-called Hyper-Connections, which was originally developed by Bytedance in 2024. 
That technology, in [&hellip;]<\/p>\n","protected":false},"author":2869,"featured_media":100071997,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"__idg_published_ids":[],"__idg_published_status":"draft","embargo_date":"","multi_title":"{\"titles\":{\"headline\":{\"value\":\"Deepseek says new method can train AI more efficiently and cheaply\",\"additional\":{\"short_title\":\"Deepseek says new method can train AI more efficiently and cheaply\",\"headline_subheadline\":\"The new research could be a harbinger of the company&#039;s next big model release after the R1. \",\"headline_desc\":\"The new research could be a harbinger of the company&#039;s next big model release after the R1. \"}},\"seo\":{\"value\":\"Deepseek says new method can train AI more efficiently and cheaply\",\"additional\":{\"seo_desc\":\"The new research could be a harbinger of the company&#039;s next big model release after the R1. \"}},\"social\":{\"value\":\"Deepseek says new method can train AI more efficiently and cheaply\",\"additional\":{\"social_desc\":\"The new research could be a harbinger of the company&#039;s next big model release after the R1. 
\"}}},\"subtitles\":[]}","old_id_in_onecms":"","_idg_updated_flag":false,"_idg_updated_date":"","hreflang_xdefault":0,"content_type":"News Brief","suppress_html_meta":"{}","byline":"","featured_video_id":0,"supress_floating_video":false,"prevent_index":0,"has_duration":0,"teaser_paragraphs":"","is_translated_post":1,"idg_original_post_id":4112305,"idg_translated_post_ids":[],"idg_original_post_publication":"ComputerSweden","idg_original_post_language":"Swedish","idg_original_post_brand":"computersweden.se","reviews":null,"suppress_monetization":"{}","is_premium":0,"external_post_link":"","suppress_fake_sidebar":"{}","first_published_date":"2026-01-02T10:46:42+01:00","hide_featured_image_for_post":false,"post_featured_image_nocaption":true,"post_featured_image_caption":"","automatic_content_time":1,"manual_content_time":0,"most_popular_author":null,"more_from_author":null,"footnotes":null},"categories":[1885,2888],"tags":[],"languages":[21],"editions":[12],"publication":[9,10],"territory":[],"story_types":[8414],"article_type":[],"sponsorships":[],"blogs":[],"podcast_series":[],"origin":[7179],"coauthors":[8415],"class_list":{"0":"post-4112384","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"category-generative-ai","9":"languages-en","10":"editions-global","11":"publication-computerworld","12":"publication-us-default","13":"story_types-news-brief","14":"origin-wp"},"jetpack_featured_media_url":"https:\/\/www.computerworld.com\/wp-content\/uploads\/2026\/01\/4112384-0-35706600-1767726501-deepseek_17d19c.jpg?quality=50&strip=all","eyebrow":{"eyebrow":"news brief","eyebrow_style":"default","eyebrow_feed_title":"news brief","eyebrow_feed_style":"default"},"review_score":null,"article_type_name":"","author_name":"Viktor Eriksson","author_meta":[{"authorID":2869,"name":"Viktor 
Eriksson","url":"https:\/\/www.computerworld.com\/profile\/viktor-eriksson\/","img":{"media_id":100072078,"full":"https:\/\/www.computerworld.com\/wp-content\/uploads\/2026\/01\/2869-0-90205100-1767812974-author_photo_Viktor-Eriksson_1705072267.jpeg?quality=50&strip=all"},"defaultUrl":"https:\/\/secure.gravatar.com\/avatar\/f394d03d0ba2a56273308e9b182c7735a2f7b85706e9b8da66a587cddfa41920?s=96&d=mm&r=g","profileImage":"<img data-hero alt=\"Viktor Eriksson\" src=\"https:\/\/www.computerworld.com\/wp-content\/uploads\/2026\/01\/2869-0-90205100-1767812974-author_photo_Viktor-Eriksson_1705072267.jpeg?quality=50&#038;strip=all&#038;w=160\" class=\"author_photo\" height=\"250\" width=\"250\" \/>","job_title":"Skribent"}],"multiple_name":"Viktor Eriksson","_embedded":"Viktor Eriksson","_links":{"self":[{"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/posts\/4112384","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/users\/2869"}],"replies":[{"embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/comments?post=4112384"}],"version-history":[{"count":0,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/posts\/4112384\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/media\/100071997"}],"wp:attachment":[{"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/media?parent=4112384"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/categories?post=4112384"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/tags?post=4112384"},{"taxonomy":"languages","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/languages?post=4
112384"},{"taxonomy":"editions","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/editions?post=4112384"},{"taxonomy":"publication","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/publication?post=4112384"},{"taxonomy":"territory","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/territory?post=4112384"},{"taxonomy":"story_types","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/story_types?post=4112384"},{"taxonomy":"article_type","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/article_type?post=4112384"},{"taxonomy":"sponsorships","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/sponsorships?post=4112384"},{"taxonomy":"blogs","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/blogs?post=4112384"},{"taxonomy":"podcast_series","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/podcast_series?post=4112384"},{"taxonomy":"origin","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/origin?post=4112384"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.computerworld.com\/wp-json\/wp\/v2\/coauthors?post=4112384"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}