China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks 26

Posted by BeauHD on Monday July 14, 2025 @06:50PM from the new-challenger-appears dept.

Chinese AI startup Moonshot AI has released Kimi K2, a trillion-parameter open-source language model that outperforms GPT-4 in key benchmarks with particularly strong performance on coding and autonomous agent tasks. VentureBeat reports: The new model, called Kimi K2, features 1 trillion total parameters with 32 billion activated parameters in a mixture-of-experts architecture. The company is releasing two versions: a foundation model for researchers and developers, and an instruction-tuned variant optimized for chat and autonomous agent applications. "Kimi K2 does not just answer; it acts," the company stated in its announcement blog. "With Kimi K2, advanced agentic intelligence is more open and accessible than ever. We can't wait to see what you build."

The model's standout feature is its optimization for "agentic" capabilities -- the ability to autonomously use tools, write and execute code, and complete complex multi-step tasks without human intervention. In benchmark tests, Kimi K2 achieved 65.8% accuracy on SWE-bench Verified, a challenging software engineering benchmark, outperforming most open-source alternatives and matching some proprietary models. [...] On LiveCodeBench, arguably the most realistic coding benchmark available, Kimi K2 achieved 53.7% accuracy, decisively beating DeepSeek-V3's 46.9% and GPT-4.1's 44.7%. More striking still: it scored 97.4% on MATH-500 compared to GPT-4.1's 92.4%, suggesting Moonshot has cracked something fundamental about mathematical reasoning that has eluded larger, better-funded competitors.

But here's what the benchmarks don't capture: Moonshot is achieving these results with a model that costs a fraction of what incumbents spend on training and inference. While OpenAI burns through hundreds of millions on compute for incremental improvements, Moonshot appears to have found a more efficient path to the same destination. It's a classic innovator's dilemma playing out in real time -- the scrappy outsider isn't just matching the incumbent's performance, they're doing it better, faster, and cheaper.

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks

Post Load All Comments

Search 26 Comments Log In/Create an Account

Comments Filter:

- - Re:China (Score:5, Funny)
    
    by sit1963nz ( 934837 ) writes: on Monday July 14, 2025 @07:34PM (#65521060)
    
    So...basicaly what the USA did when the industrial age happened.
    The USA STILL steal intellectual property under the guise of "National security" needs.
    
    So Pot, meet Kettle.
    
    Reply to This Parent Share
    Flag as Inappropriate
    - Re: (Score:1, Troll)
      
      by Pinky's Brain ( 1158667 ) writes:
      
      They don't have million of expats in China and a totalitarian state to easily squeeze their families though.
      - Re: (Score:2, Insightful)
        
        by Valgrus Thunderaxe ( 8769977 ) writes:
        
        The US *is* a totalitarian state.
        
        Re: (Score:2)
        
        by DamnOregonian ( 963763 ) writes:
        
        The Chinese legal system is different than the US one.
        Indeed it is.
        In China, you have precisely no right to be free of unlawful detention. Indeed, there isn't even such a thing.
        You can't make generalizations based on your ignorance of other systems.
        Of course- you should never do that.
        But you can point out the non-generalizations about the Chinese legal system that are fucking atrocious.
        
        What the fuck is up with people fellating China these days? It's fucking insanity.
        If a government is based on the principle of dictatorship of the proletariat, and the proletariat's power is vested in one fucking body, you have a dictatorship o
        
        Re: (Score:2)
        
        by DamnOregonian ( 963763 ) writes:
        
        You and those who moderated you positively are fucking morons.
        
        Educate yourself, you fucking dullard. [wikipedia.org]
      - Re: (Score:1)
        
        by sit1963nz ( 934837 ) writes:
        
        Totalitarianism is indépendant of left/right wing politics.
        Most 1st world countries have "socialist" policies in Universal healthcare, universal education, etc etc etc but they are also far more democratic, healthier, safer , happier, with better life expectancies than the USA . Trump is rapidly running further to the right WHILE also becoming totalitarian .
        
        Re: (Score:1)
        
        by DamnOregonian ( 963763 ) writes:
        
        Misinformation. [equality-o...tunity.org]
        The US has a problem with its poorest folks that many other first world countries do not have. However, its middle and well-off match and exceed the rest of the world, respectively.
        Judging by your word selection, I'm guessing you're a francophone.
        Here's France's income distributed life expectancy. [niussp.org]
        
        Which is better? I suppose that's up to the beholder. If you're going for maximum amount of likely years lived- the US is.
        If you're looking for a better place to be poor? Well, that's actuall
- Re: (Score:2)
  
  by 4wdloop ( 1031398 ) writes:
  
  Indeed, however comunism never existed anywhere, even in China. At best, they are totalitarian, and hopfully leaning benevolent today. But I see that since USofA is leaving the stage, there is indeed a vaccum.
Chinese engineers and scientists are smart (Score:4, Insightful)

by MpVpRb ( 1423381 ) writes: on Monday July 14, 2025 @07:06PM (#65520984)

Attempting to prevent them from acquiring tech is futile and counterproductive
Politicians like to see everything as a race
Warmongers and defense contractors see everything as a threat that requires more military spending
Cooperation would be a better strategy

Reply to This Share
Flag as Inappropriate
- Re:Chinese engineers and scientists are smart (Score:4, Insightful)
  
  by sg_oneill ( 159032 ) writes: on Tuesday July 15, 2025 @12:51AM (#65521520)
  
  Theres this obnoxious myth a lot of people at least subconsciously seem to have that innovation only comes from americans europeans, australians and. .... well you can probably figure the commonality, and it aint english.
  We used to accuse the Japanese of only ever stealing tech , we now know better, the japanese where phenomenal innovators until the arse fell out of their economy.
  The chinese have been great innovators for long before the wests industrial revolution. Its in the cultural DNA of the people. Yes, the chinese invent stuff, and they always have.
  We're not as special as we think we are.
  
  Reply to This Parent Share
  Flag as Inappropriate
You pay for it later... (Score:3)

by afaiktoit ( 831835 ) writes: on Monday July 14, 2025 @07:12PM (#65521002)

They use more compute time during inference so in the long run they use more energey/computer.

Reply to This Share
Flag as Inappropriate
- Re: (Score:2)
  
  by martin-boundary ( 547041 ) writes:
  
  "In the long run, we're all dead."
- Re: (Score:2)
  
  by DamnOregonian ( 963763 ) writes:
  
  At 33B active parameters in an MoE? No, they most certainly do not.
No one knows what GPT-4 really costs to run (Score:2)

by Pinky's Brain ( 1158667 ) writes:

Most of their costs could be developing a lot of the basics, which continually diffuse away to other companies and China through ex-employees, requiring a lot more expenses on salary and exploration in training than the competition.
Wonder how much of this is distillation... (Score:4, Insightful)

by ndykman ( 659315 ) writes: on Monday July 14, 2025 @07:18PM (#65521016)

Not that I care if they are using other companies models to ease costs. You can't inhale the internet, wave your hands about copyright and then complain "IP" when somebody uses your stuff in way you don't like.
If it takes more air out of the AI bubble, all the better, I say.

Reply to This Share
Flag as Inappropriate
- Re: (Score:2)
  
  by DrMrLordX ( 559371 ) writes:
  
  One wonders if it identifies as ChatGPT.
Just don't ask about the Tiananmen Square masacre (Score:3, Informative)

by caseih ( 160668 ) writes: on Monday July 14, 2025 @07:56PM (#65521114)

I'm sure it's been well trained to ensure you get the correct party-approved information.
Bu seriously I am curious as to how these chinese models react to questions about things the CCP does not want people to talk about. The CCP has a long history of attempting to apply censorship all across the world.

Reply to This Share
Flag as Inappropriate
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Try asking Trump about the Epstein files instead. See if he wants to talk about that!
- Re: (Score:2)
  
  by AmiMoJo ( 196126 ) writes:
  
  No need to wonder, just go try one.
  For example, with DeepSeek if you download their AI and run it locally, it doesn't care about what the CCP wants and will happily tell you what you ask for. If you use it on their website, it depends if you are in China or not.
  In other words, it's exactly like Western AIs. If you ask Siri about Tienanmen Square, the answer will depend on if you are in China or not.
OpenAI (Score:1)

by systemd-anonymousd ( 6652324 ) writes:

I'd like to take this time to once again laugh at the "open," "non-profit" OpenAI, that took the anti-human route and is now rapidly sinking. Good riddance.
- Re: (Score:3)
  
  by DamnOregonian ( 963763 ) writes:
  
  New to the LLM benchmarking game, I see.
  Every new model is better. That's because they fine-tune them to be better at the benchmarks, and the benchmarks keep adjusting to the new SOTA.
  Also, out of curiosity, how are we defining "rapidly sinking?"
  
  I mean, I'm with you on criticizing the bullshit of OpenAI being completely non-open, but they are otherwise still basically top of the pack.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks 26

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks More | Reply Login

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks

Re:China (Score:5, Funny)

Re: (Score:1, Troll)

Re: (Score:2, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Chinese engineers and scientists are smart (Score:4, Insightful)

Re:Chinese engineers and scientists are smart (Score:4, Insightful)

You pay for it later... (Score:3)

Re: (Score:2)

Re: (Score:2)

No one knows what GPT-4 really costs to run (Score:2)

Wonder how much of this is distillation... (Score:4, Insightful)

Re: (Score:2)

Just don't ask about the Tiananmen Square masacre (Score:3, Informative)

Re: (Score:1)

Re: (Score:2)

OpenAI (Score:1)

Re: (Score:3)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot