Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
AI China Programming

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks 26

Chinese AI startup Moonshot AI has released Kimi K2, a trillion-parameter open-source language model that outperforms GPT-4 in key benchmarks with particularly strong performance on coding and autonomous agent tasks. VentureBeat reports: The new model, called Kimi K2, features 1 trillion total parameters with 32 billion activated parameters in a mixture-of-experts architecture. The company is releasing two versions: a foundation model for researchers and developers, and an instruction-tuned variant optimized for chat and autonomous agent applications. "Kimi K2 does not just answer; it acts," the company stated in its announcement blog. "With Kimi K2, advanced agentic intelligence is more open and accessible than ever. We can't wait to see what you build."

The model's standout feature is its optimization for "agentic" capabilities -- the ability to autonomously use tools, write and execute code, and complete complex multi-step tasks without human intervention. In benchmark tests, Kimi K2 achieved 65.8% accuracy on SWE-bench Verified, a challenging software engineering benchmark, outperforming most open-source alternatives and matching some proprietary models. [...] On LiveCodeBench, arguably the most realistic coding benchmark available, Kimi K2 achieved 53.7% accuracy, decisively beating DeepSeek-V3's 46.9% and GPT-4.1's 44.7%. More striking still: it scored 97.4% on MATH-500 compared to GPT-4.1's 92.4%, suggesting Moonshot has cracked something fundamental about mathematical reasoning that has eluded larger, better-funded competitors.

But here's what the benchmarks don't capture: Moonshot is achieving these results with a model that costs a fraction of what incumbents spend on training and inference. While OpenAI burns through hundreds of millions on compute for incremental improvements, Moonshot appears to have found a more efficient path to the same destination. It's a classic innovator's dilemma playing out in real time -- the scrappy outsider isn't just matching the incumbent's performance, they're doing it better, faster, and cheaper.

China's Moonshot Launches Free AI Model Kimi K2 That Outperforms GPT-4 In Key Benchmarks

Comments Filter:
  • by MpVpRb ( 1423381 ) on Monday July 14, 2025 @07:06PM (#65520984)

    Attempting to prevent them from acquiring tech is futile and counterproductive
    Politicians like to see everything as a race
    Warmongers and defense contractors see everything as a threat that requires more military spending
    Cooperation would be a better strategy

    • by sg_oneill ( 159032 ) on Tuesday July 15, 2025 @12:51AM (#65521520)

      Theres this obnoxious myth a lot of people at least subconsciously seem to have that innovation only comes from americans europeans, australians and. .... well you can probably figure the commonality, and it aint english.

      We used to accuse the Japanese of only ever stealing tech , we now know better, the japanese where phenomenal innovators until the arse fell out of their economy.

      The chinese have been great innovators for long before the wests industrial revolution. Its in the cultural DNA of the people. Yes, the chinese invent stuff, and they always have.

      We're not as special as we think we are.

  • by afaiktoit ( 831835 ) on Monday July 14, 2025 @07:12PM (#65521002)
    They use more compute time during inference so in the long run they use more energey/computer.
  • Most of their costs could be developing a lot of the basics, which continually diffuse away to other companies and China through ex-employees, requiring a lot more expenses on salary and exploration in training than the competition.

  • by ndykman ( 659315 ) on Monday July 14, 2025 @07:18PM (#65521016)

    Not that I care if they are using other companies models to ease costs. You can't inhale the internet, wave your hands about copyright and then complain "IP" when somebody uses your stuff in way you don't like.

    If it takes more air out of the AI bubble, all the better, I say.

  • by caseih ( 160668 ) on Monday July 14, 2025 @07:56PM (#65521114)

    I'm sure it's been well trained to ensure you get the correct party-approved information.

    Bu seriously I am curious as to how these chinese models react to questions about things the CCP does not want people to talk about. The CCP has a long history of attempting to apply censorship all across the world.

    • by Anonymous Coward
      Try asking Trump about the Epstein files instead. See if he wants to talk about that!
    • by AmiMoJo ( 196126 )

      No need to wonder, just go try one.

      For example, with DeepSeek if you download their AI and run it locally, it doesn't care about what the CCP wants and will happily tell you what you ask for. If you use it on their website, it depends if you are in China or not.

      In other words, it's exactly like Western AIs. If you ask Siri about Tienanmen Square, the answer will depend on if you are in China or not.

  • I'd like to take this time to once again laugh at the "open," "non-profit" OpenAI, that took the anti-human route and is now rapidly sinking. Good riddance.

    • New to the LLM benchmarking game, I see.
      Every new model is better. That's because they fine-tune them to be better at the benchmarks, and the benchmarks keep adjusting to the new SOTA.
      Also, out of curiosity, how are we defining "rapidly sinking?"

      I mean, I'm with you on criticizing the bullshit of OpenAI being completely non-open, but they are otherwise still basically top of the pack.

Biology is the only science in which multiplication means the same thing as division.

Working...