Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model

(kimi.com)

144 points | by nekofneko 3 hours ago

17 comments

Tepix 2 hours ago
Huggingface Link: https://huggingface.co/moonshotai/Kimi-K2.5
1T parameters, 32b active parameters.
License: MIT with the following modification:
Our only modification part is that, if the Software (or any derivative works thereof) is used for any of your commercial products or services that have more than 100 million monthly active users, or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.
[-]
- dheera 1 hour ago
  > or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.
  Why not just say "you shall pay us 1 million dollars"?
  [-]
  - vessenes 0 minutes ago
    ? They prefer the branding. The license just says you have to say it was them if you make > $250mm a year on the model.
  - clayhacks 1 hour ago
    I assume this allows them to sue for different amounts. And not discourage too many people from using it.
- Imustaskforhelp 1 hour ago
  Hey have they open sourced all Kimi k2.5 (thinking,instruct,agent,agent swarm [beta])?
  Because I feel like they mentioned that agent swarm is available their api and that made me feel as if it wasn't open (weights)*? Please let me know if all are open source or not?
pu_pe 16 minutes ago
I don't get this "agent swarm" concept. You set up a task and they boot up 100 LLMs to try to do it in parallel, and then one "LLM judge" puts it all together? Is there anywhere I can read more about it?
[-]
- rvnx 1 minute ago
  You have a team lead that establishes a list of tasks needed to achieve your mission, then it creates a list of employees, each of them is specialized for a task, and they work in parallel.
  Essentially hiring a team of people who get specialized on one problem.
  Do one thing and do it well.
- jonkoops 10 minutes ago
  The datacenters yearn for the chips.
bertili 21 minutes ago
The "Deepseek moment" is just one year ago today!
Coincidence or not, let's just marvel for a second over this amount of magic/technology that's being given away for free... and how liberating and different this is than OpenAI and others that were closed to "protect us all".
jumploops 3 hours ago
> For complex tasks, Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls.
> K2.5 Agent Swarm improves performance on complex tasks through parallel, specialized execution [..] leads to an 80% reduction in end-to-end runtime
Not just RL on tool calling, but RL on agent orchestration, neat!
[-]
Reubend 2 hours ago
I've read several people say that Kimi K2 has a better "emotional intelligence" than other models. I'll be interested to see whether K2.5 continues or even improves on that.
[-]
- storystarling 1 hour ago
  yes, though this is highly subjective - it 'feels' like that to me as well (comapred to Gemini 3, GPT 5.2, Opus 4.5).
zmmmmm 1 hour ago
Curious what would be the most minimal reasonable hardware one would need to deploy this locally?
[-]
- NitpickLawyer 1 hour ago
  I parsed "reasonable" as in having reasonable speed to actually use this as intended (in agentic setups). In that case, it's a minimum of 70-100k for hardware (8x 6000 PRO + all the other pieces to make it work). The model comes with native INT4 quant, so ~600GB for the weights alone. An 8x 96GB setup would give you ~160GB for kv caching.
  You can of course "run" this on cheaper hardware, but the speeds will not be suitable for actual use (i.e. minutes for a simple prompt, tens of minutes for high context sessions per turn).
Topfi 1 hour ago
K2 0905 and K2 Thinking shortly after that have done impressively well in my personal use cases and was severely slept on. Faster, more accurate, less expensive, more flexible in terms of hosting and available months before Gemini 3 Flash, I really struggle to understand why Flash got such positive attention at launch.
Interested in the dedicated Agent and Agent Swarm releases, especially in how that could affect third party hosting of the models.
[-]
- msp26 45 minutes ago
  K2 thinking didn't have vision which was a big drawback for my projects.
striking 1 hour ago
https://archive.is/P98JR
spaceman_2020 2 hours ago
Kimi was already one of the best writing models. Excited to try this one out
Jackson__ 1 hour ago
As your local vision nut, their claims about "SOTA" vision are absolutely BS in my tests.
Sure it's SOTA at standard vision benchmarks. But on tasks that require proper image understanding, see for example BabyVision[0] it appears very much lacking compared to Gemini 3 Pro.
[0] https://arxiv.org/html/2601.06521v1
pplonski86 2 hours ago
There are so many models, is there any website with list of all of them and comparison of performance on different tasks?
[-]
- Reubend 1 hour ago
  The post actually has great benchmark tables inside of it. They might be outdated in a few months, but for now, it gives you a great summary. Seems like Gemini wins on image and video perf, Claude is the best at coding, ChatGPT is the best for general knowledge.
  But ultimately, you need to try them yourself on the tasks you care about and just see. My personal experience is that right now, Gemini Pro performs the best at everything I throw at it. I think it's superior to Claude and all of the OSS models by a small margin, even for things like coding.
  [-]
  - Imustaskforhelp 1 hour ago
    I like Gemini Pro's UI over Claude so much but honestly I might start using Kimi K2.5 if its open source & just +/- Gemini Pro/Chatgpt/Claude because at that point I feel like the results are negligible and we are getting SOTA open source models again.
- coffeeri 1 hour ago
  There is https://artificialanalysis.ai
  [-]
  - pplonski86 1 hour ago
    Thank you! Exactly what I was looking for
DeathArrow 2 hours ago
Those are some impressive benchmark results. I wonder how well it does in real life.
Maybe we can get away with something cheaper than Claude for coding.
[-]
- oneneptune 2 hours ago
  I'm curious about the "cheaper" claim -- I checked Kimi pricing, and it's a $200/mo subscription too?
  [-]
  - NitpickLawyer 2 hours ago
    On openrouter 2.5 is at 0.60/3$ per Mtok. That's haiku pricing.
    [-]
    - storystarling 42 minutes ago
      The unit economics seem tough at that price for a 1T parameter model. Even with MoE sparsity you are still VRAM bound just keeping the weights resident, which is a much higher baseline cost than serving a smaller model like Haiku.
  - mrklol 1 hour ago
    They also have a $20 and $40 tier.
lrvick 2 hours ago
Actually open source, or yet another public model, which is the equivalent of a binary?
URL is down so cannot tell.
[-]
- typ 38 minutes ago
  The label 'open source' has become a reputation reaping and marketing vehicle rather than an informative term since the Hugging Face benchmark race started. With the weights only, we cannot actually audit that if a model is a) contaminated by benchmarks, b) built with deliberate biases, or c) trained on copyrighted/privacy data, let alone allowing other vendors to replicate the results. Anyways, people still love free stuff.
  [-]
  - Der_Einzige 16 minutes ago
    Just accept that IP laws don't matter and the old "free software" paradigm is dead. Aaron Swartz died so that GenAI may live. RMS and his model of "copyleft" are so Web 1.0 (not even 2.0). No one in GenAI cares AT ALL about the true definition of open source. Good.
- Tepix 2 hours ago
  It's open weights, not open source.
maximgeorge 2 hours ago
[dead]
mangolie 3 hours ago
they cooked
billyellow 3 hours ago
Cool
rvz 1 hour ago
The chefs at Moonshot have cooked once again.