danielhanchen commited on
Commit
0422fe4
·
verified ·
1 Parent(s): 51082bc

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +119 -0
README.md ADDED
@@ -0,0 +1,119 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - moonshotai/Kimi-Dev-72B
5
+ tags:
6
+ - code
7
+ - unsloth
8
+ - swebench
9
+ - software
10
+ - issue-resolving
11
+ ---
12
+ <div>
13
+ <p style="margin-top: 0;margin-bottom: 0;">
14
+ <em><a href="https://docs.unsloth.ai/basics/unsloth-dynamic-v2.0-gguf">Unsloth Dynamic 2.0</a> achieves superior accuracy & outperforms other leading quants.</em>
15
+ </p>
16
+ <div style="display: flex; gap: 5px; align-items: center; ">
17
+ <a href="https://github.com/unslothai/unsloth/">
18
+ <img src="https://github.com/unslothai/unsloth/raw/main/images/unsloth%20new%20logo.png" width="133">
19
+ </a>
20
+ <a href="https://discord.gg/unsloth">
21
+ <img src="https://github.com/unslothai/unsloth/raw/main/images/Discord%20button.png" width="173">
22
+ </a>
23
+ <a href="https://docs.unsloth.ai/basics/qwen3-how-to-run-and-fine-tune">
24
+ <img src="https://raw.githubusercontent.com/unslothai/unsloth/refs/heads/main/images/documentation%20green%20button.png" width="143">
25
+ </a>
26
+ </div>
27
+ </div>
28
+
29
+ <!-- # Kimi-Dev -->
30
+
31
+ <div align="center">
32
+ <img src="./assets/main_logo.png" alt="Kimi Logo" width="400" />
33
+ <h2><a href="https://moonshotai.github.io/Kimi-Dev/">
34
+ Introducing Kimi-Dev: <br>A Strong and Open-source Coding LLM for Issue Resolution</a></h2>
35
+ </a></h2>
36
+ <b>Kimi-Dev Team</b>
37
+ <br>
38
+
39
+ </div>
40
+ <div align="center">
41
+ <a href="">
42
+ <b>📄 Tech Report (Coming soon...)</b>
43
+ </a> &nbsp;|&nbsp;
44
+ <a href="https://github.com/MoonshotAI/Kimi-Dev">
45
+ <b>📄 Github</b>
46
+ </a> &nbsp;
47
+ </div>
48
+
49
+ <br>
50
+ <br>
51
+
52
+ <!-- https://github.com/MoonshotAI/Kimi-Dev -->
53
+
54
+ We introduce Kimi-Dev-72B, our new open-source coding LLM for software engineering tasks. Kimi-Dev-72B achieves a new state-of-the-art on SWE-bench Verified among open-source models.
55
+
56
+ - Kimi-Dev-72B achieves 60.4% performance on SWE-bench Verified. It surpasses the runner-up, setting a new state-of-the-art result among open-source models.
57
+
58
+
59
+ - Kimi-Dev-72B is optimized via large-scale reinforcement learning. It autonomously patches real repositories in Docker and gains rewards only when the entire test suite passes. This ensures correct and robust solutions, aligning with real-world development standards.
60
+
61
+
62
+ - Kimi-Dev-72B is available for download and deployment on Hugging Face and GitHub. We welcome developers and researchers to explore its capabilities and contribute to development.
63
+
64
+
65
+ <div align="center">
66
+ <img src="./assets/open_performance_white.png" alt="Kimi Logo" width="600" />
67
+ <p><b>Performance of Open-source Models on SWE-bench Verified.</b></p>
68
+
69
+ </div>
70
+
71
+
72
+
73
+ ## Quick Start
74
+ ```
75
+ from transformers import AutoModelForCausalLM, AutoTokenizer
76
+
77
+ model_name = "moonshotai/Kimi-Dev-72B"
78
+
79
+ model = AutoModelForCausalLM.from_pretrained(
80
+ model_name,
81
+ torch_dtype="auto",
82
+ device_map="auto"
83
+ )
84
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
85
+
86
+ prompt = "Give me a short introduction to large language model."
87
+ messages = [
88
+ {"role": "system", "content": "You are a helpful assistant."},
89
+ {"role": "user", "content": prompt}
90
+ ]
91
+ text = tokenizer.apply_chat_template(
92
+ messages,
93
+ tokenize=False,
94
+ add_generation_prompt=True
95
+ )
96
+ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
97
+
98
+ generated_ids = model.generate(
99
+ **model_inputs,
100
+ max_new_tokens=512
101
+ )
102
+ generated_ids = [
103
+ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
104
+ ]
105
+
106
+ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
107
+
108
+ ```
109
+
110
+ ## Citation
111
+ ```
112
+ @misc{kimi_dev_72b_2025,
113
+ title = {Introducing Kimi-Dev: A Strong and Open-source Coding LLM for Issue Resolution},
114
+ author = {{Kimi-Dev Team}},
115
+ year = {2025},
116
+ month = {June},
117
+ url = {\url{https://www.moonshot.cn/Kimi-Dev}}
118
+ }
119
+ ```