LovelyBuggies commited on
Commit
8180432
·
verified ·
1 Parent(s): bb21273

Add model README

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: Qwen/Qwen2.5-Coder-3B
4
+ tags:
5
+ - code
6
+ - humaneval
7
+ - multi-agent
8
+ - mlgrpo
9
+ - qwen2.5
10
+ library_name: transformers
11
+ pipeline_tag: text-generation
12
+ ---
13
+
14
+ # 2xQwen2.5-Coder-3B-Satyr-Aux
15
+
16
+ This model is a fine-tuned version of **Qwen/Qwen2.5-Coder-3B** using Multi-LLM Group Relative Policy Optimization (MAGRPO) on HumanEval dataset.