feihu.hf committed
Commit 19a986b
Parent(s): d85b5e3
update README & LICENSE
README.md CHANGED
````diff
@@ -1,5 +1,6 @@
 ---
 license: apache-2.0
+license_link: https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B/blob/main/LICENSE
 language:
 - en
 base_model:
@@ -9,8 +10,8 @@ library_name: transformers
 tags:
 - code
 - qwen
-- codeqwen
 - qwen-coder
+- codeqwen
 ---
 
 # Qwen2.5-Coder-1.5B
@@ -23,6 +24,7 @@ Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (
 - A more comprehensive foundation for real-world applications such as **Code Agents**. Not only enhancing coding capabilities but also maintaining its strengths in mathematics and general competencies.
 - **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
 
+
 **This repo contains the 1.5B Qwen2.5-Coder model**, which has the following features:
 - Type: Causal Language Models
 - Training Stage: Pretraining & Post-training
@@ -31,7 +33,7 @@ Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (
 - Number of Paramaters (Non-Embedding): 1.31B
 - Number of Layers: 28
 - Number of Attention Heads (GQA): 12 for Q and 2 for KV
-- Context Length: Full 32,768 tokens
+- Context Length: Full 32,768 tokens
 
 **We do not recommend using base language models for conversations.** Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., or fill in the middle tasks on this model.
 
@@ -46,6 +48,7 @@ With `transformers<4.37.0`, you will encounter the following error:
 KeyError: 'qwen2'
 ```
 
+
 ## Evaluation & Performance
 
 Detailed evaluation results are reported in this [📑 blog](https://qwenlm.github.io/blog/qwen2.5-coder/).
@@ -70,6 +73,4 @@ If you find our work helpful, feel free to give us a cite.
 journal={arXiv preprint arXiv:2407.10671},
 year={2024}
 }
-```
-
-
+```
````
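For context on the usage note preserved in the diff (the `KeyError: 'qwen2'` raised by `transformers<4.37.0`, and the advice against chat-style use of the base model), here is a minimal loading-and-completion sketch. It assumes `transformers>=4.37.0` is installed and uses the `Qwen/Qwen2.5-Coder-1.5B` checkpoint this commit belongs to; it is an illustration, not part of the committed README.

```python
# Minimal sketch: load the base checkpoint and run plain code completion.
# Assumes transformers>=4.37.0; older versions raise KeyError: 'qwen2'
# because the qwen2 architecture is not registered there.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-Coder-1.5B"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Base models are not tuned for conversation; plain completion or
# fill-in-the-middle prompting is the intended use per the README.
inputs = tokenizer("def fibonacci(n):", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```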