Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,14 @@ That is an improvement of almost 12 points over the starcoder 1b baseline, almos
|
|
32 |
|
33 |
The results (on the human eval benchmark) are on par with other open-source models like StarCoderBase (30.4) StarCoder(33.6) CodeGen-16B-Mono(29.3) while the model being 15 times smaller.
|
34 |
|
35 |
-
It still underperforms compared to other models like CodeLLama (53%)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
36 |
|
37 |
|
38 |
## Disclaimer
|
@@ -43,3 +50,6 @@ It still underperforms compared to other models like CodeLLama (53%) or chat gpt
|
|
43 |
* This model has not been aligned with human preferences and therefore could potentially generate harmful content
|
44 |
* This model has been trained on a dataset generated by ChatGPT 3.5, and you should check the legal status of AI-generated content in your jurisdiction before using it. You should make sure that your usage complies with the OpenAI Terms of Use, in so far as legally applicable.
|
45 |
|
|
|
|
|
|
|
|
32 |
|
33 |
The results (on the human eval benchmark) are on par with other open-source models like StarCoderBase (30.4) StarCoder(33.6) CodeGen-16B-Mono(29.3) while the model being 15 times smaller.
|
34 |
|
35 |
+
It still underperforms compared to other models like CodeLLama (53%) chat gpt 4 (82) or wizard coder (73.2), but these model are more than 30 times bigger.
|
36 |
+
|
37 |
+
## Finetuning details
|
38 |
+
|
39 |
+
We did full parameter fine-tuning and used a Nvidia a40 for 12 hours using a batch size of 128 and a micro-batch size of 8.
|
40 |
+
|
41 |
+
|
42 |
+
To reproduce the training just follow the training instructions in our [open source codebase](https://github.com/jina-ai/textbook)
|
43 |
|
44 |
|
45 |
## Disclaimer
|
|
|
50 |
* This model has not been aligned with human preferences and therefore could potentially generate harmful content
|
51 |
* This model has been trained on a dataset generated by ChatGPT 3.5, and you should check the legal status of AI-generated content in your jurisdiction before using it. You should make sure that your usage complies with the OpenAI Terms of Use, in so far as legally applicable.
|
52 |
|
53 |
+
## Credits
|
54 |
+
|
55 |
+
This model was trained and released by [Jina.ai](https://jina.ai/)
|