Commit · 480b107
1 Parent(s): 5150184
Update README.md
README.md CHANGED
@@ -12,6 +12,9 @@ pipeline_tag: text-generation
 # VMware/xgen-7b-8k-open-instruct
 Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
 
+<b> NOTE </b> : The model was trained using the Alpaca prompt template <br>
+<b> NOTE </b> : tiktoken library is required for the tokenizer. Set trust_remote_code=True when launching the tokenizer.<br>
+
 We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
 
 
@@ -31,8 +34,6 @@ Subset of COT SUBMIX (FROM FLAN V2) Zeroshot examples
 
 The model supports up to <b>8192 tokens </b>
 
-<b> NOTE </b> : The model was trained using the Alpaca prompt template
-<b> NOTE </b> : tiktoken library is required for the tokenizer. Set trust_remote_code=True when launching the tokenizer.
 
 
 ## License
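
The two notes moved by this commit can be illustrated with a short sketch. It assumes the widely used Alpaca "no input" template wording, which this README names but does not spell out, and it assumes the model id from the README heading. `build_prompt` and `load_tokenizer` are hypothetical helper names, not part of the repository.

```python
# Assumed: the standard Alpaca "no input" template. The README only says
# "Alpaca prompt template", so the exact wording here is an assumption.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_prompt(instruction: str) -> str:
    """Wrap a user instruction in the (assumed) Alpaca template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def load_tokenizer(model_id: str = "VMware/xgen-7b-8k-open-instruct"):
    """Load the tokenizer as the README note describes.

    Requires the `transformers` and `tiktoken` packages and network access;
    imported lazily so the prompt helper works without them.
    """
    from transformers import AutoTokenizer

    # trust_remote_code=True lets the repo's custom tokenizer code run.
    return AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)


if __name__ == "__main__":
    print(build_prompt("Summarize the change in this commit."))
```

Keeping the prompt builder separate from tokenizer loading means the template can be checked offline, while `load_tokenizer()` is only called where the model files and `tiktoken` are available.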