Vezora
/

Narwhal-7b-v3

Text Generation

text-generation-inference

Model card Files Files and versions Community

Vezora commited on Dec 5, 2023

Commit

b90f7de

·

1 Parent(s): c4f9f49

Update README.md

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -4,6 +4,24 @@ license: apache-2.0
 This is a merge model using Tie merge method.
 Created using openchat 3.5 and una-cybertron-7b-v2-bf16.
 This model is exceptionally well at labeling data, bringing down labeling cost to server cost. Hurray! Here is an example

 This is a merge model using Tie merge method.
 Created using openchat 3.5 and una-cybertron-7b-v2-bf16.
+Instruction template:
+```python
+import transformers
+tokenizer = transformers.AutoTokenizer.from_pretrained("openchat/openchat_3.5")
+# Single-turn
+tokens = tokenizer("GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant:").input_ids
+assert tokens == [1, 420, 6316, 28781, 3198, 3123, 1247, 28747, 22557, 32000, 420, 6316, 28781, 3198, 3123, 21631, 28747]
+# Multi-turn
+tokens = tokenizer("GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant: Hi<|end_of_turn|>GPT4 Correct User: How are you today?<|end_of_turn|>GPT4 Correct Assistant:").input_ids
+assert tokens == [1, 420, 6316, 28781, 3198, 3123, 1247, 28747, 22557, 32000, 420, 6316, 28781, 3198, 3123, 21631, 28747, 15359, 32000, 420, 6316, 28781, 3198, 3123, 1247, 28747, 1602, 460, 368, 3154, 28804, 32000, 420, 6316, 28781, 3198, 3123, 21631, 28747]
+# Coding Mode
+tokens = tokenizer("Code User: Implement quicksort using C++<|end_of_turn|>Code Assistant:").input_ids
+assert tokens == [1, 7596, 1247, 28747, 26256, 2936, 7653, 1413, 334, 1680, 32000, 7596, 21631, 28747]
+```
 This model is exceptionally well at labeling data, bringing down labeling cost to server cost. Hurray! Here is an example