Text Generation
Transformers
PyTorch
Safetensors
English
mistral
text-generation-inference
instruction-pretrain commited on
Commit
8a5e3ac
·
verified ·
1 Parent(s): 6bcbd8a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -1,3 +1,10 @@
 
 
 
 
 
 
 
1
  # Instruction Pre-Training: Language Models are Supervised Multitask Learners
2
  This repo contains the **context-based instruction synthesizer** used in our paper **Instruction Pre-Training: Language Models are Supervised Multitask Learners**.
3
 
@@ -22,7 +29,7 @@ pip install -e .
22
  2. Evalaute
23
  ```bash
24
  MODEL=instruction-pretrain/InstructLM-500M
25
- add_bos_token=True # this flag is needed because lm-eval-harness set add_bos_token to False by default, but ours require add_bos_token to be True
26
 
27
  accelerate launch -m lm_eval --model hf \
28
  --model_args pretrained=${MODEL},add_bos_token=${add_bos_token},dtype=float16 \
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - tiiuae/falcon-refinedweb
5
+ language:
6
+ - en
7
+ ---
8
  # Instruction Pre-Training: Language Models are Supervised Multitask Learners
9
  This repo contains the **context-based instruction synthesizer** used in our paper **Instruction Pre-Training: Language Models are Supervised Multitask Learners**.
10
 
 
29
  2. Evalaute
30
  ```bash
31
  MODEL=instruction-pretrain/InstructLM-500M
32
+ add_bos_token=True # this flag is needed because lm-eval-harness set add_bos_token to False by default, but ours requires add_bos_token to be True
33
 
34
  accelerate launch -m lm_eval --model hf \
35
  --model_args pretrained=${MODEL},add_bos_token=${add_bos_token},dtype=float16 \