princeton-nlp/Sheared-LLaMA-1.3B-Pruned

princeton-nlp committed on Jan 23

Commit 300c0d8 • 1 Parent(s): 3d8fd96

Update README.md

Files changed (1)

README.md +5 -1

@@ -11,4 +11,8 @@ license: llama2
 **License**: Must comply with license of Llama2 since it's a model derived from Llama2.
-Sheared-LLaMA-2.7B-Pruned is the model pruned from [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) **without continued pre-training**. We used roughly 0.4B tokens to perform the pruning experiment.
+Sheared-LLaMA-1.3B-Pruned is the model pruned from [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) **without continued pre-training**.
+We used roughly 0.4B tokens to perform the pruning experiment. This model could be useful for studying:
+- effective data mixtures for continued pre-training
+- comparisons to other pruning techniques
+- extensive evaluations to understand how pruning affects knowledge and reasoning capabilities of LLMs