anamikac2708 committed 4203da5 (parent: 7381c70): Update README.md
base_model: meta-llama/Meta-Llama-3-8B
This Llama model was trained with Hugging Face's TRL library and NEFTune (https://arxiv.org/abs/2310.05914) on the open-sourced finance dataset https://huggingface.co/datasets/FinLang/investopedia-instruction-tuning-dataset, developed for finance applications by the FinLang team.
The NEFTune paper proposes adding random noise to the embedding vectors of the training data during the forward pass of fine-tuning. As a result, the model overfits less to the specifics of the instruction-tuning dataset, such as formatting details, exact wording, and text length. Instead of collapsing to the exact instruction distribution, the model is better able to provide answers that incorporate the knowledge and behaviors of the pretrained base model.
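The noise injection itself is simple. As a minimal sketch (not the TRL implementation), the paper scales uniform noise by alpha / sqrt(L * d), where L is the sequence length and d the embedding dimension; the function name and the alpha value below are illustrative, not taken from this model's training code:

```python
import numpy as np

def add_neftune_noise(embeddings, alpha=5.0, rng=None):
    """Add NEFTune-style noise to a (seq_len, dim) matrix of token embeddings.

    Noise is drawn from Uniform(-1, 1) and scaled by alpha / sqrt(seq_len * dim),
    following the scaling rule described in the NEFTune paper.
    """
    rng = rng or np.random.default_rng()
    seq_len, dim = embeddings.shape
    scale = alpha / np.sqrt(seq_len * dim)
    # Noise is added only during the training forward pass; inference uses
    # the clean embeddings.
    return embeddings + rng.uniform(-1.0, 1.0, size=embeddings.shape) * scale
```

In practice there is no need to implement this by hand: TRL exposes a `neftune_noise_alpha` argument on its supervised fine-tuning trainer that applies the same scaled noise during training.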
## How to Get Started with the Model