fearlessdots committed
Commit 7ab2b01 (parent: 0d31bb3)

Update README.md

Files changed (1):
  1. README.md +12 -4
README.md CHANGED
@@ -34,12 +34,19 @@ The LoRA used to create this model is available at [https://huggingface.co/fearl
 
  ## Fine Tuning
 
+ ### - Quantization Configuration
+
+ - load_in_4bit=True
+ - bnb_4bit_quant_type="fp4"
+ - bnb_4bit_compute_dtype=compute_dtype
+ - bnb_4bit_use_double_quant=False
+
  ### - PEFT Parameters
 
- - lora_alpha=64,
- - lora_dropout=0.05,
- - r=128,
- - bias="none",
+ - lora_alpha=64
+ - lora_dropout=0.05
+ - r=128
+ - bias="none"
 
  ### - Training Arguments
 
@@ -62,6 +69,7 @@ The LoRA used to create this model is available at [https://huggingface.co/fearl
  ## Credits
 
  - Meta ([https://huggingface.co/meta-llama](https://huggingface.co/meta-llama)): for the original Llama-3;
+ - HuggingFace: for hosting this model and for creating the fine-tuning tools used;
  - failspy ([https://huggingface.co/failspy](https://huggingface.co/failspy)): for the base model and the orthogonalization implementation;
  - NobodyExistsOnTheInternet ([https://huggingface.co/NobodyExistsOnTheInternet](https://huggingface.co/NobodyExistsOnTheInternet)): for the incredible dataset;
  - Undi95 ([https://huggingface.co/Undi95](https://huggingface.co/Undi95)) and Sao10k ([https://huggingface.co/Sao10K](https://huggingface.co/Sao10K)): my main inspirations for doing these models =]
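
The quantization list added above maps directly onto the keyword arguments of `BitsAndBytesConfig` from the `transformers` library. A minimal sketch of how it would be assembled; note that the diff references `compute_dtype` without showing its value, so `torch.float16` below is an assumption:

```python
import torch
from transformers import BitsAndBytesConfig

# Assumption: the diff never defines `compute_dtype`; float16 is a
# common choice for 4-bit compute, but the actual value is not shown.
compute_dtype = torch.float16

# 4-bit quantization settings matching the README's list.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # quantize weights to 4 bits on load
    bnb_4bit_quant_type="fp4",             # plain 4-bit floats (the alternative is "nf4")
    bnb_4bit_compute_dtype=compute_dtype,  # dtype used for matmuls at runtime
    bnb_4bit_use_double_quant=False,       # skip nested quantization of the constants
)
```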
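
Likewise, the PEFT parameters correspond to `peft.LoraConfig` keywords; a sketch under the same caveat, with `task_type` added as an assumption (the diff does not show it):

```python
from peft import LoraConfig

lora_config = LoraConfig(
    lora_alpha=64,          # scaling factor applied to the LoRA updates
    lora_dropout=0.05,      # dropout on the LoRA layers during training
    r=128,                  # rank of the low-rank update matrices
    bias="none",            # leave bias terms untrained
    task_type="CAUSAL_LM",  # assumption: standard for Llama-style fine-tuning
)
```

In a typical run, the two configs meet when the base model is loaded with `quantization_config=bnb_config` via `AutoModelForCausalLM.from_pretrained(...)` and then wrapped with `peft.get_peft_model(model, lora_config)` before training.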