BramVanroy
/

falcon-7b-ft-mc4_nl_cleaned_tiny

Text Generation

text-generation-inference

Model card Files Files and versions

BramVanroy commited on Jul 25, 2023

Commit

b106433

·

1 Parent(s): af0b0ff

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -38,7 +38,8 @@ At 2048 tokens context length, the training set was around 2M (2,008,858) sample
 ## Training procedure
-Trained with LoRA in 4 bit and merged before upload. The adapters are in the `adapters` branch.
 ### Training hyperparameters

 ## Training procedure
+Trained with LoRA targetting `['query_key_value', 'dense', 'dense_h_to_4h', 'dense_4h_to_h']` in 4 bit and merged before upload.
+The adapters are in the `adapters` branch.
 ### Training hyperparameters