Update README.md
README.md CHANGED

```diff
@@ -39,7 +39,7 @@ We provide 6 variants (of which 3 base and 3 instruction-tuned models):
 - **Llama-3-ChocoLlama-8B-base** ([link](https://huggingface.co/ChocoLlama/Llama-3-ChocoLlama-8B-base)): A language-adapted version of Meta's Llama-3-8B, fine-tuned on the same Dutch dataset as ChocoLlama-2-7B-base, again using LoRA.
 - **Llama-3-ChocoLlama-instruct** ([link](https://huggingface.co/ChocoLlama/Llama-3-ChocoLlama-8B-instruct)): An instruction-tuned version of Llama-3-ChocoLlama-8B-base, fine-tuned on the same dataset as ChocoLlama-2-7B-instruct, again using SFT followed by DPO.
 
-For benchmark results for all models, including comparisons with their base models and other Dutch LLMs, we refer to our paper [here](
+For benchmark results for all models, including comparisons with their base models and other Dutch LLMs, we refer to our paper [here](https://arxiv.org/pdf/2412.07633).
 
 ### Model Description
 
@@ -51,8 +51,8 @@ For benchmark results for all models, including comparisons with their base models
 
 ### Model Sources
 
-- **Repository:**
-- **Paper:**
+- **Repository:** [on GitHub here](https://github.com/ChocoLlamaModel/ChocoLlama).
+- **Paper:** [on arXiv here](https://arxiv.org/pdf/2412.07633).
 
 ## Uses
```