Update README.md
pipeline_tag: translation
---

# Plume32k

## Table of Contents
<details>
<summary>Click to expand</summary>

- [Model description](#model-description)
- [Intended uses and limitations](#intended-uses-and-limitations)
- [How to use](#how-to-use)
- [Training](#training)
- [Evaluation](#evaluation)
- [Citation](#citation)
- [Additional information](#additional-information)

</details>

## Summary

Plume is the first LLM trained from scratch for Neural Machine Translation using only parallel Catalan-centric data. It is a language model with the same architecture as Gemma 2B, trained for general sentence-level translation tasks. For more information about the training, architecture, and interpretability of the model, check out the paper: "Investigating the translation capabilities of Large Language Models trained on parallel data only". The preprint is available on [arXiv]().
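
As a quick orientation ahead of the [How to use](#how-to-use) section, the sketch below shows one plausible way to run the model for translation with Hugging Face Transformers. The Hub id `projecte-aina/Plume32k` and the prompt format (source sentence followed by a target-language cue) are assumptions for illustration only, not confirmed by this card; defer to the How to use section for the actual usage.

```python
# Minimal sketch: sentence-level translation with a decoder-only LLM via
# Hugging Face Transformers. The model id and prompt format are assumptions;
# see the card's "How to use" section for the confirmed instructions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "projecte-aina/Plume32k"  # assumed Hub id, not confirmed by this card

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.eval()

# Assumed prompt format: the Catalan source sentence followed by a
# target-language cue, letting the model continue with the translation.
prompt = "Les oficines estaran tancades dilluns.\nEnglish:"

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64, num_beams=5)

# Decode only the generated continuation, skipping the prompt tokens.
generated = output_ids[0][inputs["input_ids"].shape[1]:]
print(tokenizer.decode(generated, skip_special_tokens=True))
```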