EuroBERT 🇪🇺 Scaling Multilingual Encoders for European Languages. EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 81 EuroBERT/EuroBERT-210m Fill-Mask • Updated Apr 17 • 12.3k • 70 EuroBERT/EuroBERT-610m Fill-Mask • Updated Apr 17 • 8.74k • 29 EuroBERT/EuroBERT-2.1B Fill-Mask • Updated Apr 17 • 1.51k • 50
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 81
LLMs Distillation The ULD loss, based on optimal transport, enables distillation across different LLM families without requiring shared tokenizers. Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Paper • 2402.12030 • Published Feb 19, 2024 mistralai/Mistral-7B-Instruct-v0.2 Text Generation • Updated Sep 27, 2024 • 1.83M • • 2.79k meta-llama/Llama-2-7b-chat-hf Text Generation • Updated Apr 17, 2024 • 1.17M • 4.44k EleutherAI/pythia-160m-deduped Text Generation • Updated Jul 9, 2023 • 42.9k • 3
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Paper • 2402.12030 • Published Feb 19, 2024
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 11
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 16
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 9
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 25
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 8
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 12
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 13
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 9
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss Text Generation • Updated Feb 19, 2024 • 18
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher Text Generation • Updated Feb 19, 2024 • 21
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k Viewer • Updated Mar 13, 2024 • 50.5k • 12
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad Viewer • Updated Mar 13, 2024 • 87.6k • 26