Article Navigating Korean LLM Research #2: Evaluation Tools By amphora • Oct 23, 2024 • 7
LLM-jp-3 Fine-tuned Models Collection Fine-tuned models in the LLM-jp-3 model series • 5 items • Updated 11 days ago • 2
LLM-jp-3 Pre-trained Models Collection Pre-trained models in the LLM-jp-3 model series • 4 items • Updated 25 days ago • 5
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 140
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency Paper • 2410.07563 • Published Oct 10, 2024 • 2
gemma-2-baku Collection The baku model series are based on the gemma-2 series and have been continually pre-trained on Japanese-specific corpora. • 4 items • Updated Dec 5, 2024 • 3
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated Dec 13, 2024 • 27
Japanese SimCSE Collection Tsukagoshi et al., Japanese SimCSE Technical Report, arXiv 2023. https://arxiv.org/abs/2310.19349 • 5 items • Updated Sep 4, 2024 • 2
llama-3-youko Collection The youko model series are based on the llama-3 series and have been continually pre-trained on Japanese-specific corpora. • 9 items • Updated Dec 5, 2024 • 1
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56
Sarashina Collection Large Language Models developed by SB Intuitions • 8 items • Updated Dec 12, 2024 • 5
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs Paper • 2407.03963 • Published Jul 4, 2024 • 16