- Arcee's MergeKit: A Toolkit for Merging Large Language Models
  Paper • 2403.13257 • Published • 20
- Model Stock: All we need is just a few fine-tuned models
  Paper • 2403.19522 • Published • 12
- Mergenetic: a Simple Evolutionary Model Merging Library
  Paper • 2505.11427 • Published • 12
- Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
  Paper • 2410.01335 • Published • 5
Yamata Zen
yamatazen
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago: bartowski/google_gemma-3-12b-it-GGUF
liked a Space 1 day ago: alexnasa/Chain-of-Zoom
liked a model 2 days ago: arcee-ai/Homunculus
Organizations
None yet
Collections
6
- Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
  Paper • 2306.06688 • Published
- Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs
  Paper • 2412.14471 • Published
- Language Models' Factuality Depends on the Language of Inquiry
  Paper • 2502.17955 • Published • 34
- Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
  Paper • 2410.01335 • Published • 5
Models
115

yamatazen/LorablatedStock-12B
Text Generation • Updated • 62 • 3

yamatazen/FusionEngine-12B-Lorablated
Text Generation • Updated • 66 • 2

yamatazen/HMS-Fusion-12B-Lorablated
Text Generation • Updated • 57 • 1

yamatazen/ForgottenMaid-12B-Lorablated
Text Generation • Updated • 27 • 1

yamatazen/Shisa-v2-Mistral-Nemo-12B-Lorablated
Text Generation • Updated • 15 • 2

yamatazen/ForgottenMaid-12B-LoRA-Rank128
Updated • 23 • 1

yamatazen/Gemma2-Ataraxy-Psycho-9B
Text Generation • Updated • 25 • 1

yamatazen/FusionEngine-12B
Text Generation • Updated • 104 • 2

yamatazen/HMS-Fusion-12B
Text Generation • Updated • 42 • 3

yamatazen/Shisa-DellaTest-12B
Text Generation • Updated • 9 • 1
Datasets
0
None public yet