Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mesolitica
's Collections
Audio Language Model
Malaysian Reasoning
Malaysian Finetuned Instruct LoRA
Malaysian Speech-to-Text
Malaysian Text-to-Speech
Malaysian Translation
Malaysian pretraining dataset
Malaysian instruction dataset
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM
Malaysian Reasoning
updated
about 1 month ago
Full parameter post training using SFT warmup and GRPO.
Upvote
1
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-SFT
2B
•
Updated
Jun 18
•
3
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-GRPO
2B
•
Updated
Jun 18
•
2
mesolitica/Malaysian-Qwen2.5-7B-Reasoning-SFT
8B
•
Updated
Jun 18
•
612
•
1
mesolitica/Malaysian-Qwen2.5-7B-Dialect-Reasoning-GRPO
8B
•
Updated
Jun 4
•
5
•
3
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-SFT
15B
•
Updated
Jun 18
•
565
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-GRPO
15B
•
Updated
Jun 18
•
9
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections