RichardErkhov/mlfoundations-dev_-_hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6-gguf 9B • Updated 21 days ago • 258
RichardErkhov/cutelemonlili_-_MAmmoTH2-8B-Plus_MATH_training_Qwen2.5-32B-Instruct-gguf 8B • Updated 21 days ago • 365
RichardErkhov/mlfoundations-dev_-_oh_teknium_scaling_down_random_0.8-gguf 8B • Updated 21 days ago • 335
RichardErkhov/violetxi_-_ak-prm-sub2k_sft-steptok_lr1e-5_wa0.03_balanced_checkpoint1800-gguf 8B • Updated 21 days ago • 337
RichardErkhov/violetxi_-_ak-prm-full-sft_lr1e-5_wa0.03_balanced_checkpoint3900-gguf 8B • Updated 21 days ago • 337
RichardErkhov/violetxi_-_ak_prm_lr1e-5_wa0.03_balanced_checkpoint6100-gguf 8B • Updated 21 days ago • 321
RichardErkhov/violetxi_-_ak_prm_lr1e-5_wa0.03_balanced_checkpoint5400-gguf 8B • Updated 21 days ago • 350
RichardErkhov/violetxi_-_ak_prm_lr1e-5_wa0.03_balanced_checkpoint2900-gguf 8B • Updated 21 days ago • 337
RichardErkhov/violetxi_-_ak_prm_lr1e-5_wa0.03_balanced_checkpoint2200-gguf 8B • Updated 21 days ago • 334
RichardErkhov/suehyunpark_-_potpourri-8b-inst-fft-induction-bc-trajectory-max10-per-task-check-answer-gguf 8B • Updated 22 days ago • 334