which one is vllm based? How can one tell? can you mention it in article?
Aslo, are you happen to aware work on using grpo to improve MMLU (or some task inside it) with models like qwen 2.5 7b or even smaller? @prithivMLmods thanks.

commented on Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies about 1 month ago

why there are 2 methods?

updated a model about 1 year ago

OpenCUI/dug-t5base-0.1

Text2Text Generation • Updated Jan 21, 2024 • 5

updated 2 models over 1 year ago

OpenCUI/dug-t5large-0.1

Text2Text Generation • Updated Jan 15, 2024

CUIGuy/flan-t5-base-ecommerce-text-classification

Updated Dec 25, 2023

New activity in togethercomputer/Llama-2-7B-32K-Instruct over 1 year ago

when will have a ggml version?

#3 opened over 1 year ago by

CUIGuy

can not install rotary

#4 opened over 1 year ago by

CUIGuy

New activity in cerebras/btlm-3b-8k-base over 1 year ago

why we can not make this fully HF ready?

#11 opened over 1 year ago by

CUIGuy

why we can not make this fully HF ready?

#11 opened over 1 year ago by

CUIGuy

why we can not make this fully HF ready?

#11 opened over 1 year ago by

CUIGuy