CombinHorizon
AI & ML interests
language models, speech to text
Recent Activity
new activity
about 2 months ago
CombinHorizon/zetasepic-abliteratedV2-Qwen2.5-32B-Inst-BaseMerge-TIES:Invalid LLM Leaderboard results
new activity
about 2 months ago
open-llm-leaderboard/open_llm_leaderboard:failed models, check logs
new activity
4 months ago
openlifescienceai/open_medical_llm_leaderboard:Model evaluation and submission stuck of LB.
Organizations
None yet
CombinHorizon's activity
New activity in CombinHorizon/zetasepic-abliteratedV2-Qwen2.5-32B-Inst-BaseMerge-TIES
about 2 months ago
Invalid LLM Leaderboard results
1
#1 opened 3 months ago by cmp-nct

failed models, check logs
1
#1062 opened 4 months ago by CombinHorizon
Model evaluation and submission stuck of LB.
17
#17 opened 11 months ago by abideen

Unable submit this model to the LLM leaderboard (tokenizers issue)
2
#3 opened 4 months ago by CombinHorizon
14B model detected as 7B
11
#1049 opened 5 months ago by djuna

Resubmitting a model to use `chat_template` doesn't re-evaluate, but does change `chat_template` column
1
3
#1066 opened 4 months ago by xzuyn
Update leaderboard so can support newer models? (Can't submit newer Qwen2.5, gemma, phi models)
4
#3 opened 10 months ago by CombinHorizon
Feature Request: change request file format to disambiguate chat and non-chat models?
1
4
#954 opened 7 months ago by CombinHorizon
Suggestion: Adding outlier-resistant averaging methods
10
#968 opened 7 months ago by zelk12
[Important Notice] Evaluation of Submitted Models
1
#89 opened 9 months ago by Chanjun

Adding Evaluation Results
#1 opened 5 months ago by leaderboard-pr-bot

Which models do you want to see on here?
2
14
#2 opened 6 months ago by kaikaidai

Repeated failures of various running models
3
#6 opened 9 months ago by CombinHorizon
Adding Evaluation Results
#1 opened 6 months ago by CombinHorizon
What is the current status for the leaderboard? (H-CLCC, and any recent results?)
2
#7 opened 8 months ago by CombinHorizon
Eval time vs. score diagram
1
4
#950 opened 7 months ago by HenkPoley
Normalization for MMLU-Pro doesn't make sense
11
#947 opened 7 months ago by ekurtic
