None
Thireus
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
DavidAU/Qwen3-4B-Q8_0-64k-128k-256k-context-GGUF
new activity
4 days ago
unsloth/Qwen3-32B-GGUF:Potentially still broken?
liked
a model
4 days ago
unsloth/Qwen3-32B-128K-GGUF
Organizations
None yet
Thireus's activity
Potentially still broken?
3
13
#8 opened 12 days ago
by
qenthousiast
Brainstorming
4
5
#6 opened 11 days ago
by
Downtown-Case
UD version not doing great with YaRN compared to non-UD of the same size
4
#4 opened 13 days ago
by
Thireus
PPL vs model size - safe to assume larger size == better accuracy regardless of UD vs non-UD?
3
#6 opened 15 days ago
by
Thireus
Potential issue with large context sizes - can someone confirm?
13
#18 opened 14 days ago
by
Thireus
Qwen is loosing broad knowledge since Qwen2.
10
14
#16 opened 15 days ago
by
phil111
YaRN not enabled correctly
1
1
#3 opened 15 days ago
by
CISCai
Any chance of a 128k version so we can use it as a draft model for the larger 128k models?
3
8
#3 opened 14 days ago
by
smcleod

Could we have the 128K variants as well please?
1
#1 opened 11 days ago
by
Thireus
Context size? YaRN still supported?
2
#3 opened 11 days ago
by
Thireus
CORRECTION: THIS SYSTEM MESSAGE IS ***PURE GOLD***!!!
17
16
#33 opened 8 months ago
by
jukofyork

FP8 Checkpoint version size mismatch?
2
#15 opened 9 months ago
by
Thireus
What is the "system" prompt?
3
#13 opened about 1 year ago
by
kk3dmax
First Word Ignored Issue / Single Word Instruction
1
20
#11 opened about 1 year ago
by
pandora-s

Prompt format
1
#12 opened about 1 year ago
by
Thireus
Incomplete Answers
7
#59 opened over 1 year ago
by
samparksoftwares
Higher PPL than Mixtral?
3
#11 opened over 1 year ago
by
Thireus
exl2-2 please?
8
#1 opened over 1 year ago
by
Thireus
Safetensors version?
6
#2 opened over 1 year ago
by
matatonic
tokenization_yi.py
6
#1 opened over 1 year ago
by
Stilgar