Daniel Han-Chen
danielhanchen
AI & ML interests
None yet
Recent Activity
updated
a model
28 minutes ago
unsloth/DeepSeek-V3-0324-GGUF
updated
a dataset
about 4 hours ago
unsloth/precompiled_llama_cpp
published
a dataset
about 6 hours ago
unsloth/precompiled_llama_cpp
Organizations
danielhanchen's activity
Trying to fine tune this with LoRA
3
#36 opened 8 days ago
by
abcampbell
Can't run in llama.cpp, wrong tensor shape
13
#1 opened 11 days ago
by
bartowski

Is this model native 128K context length, or YaRN extended?
7
#28 opened 19 days ago
by
danielhanchen

LM Studio vs llama.cpp different results?
6
#5 opened 17 days ago
by
urtuuuu
recommended generation parameters
6
#5 opened 20 days ago
by
erichartford

QwQ-32B-Q5_K_M Cyclically thinking
9
#2 opened 19 days ago
by
yorktown
Is `rms_norm_eps` 1e-5 or 1e-6
#9 opened 20 days ago
by
danielhanchen

EOS token should be <|end|>
3
#1 opened 23 days ago
by
Mungert

Are the Q4 and Q5 models R1 or R1-Zero
18
#2 opened 2 months ago
by
gng2info
fix position embeddings
3
#1 opened 2 months ago
by
PatentPilotAI
I loaded DeepSeek-V3-Q5_K_M up on my 10yrs old old Tesla M40 (Dell C4130)
3
#8 opened 2 months ago
by
gng2info
Suggested tokenizer changes by Unsloth.ai
7
#21 opened 2 months ago
by
gugarosa

Getting error with Q3-K-M
7
#2 opened 3 months ago
by
alain401
Advice on running llama-server with Q2_K_L quant
3
#6 opened 3 months ago
by
vmajor

llama.cpp cannot load Q6_K model
5
#3 opened 3 months ago
by
vmajor

Big thanks for these "without original" uploads!
1
#1 opened 4 months ago
by
jukofyork

Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 7 months ago
by
fullstack
No module named 'triton'
1
#3 opened 6 months ago
by
NeelM0906
update base_model
#1 opened 7 months ago
by
davanstrien
