Michael Han
shimmyshimmer
Recent Activity
new activity
about 4 hours ago
unsloth/Meta-Llama-3.1-8B-Instruct:How to merge these 4 split model files?
liked
a model
1 day ago
mlx-community/Unsloth-Phi-4-4bit
liked
a model
1 day ago
codelion/Llama-3.2-3B-o1-lora
shimmyshimmer's activity
How to merge these 4 split model files?
1
#3 opened 2 days ago
by
AnsonTeng
Am I the only person using this?
1
#1 opened 1 day ago
by
patruff
What is the required GPU size to run Is a 4090 possible and does it support ollama
9
#5 opened 9 days ago
by
sminbb
fix position embeddings
3
#1 opened 4 days ago
by
PatentPilotAI
Is there a fine-tuning cookbook for phi4 ?
2
#27 opened 3 days ago
by
sbhctashi
Encountering Unknown quantization type, got fp8 - supported types are: XXXXX
2
#1 opened 4 days ago
by
ivanmanu
First review, Q5-K-M require 502Gb RAM, better than Meta 405billions
5
#11 opened 5 days ago
by
krustik
Issue with --n-gpu-layers 5 Parameter: Model Only Running on CPU
5
#10 opened 6 days ago
by
vuk123
I'm new to GGUF quants
1
#9 opened 6 days ago
by
fsaudm
why use q5 for key cache?
1
#7 opened 8 days ago
by
CHNtentes
Are these imatrix GGUF quants?
4
#1 opened 10 days ago
by
Kearm
llama.cpp cannot load Q6_K model
5
#3 opened 9 days ago
by
vmajor
Getting error with Q3-K-M
7
#2 opened 10 days ago
by
alain401
More info about this model?
5
#1 opened 2 months ago
by
sirus
Model card is 405B and not 70B
1
#1 opened 18 days ago
by
Spestly
RAM requirements for running Llama-3.3-70B-Instruct-Q5_K_M.gguf
1
#4 opened 20 days ago
by
hyadav22
NameError: name 'CohereLayerNorm' is not defined
2
#1 opened 29 days ago
by
joelniklaus
Strange behaviour of Llama3.2-vision - it behaves like text model
1
#9 opened 29 days ago
by
jirkazcech
'LlamaForCausalLM' object has no attribute 'max_seq_length'
3
#8 opened 6 months ago
by
AronVic
Can you post the script that was used to quantize this model please?
10
#2 opened 4 months ago
by
ctranslate2-4you