cicdatopea
Recent Activity
new activity
4 days ago
kaitchup/QwQ-32B-AutoRoundGPTQ-4bit:Qwen-32B overflow issue
new activity
4 days ago
kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-4bit:how to run this model
updated
a model
5 days ago
OPEA/Llama-3.3-70B-Instruct-int2-sym-inc
cicdatopea's activity
Qwen-32B overflow issue
8
#1 opened 10 days ago
by
cicdatopea

how to run this model
4
#1 opened 4 days ago
by
cicdatopea

without licence
1
#2 opened 7 days ago
by
Futureli
how to inference this model?
1
#1 opened 10 days ago
by
xiximayou
so consider build a model for GPU?
1
#1 opened 11 days ago
by
kq

Your quants are not listed in the base model
2
#2 opened about 1 month ago
by
dazipe
vLLM 0.7.2 starts the model normally, but a simulated curl request blocks and returns no output
1
#2 opened about 1 month ago
by
JZMALi
sglang inference issue
7
#1 opened about 1 month ago
by
su400
Start on cpu with vllm.
1
#1 opened about 2 months ago
by
kuliev-vitaly
"a larger accuracy drop in Chinese tasks"? How much exactly?
1
#1 opened 2 months ago
by
chuangzhidian
A bug when running the demo inference on GPU
1
#5 opened 3 months ago
by
HuggingLianWang
vllm
23
#4 opened 3 months ago
by
NikolaSigmoid
Base model please!
2
#2 opened 3 months ago
by
deltanym

alternative serving framework
2
#1 opened 3 months ago
by
erichartford

Update README.md
#1 opened 3 months ago
by
n1ck-guo
Suggest filling in the base model field in the model card metadata
#1 opened 3 months ago
by
cicdatopea
