Phyo Arkar Lwin
v3ss0n
ยท
AI & ML interests
None yet
Organizations
None yet
I can't run any of the dynamic bnb-4bit quants with TextGenerationInference
2
#6 opened 5 months ago
by
v3ss0n
Request to Release Qwen2.5-Max as Open Source Model
๐
๐ฅ
44
5
#8 opened 6 months ago
by
quantflex

Remove gated access?
2
#25 opened 5 months ago
by
davidmezzetti

fix: strftime_now is unknown (in <string>:1)
8
#17 opened 5 months ago
by
v3ss0n
Why increase censorship?
21
#20 opened 5 months ago
by
notafraud

Request access to the model
1
#22 opened 5 months ago
by
klydekushy
Adding tool call support in chat template
27
#13 opened 5 months ago
by
Navanit-AI

Commit #e969dcf155adde0b0654770948d93d1b2646d3f4 Introduced `strftime_now` and it is unknown in TGI.
๐
1
3
#8 opened 5 months ago
by
v3ss0n
chat template doesn't include tools
๐
4
10
#3 opened 5 months ago
by
copasseron
Add system message to chat template
1
#6 opened 5 months ago
by
Rocketknight1

chat template
๐
1
1
#9 opened 5 months ago
by
lucyknada

llama.cpp / gguf?
3
#3 opened about 1 year ago
by
nacs
Run inference in CPU
3
#1 opened about 1 year ago
by
hythythyt3
Quantized model coming?
๐
6
8
#3 opened about 1 year ago
by
dnhkng

GGUF file request
โ
10
3
#14 opened about 1 year ago
by
MicFizzy