Arthur Zucker
ArthurZ
AI & ML interests: None yet
Recent Activity
liked a model 6 days ago: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
upvoted a changelog 12 days ago: Static Spaces can now have a build step
liked a Space 12 days ago: m-ric/beam_search_visualizer
Organizations
ArthurZ's activity
Adding transformers tag for better tracking of library
🚀
🔥
3
1
#2 opened 14 days ago by reach-vb

No attribute `sliding_window`?
2
#59 opened about 2 months ago by farzadab

Does LLama4 have chunked attention in generation phase ?
1
#64 opened about 2 months ago by vanshils
remove <|finetune_right_pad_id|> and change pad_token to <|finetune_right_pad|>
1
#25 opened about 2 months ago by wukaixingxp

pad error
➕
👍
7
8
#25 opened about 2 months ago by bobber
Bug in AutoModel
👍
1
3
#26 opened about 2 months ago by random-checkin

Cannot generate with BS > 1
1
#25 opened about 2 months ago by chenjiel
change to spda
2
#14 opened about 2 months ago by wukaixingxp

Fastest way for inference?
3
#28 opened 4 months ago by psycy
model-00078-of-000163.safetensors not marked safe?
2
#80 opened 4 months ago by aborst

Upload transformers version
10
#3 opened 7 months ago by ArthurZ

Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened 7 months ago by kwen2501
Update model weight
8
#13 opened 8 months ago by nguyen-brat
Update hidden_act to silu
2
#14 opened 8 months ago by ArthurZ

llama.cpp support
👍
🔥
11
9
#1 opened 8 months ago by ayyylol

tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 8 months ago by dahara1
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 8 months ago by Valadaro
hidden_activation vs hidden_act in config.json
2
#10 opened 9 months ago by heheda
How to use safetensors?
2
#13 opened 8 months ago by prathi1729
lamma cpp ht to gguf not working
4
#2 opened 9 months ago by RameshRajamani