Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
144.5
TFLOPS
670
15
189
Arthur Zucker
ArthurZ
Follow
victor's profile picture
Ligeng-Zhu's profile picture
andreaschandra's profile picture
273 followers
·
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Fixing Gradient Accumulation
27 days ago
•
39
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
•
22
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
23
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
8
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistral-community/pixtral-12b
21 days ago
Update model weight
8
#13 opened 25 days ago by
nguyen-brat
New activity in
mistral-community/pixtral-12b
24 days ago
Update hidden_act to silu
2
#14 opened 24 days ago by
ArthurZ
New activity in
rhymes-ai/Aria
about 1 month ago
llama.cpp support
9
#1 opened about 1 month ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
about 1 month ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened about 1 month ago by
dahara1
New activity in
mistral-community/pixtral-12b
about 2 months ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened about 2 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
about 2 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened about 2 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
about 2 months ago
How to use safetensors?
2
#13 opened about 2 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
about 2 months ago
lamma cpp ht to gguf not working
4
#2 opened about 2 months ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8-kv-heads
8
#14 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
Update config.json
#17 opened 3 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened 3 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8 kv heads
2
#13 opened 3 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
8-kv-heads
#15 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
3 months ago
8-kv-heads
3
#21 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
3 months ago
8-kv-heads
4
#17 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
Updated eos_token to include multiple IDs
1
#14 opened 3 months ago by
vontimitta
New activity in
meta-llama/Llama-3.1-405B-FP8
4 months ago
Update tokenizer to prepend special token
#12 opened 4 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
4 months ago
Update tokenizer to prepend special token
1
#11 opened 4 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-8B-Instruct
4 months ago
Upload tokenizer
2
#29 opened 4 months ago by
ArthurZ
Upload tokenizer
#28 opened 4 months ago by
ArthurZ
Load more