Marc Sun
marcsun13
AI & ML interests
LLM, Quantization, Training, Inference
Recent Activity
upvoted an article about 16 hours ago
Fine-tuning Llama 2 70B using PyTorch FSDP
liked a model 6 days ago
deepseek-ai/DeepSeek-R1-0528
Organizations
marcsun13's activity
add multi_xpu configuration
3
#5 opened 21 days ago
by
faaany
Error accessing Llama4 models: 'does not have any library metadata on the Hub'
3
#39 opened about 1 month ago
by
asharma2024
GPTQ group size effects vRAM usage
2
#18 opened over 1 year ago
by
sgjohnson
Determining Minimum GPU Memory and Input Text Length Calculation in Model Training
1
2
#19 opened over 1 year ago
by
kobe8-24
microsoft/Florence-2-base raises error without any detail
#36 opened 9 months ago
by
fcakyon

For Llama-2 model, I tried to get the memory usage, it says I do not have the access
1
#35 opened 9 months ago
by
AiEvanTao
Improve name of the model detection to allow several URL usages
2
#28 opened about 1 year ago
by
Pablohn26
meta-llama/Meta-Llama-3.1-8B-Instruct ....why not found this
6
#34 opened 9 months ago
by
world-of-ai
Llama3 is not found using link and API token
6
#31 opened about 1 year ago
by
barnaszocs
Question about Gradient calculation
#30 opened about 1 year ago
by
Infinity4B
How to Calculate GPU usage for re-training Microsoft/speechT5_tts ?
1
#27 opened about 1 year ago
by
huggingfacerohit
Unable to Access Gated Model Repository databricks/dbrx-instruct
1
#26 opened about 1 year ago
by
lambda1989
Data Wrong, THUDM/chatglm3-6b
1
#20 opened over 1 year ago
by
YEKANGMING

Analyzing tool for inference
#13 opened over 1 year ago
by
cllatMTK
Dropdown menu of models
3
#2 opened almost 2 years ago
by
Ouz-G

Plz fix it!!!
#33 opened 10 months ago
by
DonDonDon

This space is no longer working
11
3
#38 opened 4 months ago
by
LuisVasquezBSC

Upload folder using huggingface_hub
2
#4 opened 11 months ago
by
marcsun13

Upload folder using huggingface_hub
2
#8 opened 11 months ago
by
marcsun13
