Thomas Gauthier-Caron PRO
thomasgauthier
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
thomasgauthier/csm-1b-hf
new activity
13 days ago
sesame/csm-1b:PSA: HF transformers implementation open sourced (with Trainer support)
published
a model
13 days ago
thomasgauthier/csm-1b-hf
Organizations
thomasgauthier's activity
PSA: HF transformers implementation open sourced (with Trainer support)
3
1
#39 opened 13 days ago
by
thomasgauthier

Setup Discussion
2
#1 opened 6 months ago
by
CharlesCXK
Why is the size of pruned model bigger than the original ones after 24 layers been sliced?
4
#1 opened about 1 year ago
by
iheardyoulooking
Base Model or Finetuned Version?
13
#2 opened about 1 year ago
by
jphme

Lora adapter version
2
#15 opened over 1 year ago
by
thomasgauthier

Prompt format?
3
#1 opened over 1 year ago
by
thomasgauthier

Mistral tokenizer
2
#2 opened over 1 year ago
by
thomasgauthier

LoraConfig's target_modul with peft ?
8
#10 opened over 1 year ago
by
Handgun1773