Jaron
JaronTHU
AI & ML interests
None yet
Organizations
JaronTHU's activity
Fast Tokenizer
2
#17 opened 5 months ago
by
JaronTHU
Question about lm_head weights in Gemma-2-9b-it model
2
#34 opened 11 months ago
by
mjkmain

Fails to generate with `inputs_embeds`
2
#18 opened 11 months ago
by
JaronTHU
"It is strongly recommended to train Gemma2 models with the `eager` attention implementation "
๐
3
2
#10 opened 12 months ago
by
JaronTHU