Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kimxxxx
/
mistral_r64_a128_g8_gas8_lr9e-5_4500tk_droplast_nopacking_nooverlapping_2epoch
like
0
Transformers
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
mistral_r64_a128_g8_gas8_lr9e-5_4500tk_droplast_nopacking_nooverlapping_2epoch
Commit History
Upload tokenizer
c8aa2d1
verified
kimxxxx
commited on
Jun 30
Upload MistralForCausalLM
c59f1e5
verified
kimxxxx
commited on
Jun 30
initial commit
712dc71
verified
kimxxxx
commited on
Jun 30