prav719/DeepSeek-R1-Distill-Qwen-32B-flash-attention-2_H100
Tags: Transformers, TensorBoard, Safetensors, Generated from Trainer, trl, sft
main
Commit History
Model save
80691f4
verified
prav719
committed on
Feb 24
Training in progress, epoch 3
33fd8c0
verified
prav719
committed on
Feb 24
Training in progress, epoch 1
6eec95b
verified
prav719
committed on
Feb 24
initial commit
50c5df3
verified
prav719
committed on
Feb 23