Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

wangrongsheng
/
LLM

Model card Files Files and versions
xet
Community
LLM / ppo-lora
211 MB
  • 2 contributors
History: 1 commit
wangrongsheng
commit from root
42de9a6 over 2 years ago
  • checkpoint-1000
    commit from root over 2 years ago
  • checkpoint-2000
    commit from root over 2 years ago
  • checkpoint-3000
    commit from root over 2 years ago
  • checkpoint-4000
    commit from root over 2 years ago
  • checkpoint-5000
    commit from root over 2 years ago
  • checkpoint-6000
    commit from root over 2 years ago
  • checkpoint-7000
    commit from root over 2 years ago
  • reward
    commit from root over 2 years ago
  • README.md
    27 Bytes
    commit from root over 2 years ago
  • adapter_config.json
    412 Bytes
    commit from root over 2 years ago
  • adapter_model.bin
    26.3 MB
    xet
    commit from root over 2 years ago
  • finetuning_args.json
    235 Bytes
    commit from root over 2 years ago
  • trainer_log.jsonl
    154 kB
    commit from root over 2 years ago
  • trainer_state.json
    103 kB
    commit from root over 2 years ago
  • training_args.bin
    3.27 kB
    xet
    commit from root over 2 years ago
  • training_loss.png
    54.7 kB
    commit from root over 2 years ago
  • training_reward.png
    61.3 kB
    commit from root over 2 years ago
  • value_head.bin
    21.5 kB
    xet
    commit from root over 2 years ago