kaiokendev/superhot-13b-16k-no-rlhf-test

Possibility that Claude/ChatGPT uses similar techniques on adjusting RoPE sampling rate?

1
#4 opened almost 2 years ago by Yhyu13

PPL chart for 16k models?

#3 opened almost 2 years ago by Yhyu13

7B, 33B and 65B versions?

3
#2 opened almost 2 years ago by flashvenom

Difference between this and 8k version?

10
#1 opened almost 2 years ago by flashvenom