Model Details

  • Architecture: Basic/default GPT-2, decoder only
  • Num params: ~810M
  • Num tokens seen: ~2 B
  • Dataset: PubMed Abstracts subset of The Pile
Downloads last month
12
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support