Post
830
EARLY SNEAK PREVIEW: get a first look at the Celestia 3 science-reasoning dataset, built with DeepSeek's newest R1-0528 reasoning model! Subjects include physics, chemistry, biology, computer science, Earth science, astronomy, and information theory.
This early look contains the first 14k rows, all synthetic responses using deepseek-ai/DeepSeek-R1-0528
SEE IT HERE: sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW
Support our releases: sequelbox/SupportOpenSource
Coming up we'll have more dataset releases, including some novel reasoning and analysis methods - we think an important role for open source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.
more to come soon!
allegra
This early look contains the first 14k rows, all synthetic responses using deepseek-ai/DeepSeek-R1-0528
SEE IT HERE: sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW
Support our releases: sequelbox/SupportOpenSource
Coming up we'll have more dataset releases, including some novel reasoning and analysis methods - we think an important role for open source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.
more to come soon!
allegra