https://huggingface.co/posts/Reality123b/379097737205276 remember this dataset? im bumping the example count to approx 23 million prompt-response pairs and ofc. it is going to be a hybrid reasoning, well, it isnt programmatically hybrid reasoning but it is that it is going to use CoT whenever necessary and it doesnt when it doesnt seem like it doesnt need