AIPlans's Collections

Post Training Versions - Qwen 0.6B

Different versions of Qwen 0.6B, where the only difference between them is the post-training method used. The post-training dataset for every version should be the HH-RLHF dataset.
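Since every version is meant to share the same data, it may help to see how one preference record could be turned into the prompt/chosen/rejected form that many preference-based post-training methods consume. The sketch below is illustrative only: it assumes HH-RLHF-style records, where each comparison is stored as a "chosen" and a "rejected" transcript using "Human:" and "Assistant:" turn markers and differing only in the final assistant reply. The helper name split_preference_pair and the example record are hypothetical and are not taken from the collection or the dataset.

```python
# Hypothetical helper, not part of any library: split an HH-RLHF-style
# record into a shared prompt and the two competing final replies.
# Assumption: turns are marked with "\n\nHuman:" and "\n\nAssistant:",
# and the two transcripts differ only in the last assistant reply.
ASSISTANT_TAG = "\n\nAssistant:"


def split_preference_pair(chosen: str, rejected: str) -> dict:
    """Return the shared prompt plus the chosen and rejected final replies."""
    cut_chosen = chosen.rfind(ASSISTANT_TAG)
    cut_rejected = rejected.rfind(ASSISTANT_TAG)
    if cut_chosen == -1 or cut_rejected == -1:
        raise ValueError("transcript has no assistant turn")

    # Keep the final assistant marker as the end of the prompt.
    end_chosen = cut_chosen + len(ASSISTANT_TAG)
    end_rejected = cut_rejected + len(ASSISTANT_TAG)
    prompt = chosen[:end_chosen]
    if prompt != rejected[:end_rejected]:
        raise ValueError("chosen and rejected do not share a prompt")

    return {
        "prompt": prompt,
        "chosen": chosen[end_chosen:].strip(),
        "rejected": rejected[end_rejected:].strip(),
    }


# Made-up record for illustration only, not taken from the dataset.
example = {
    "chosen": "\n\nHuman: How do I boil an egg?"
              "\n\nAssistant: Simmer it for about ten minutes.",
    "rejected": "\n\nHuman: How do I boil an egg?"
                "\n\nAssistant: I don't know.",
}
pair = split_preference_pair(example["chosen"], example["rejected"])
print(pair["chosen"])
```

Applying the same preprocessing to every run is one way to keep the post-training method as the only variable; the actual pipeline used for these models may differ.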