CritiqueFineTuning
Collection
The dataset and models for CritiqueFineTuning
•
3 items
•
Updated
Qwen2.5-32B-Instruct-CFT is a 32B parameter model fine-tuned using our novel Critique Fine-Tuning (CFT) approach. Built upon the Qwen2.5-32B-Instruct base model, this variant is trained to critique and analyze responses rather than simply imitate them, leading to enhanced reasoning capabilities.
For more details about the model architecture, methodology, and comprehensive evaluation results, please visit our project webpage.