WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Paper • 2312.14187 • Published Dec 20, 2023 • 49
RedStone: Curating General, Code, Math, and QA Data for Large Language Models Paper • 2412.03398 • Published Dec 4, 2024 • 1
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark Paper • 2412.15194 • Published 30 days ago • 1
MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer Paper • 2304.12043 • Published Apr 24, 2023 • 1
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 10 days ago • 9