SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance Paper • 2406.18118 • Published Jun 26, 2024
Empirical Insights on Fine-Tuning Large Language Models for Question-Answering Paper • 2409.15825 • Published Sep 24, 2024 • 1
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use Paper • 2412.15495 • Published Dec 20, 2024 • 1
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs Paper • 2410.11302 • Published Oct 15, 2024
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric Paper • 2502.17184 • Published Feb 24
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition Paper • 2406.11192 • Published Jun 17, 2024