Simple and Effective Masked Diffusion Language Models Paper • 2406.07524 • Published Jun 11, 2024 • 10
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 30 days ago • 85
A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions Paper • 2401.00536 • Published Dec 31, 2023 • 2
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published Dec 13, 2024 • 139
ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation Paper • 2407.19835 • Published Jul 29, 2024 • 21
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names Paper • 2408.00298 • Published Aug 1, 2024 • 10
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition Paper • 2407.13559 • Published Jul 18, 2024 • 14