OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper • 2412.02592 • Published Dec 3, 2024 • 22
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 44
Training Socially Aligned Language Models in Simulated Human Society Paper • 2305.16960 • Published May 26, 2023 • 3
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation Paper • 2404.09127 • Published Apr 14, 2024 • 1
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 31
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • Updated May 13, 2024 • 4
GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3 Text Generation • Updated May 12, 2024 • 6
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2 Text Generation • Updated May 12, 2024 • 64
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1 Text Generation • Updated May 12, 2024 • 67
Improving Language Models with Advantage-based Offline Policy Gradients Paper • 2305.14718 • Published May 24, 2023 • 2