Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data Paper • 2404.03862 • Published Apr 5, 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees Paper • 2404.08417 • Published Apr 12, 2024 • 1
Dated Data: Tracing Knowledge Cutoffs in Large Language Models Paper • 2403.12958 • Published Mar 19, 2024
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Paper • 2502.13962 • Published 18 days ago • 28
Few-Shot Detection of Machine-Generated Text using Style Representations Paper • 2401.06712 • Published Jan 12, 2024 • 1
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies Paper • 2402.12370 • Published Feb 19, 2024 • 1
Low-Resource Authorship Style Transfer with In-Context Learning Paper • 2212.08986 • Published Dec 18, 2022
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Paper • 2411.14384 • Published Nov 21, 2024 • 9
LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression Paper • 2406.20092 • Published Jun 28, 2024
Every Language Counts: Learn and Unlearn in Multilingual LLMs Paper • 2406.13748 • Published Jun 19, 2024
Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell Paper • 2406.14673 • Published Jun 20, 2024
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF Paper • 2406.07971 • Published Jun 12, 2024
ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers Paper • 2309.16119 • Published Sep 28, 2023 • 1
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models Paper • 2407.05502 • Published Jul 7, 2024
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation Paper • 2406.14764 • Published Jun 20, 2024 • 4