Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 151
Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2, 2024 • 14
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 10
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17, 2024 • 16
Self-Specialization: Uncovering Latent Expertise within Large Language Models Paper • 2310.00160 • Published Sep 29, 2023
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding Paper • 2403.00425 • Published Mar 1, 2024 • 1
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future Paper • 1906.01702 • Published Jun 4, 2019
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings Paper • 2204.10298 • Published Apr 21, 2022 • 1
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Paper • 2309.10814 • Published Sep 19, 2023 • 3
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning Paper • 2303.05670 • Published Mar 10, 2023 • 1
Cooperative Self-training of Machine Reading Comprehension Paper • 2103.07449 • Published Mar 12, 2021 • 1
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper • 2309.03883 • Published Sep 7, 2023 • 35