Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2, 2024 • 14
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 151
QTSumm: A New Benchmark for Query-Focused Table Summarization Paper • 2305.14303 • Published May 23, 2023
Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model Paper • 2209.11477 • Published Sep 23, 2022
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples Paper • 2210.12374 • Published Oct 22, 2022
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 74
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 10
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17, 2024 • 16
SUPERB: Speech processing Universal PERformance Benchmark Paper • 2105.01051 • Published May 3, 2021 • 1
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Paper • 2203.04911 • Published Mar 9, 2022
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings Paper • 2204.10298 • Published Apr 21, 2022 • 1