Kevin16's Collections
LLM Paperlist
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
•
2406.04692
•
Published
•
56
CRAG -- Comprehensive RAG Benchmark
Paper
•
2406.04744
•
Published
•
45
Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Paper
•
2406.04594
•
Published
•
6
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
Paper
•
2406.04271
•
Published
•
29
4-bit Shampoo for Memory-Efficient Network Training
Paper
•
2405.18144
•
Published
•
9
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper
•
2405.19332
•
Published
•
15
Paper
•
2405.18407
•
Published
•
46
2BP: 2-Stage Backpropagation
Paper
•
2405.18047
•
Published
•
23
Yuan 2.0-M32: Mixture of Experts with Attention Router
Paper
•
2405.17976
•
Published
•
18
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models
Paper
•
2405.18377
•
Published
•
18
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
•
2406.15319
•
Published
•
63
ColPali: Efficient Document Retrieval with Vision Language Models
Paper
•
2407.01449
•
Published
•
43
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation
Paper
•
2406.19215
•
Published
•
30
Visual Haystacks: Answering Harder Questions About Sets of Images
Paper
•
2407.13766
•
Published
•
2