Collections

2

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 38
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 82
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 82

-

Understanding the planning of LLM agents: A survey

Paper • 2402.02716 • Published Feb 5 • 1
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65
LLM Multi-Agent Systems: Challenges and Open Problems

Paper • 2402.03578 • Published Feb 5
CACA Agent: Capability Collaboration based AI Agent

Paper • 2403.15137 • Published Mar 22

Chain-of-Verification Reduces Hallucination in Large Language Models

Adapting Large Language Models via Reading Comprehension

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Language Modeling Is Compression

Understanding the planning of LLM agents: A survey

LLM Agent Operating System

LLM Multi-Agent Systems: Challenges and Open Problems

CACA Agent: Capability Collaboration based AI Agent

Communicative Agents for Software Development

Self-Refine: Iterative Refinement with Self-Feedback

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

ReAct: Synergizing Reasoning and Acting in Language Models

Design2Code: How Far Are We From Automating Front-End Engineering?

Wukong: Towards a Scaling Law for Large-Scale Recommendation

StarCoder: may the source be with you!

Exploring Parameter-Efficient Fine-Tuning Techniques for Code Generation with Large Language Models

CodeBERT: A Pre-Trained Model for Programming and Natural Languages

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

CodeFusion: A Pre-trained Diffusion Model for Code Generation

CodePlan: Repository-level Coding using LLMs and Planning

Chain-of-Thought Reasoning Without Prompting

How to Train Data-Efficient LLMs

BitDelta: Your Fine-Tune May Only Be Worth One Bit

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

CodePlan: Repository-level Coding using LLMs and Planning

MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning

StarCoder: may the source be with you!

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

SantaCoder: don't reach for the stars!

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Creative Robot Tool Use with Large Language Models

CodeCoT and Beyond: Learning to Program and Test like a Developer

Lemur: Harmonizing Natural Language and Code for Language Agents

CodePlan: Repository-level Coding using LLMs and Planning

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Adapting Large Language Models via Reading Comprehension

Democratizing Reasoning Ability: Tailored Learning from Large Language Model