ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Paper • 2404.02893 • Published Apr 3, 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published Jun 18, 2024
LongReward: Improving Long-context Large Language Models with AI Feedback Paper • 2410.21252 • Published Oct 28, 2024
Does RLHF Scale? Exploring the Impacts From Data, Model, and Method Paper • 2412.06000 • Published Dec 8, 2024
A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models Paper • 2405.18208 • Published May 28, 2024
Can Large Language Model Agents Simulate Human Trust Behaviors? Paper • 2402.04559 • Published Feb 7, 2024