DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Paper • 2502.20730 • Published 10 days ago • 32
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations Paper • 2410.22821 • Published Oct 30, 2024 • 2
Iterative Forward Tuning Boosts In-Context Learning in Language Models Paper • 2305.13016 • Published May 22, 2023 • 1
CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment Paper • 2310.16271 • Published Oct 25, 2023 • 1
From Skepticism to Acceptance: Simulating the Attitude Dynamics Toward Fake News Paper • 2403.09498 • Published Mar 14, 2024 • 1
Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer Paper • 2403.19979 • Published Mar 29, 2024 • 1
Aligning Logits Generatively for Principled Black-Box Knowledge Distillation Paper • 2205.10490 • Published May 21, 2022 • 1
Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning Paper • 2403.19962 • Published Mar 29, 2024 • 1
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents Paper • 2406.14884 • Published Jun 21, 2024 • 1
SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published Jan 3, 2025 • 18
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Paper • 2501.04561 • Published Jan 8, 2025 • 16
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 80
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Paper • 2411.10669 • Published Nov 16, 2024 • 10
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Paper • 2411.06208 • Published Nov 9, 2024 • 20
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 24
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs Paper • 2410.18451 • Published Oct 24, 2024 • 17
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper • 2410.08815 • Published Oct 11, 2024 • 48