Yedidia AGNIMO

YedsonUQ

AI & ML interests

[Uncertainty Quantification, "Hallucinations"] in LLMs, Federated Learning

Recent Activity

updated a collection 1 day ago

Hallucination

updated a collection 9 days ago

Uncertainty Quantification

Organizations

None yet

YedsonUQ's activity

upvoted a paper 8 days ago

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published 9 days ago • 15

upvoted a paper 9 days ago

MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs

Paper • 2505.24858 • Published 11 days ago • 17

upvoted 2 papers 22 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 25 days ago • 119

Qwen3 Technical Report

Paper • 2505.09388 • Published 28 days ago • 187

upvoted a paper 28 days ago

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

Paper • 2505.07591 • Published 30 days ago • 10

upvoted 2 papers about 1 month ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 170

Cost-of-Pass: An Economic Framework for Evaluating Language Models

Paper • 2504.13359 • Published Apr 17 • 5

upvoted a paper about 2 months ago

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23 • 30

upvoted 6 papers 3 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 165

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

upvoted 6 papers 4 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 126

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 100

Linear Correlation in LM's Compositional Generalization and Hallucination

Paper • 2502.04520 • Published Feb 6 • 11

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 218

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 122

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59