Human_Eval_RLHF

university

Recent Activity

seungone

authored a paper 25 days ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published 26 days ago • 14

hyungjoochae

authored a paper 25 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

Paper • 2506.02338 • Published 26 days ago • 4

seungone

authored a paper about 1 month ago

Let's Predict Sentence by Sentence

Paper • 2505.22202 • Published May 28 • 17

jinheon

authored a paper about 1 month ago

Knowledge Base Construction for Knowledge-Augmented Text-to-SQL

Paper • 2505.22096 • Published May 28 • 1

seungone

authored 2 papers about 1 month ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21 • 102

FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS

Paper • 2505.16409 • Published May 22 • 2

hyungjoochae

authored 4 papers about 1 month ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 3

Evaluating Robustness of Reward Models for Mathematical Reasoning

Paper • 2410.01729 • Published Oct 2, 2024

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Paper • 2406.14703 • Published Jun 20, 2024 • 2

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21 • 102

seungone

authored a paper about 1 month ago

Reasoning Models Better Express Their Confidence

Paper • 2505.14489 • Published May 20 • 19

DKYoon

authored a paper about 1 month ago

Reasoning Models Better Express Their Confidence

Paper • 2505.14489 • Published May 20 • 19

seungone

authored a paper about 1 month ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15 • 25

jinheon

authored a paper about 1 month ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 70

jinheon

authored a paper about 2 months ago

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published Apr 29 • 62

jinheon

authored 5 papers 2 months ago

Test-Time Self-Adaptive Small Language Models for Question Answering

Paper • 2310.13307 • Published Oct 20, 2023

Knowledge-Augmented Language Model Verification

Paper • 2310.12836 • Published Oct 19, 2023

Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks

Paper • 2305.18395 • Published May 28, 2023 • 1

Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering

Paper • 2306.04136 • Published Jun 7, 2023 • 2

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

Paper • 2105.00666 • Published May 3, 2021