Shentao Yang

shentaoyang

https://scholar.google.com/citations?hl=en&user=jxxSLbkAAAAJ&view_op=list_works

AI & ML interests

Generative AI, Large Language Models, RLHF, RLAIF, Reinforcement Learning

Organizations

None yet

Recent Activity

authored a paper about 1 month ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7, 2025

authored 3 papers 8 months ago

Preference-grounded Token-level Guidance for Language Model Fine-tuning

Paper • 2306.00398 • Published Jun 1, 2023

A Dense Reward View on Aligning Text-to-Image Diffusion with Preference

Paper • 2402.08265 • Published Feb 13, 2024

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

Paper • 2501.02790 • Published Jan 6, 2025