Yinxu Pan

cppowboy

AI & ML interests

Code LLM, Function Calling, Code Interpreter, Vision-Language Pretraining, Text-Rich Vision-Language Pretraining

Recent Activity

Organizations

Diffusers Pipelines Library for Stable Diffusion's profile picture OpenBMB's profile picture XAgentCommunity's profile picture

cppowboy's activity

upvoted an article 16 days ago
view article
Article

The N Implementation Details of RLHF with PPO

39
upvoted an article about 2 months ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
24