5 30 85

Pu Fanyi

pufanyi

https://pufanyi.github.io

AI & ML interests

Recent Activity

liked a model 6 days ago

ds4sd/docling-models

liked a model 7 days ago

deepseek-ai/DeepSeek-R1

upvoted a paper 12 days ago

Fast Video Generation with Sliding Tile Attention

View all activity

Organizations

pufanyi's activity

liked a model 6 days ago

ds4sd/docling-models

Updated Dec 10, 2024 • 223k • 82

liked a model 7 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 13 days ago • 4.35M • • 9.87k

upvoted a paper 12 days ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published 15 days ago • 46

upvoted 2 papers 19 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 24 days ago • 106

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

liked a dataset 22 days ago

lmms-lab/multimodal-open-r1-8k-verified

Viewer • Updated 26 days ago • 7.69k • 3.55k • 38

liked a dataset 23 days ago

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 148k • 397

upvoted a paper 29 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 29 days ago • 24

authored a paper 29 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 29 days ago • 24

upvoted a paper about 1 month ago

Fine-Tuning Language Models with Just Forward Passes

Paper • 2305.17333 • Published May 27, 2023 • 3

liked a model about 1 month ago

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 1.3k • 589

upvoted a paper about 2 months ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

liked a Space about 2 months ago

LiveBench

🥇

upvoted 2 papers about 2 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 99

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 6

liked a dataset about 2 months ago

Evo-LMM/rlaif-v

Viewer • Updated Jan 3 • 82.3k • 50 • 3

upvoted 2 papers about 2 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 53

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

updated a dataset about 2 months ago

pufanyi/MMMU

Viewer • Updated Dec 30, 2024 • 1.05k • 57

liked a dataset about 2 months ago

trl-lib/rlaif-v

Viewer • Updated Jan 8 • 83.1k • 307 • 3