AI4Bio@ZJLab

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

JustinLin610 authored a paper 4 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Junde updated a model 18 days ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

Junde published a model 18 days ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

View all activity

InstructPLM's activity

JustinLin610

authored a paper 4 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 128

Junde

updated a model 18 days ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

Updated 18 days ago • 4

Junde

published a model 18 days ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

Updated 18 days ago • 4

JustinLin610

authored 2 papers 21 days ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published 22 days ago • 33

JustinLin610

authored a paper 2 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 156

JustinLin610

authored 2 papers 3 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

xptree

authored a paper 3 months ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18 • 17

JustinLin610

authored 3 papers 4 months ago

JustinLin610

authored 5 papers 5 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 59

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

JustinLin610

authored 3 papers 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 51

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

AI & ML interests

Recent Activity

Team members 6

InstructPLM's activity