SpectralPO

AI & ML interests

None defined yet.

Recent Activity

PeterLauLukCh authored a paper 1 day ago

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

PeterLauLukCh authored a paper 1 day ago

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

PeterLauLukCh submitted a paper 8 days ago

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Organization Card

This repo contains all the models for the paper:

Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO

https://arxiv.org/abs/2505.11595

If you use these models, please cite:

@inproceedings{chen2025spectral,
  title = {Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}},
  author = {Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin},
  booktitle = {2nd AI for Math Workshop @ ICML 2025},
  year = {2025},
  url = {https://openreview.net/forum?id=IIBDElbi7s}
}

Collections 7

models 27

datasets 0

None public yet

Team members 3
