Dataset and Models for ''IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards''
guox18
guox18
·
AI & ML interests
Alignment
Recent Activity
authored
a paper
about 10 hours ago
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and
Self-Improving OCR
authored
a paper
about 10 hours ago
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with
Verifiable Rewards
authored
a paper
about 10 hours ago
Intern-S1: A Scientific Multimodal Foundation Model
Organizations
None yet