Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 19 days ago • 48
One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published 28 days ago • 101
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 121
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17, 2024 • 52