Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper โข 2506.19852 โข Published 24 days ago โข 38
kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation โข 2B โข Updated Jun 9 โข 1 โข 1
kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation โข 2B โข Updated Jun 9 โข 1 โข 1
view post Post 2697 Anyone using AI and ML to help neurodivergent people? I'd love to hear what you're doing. See translation 4 replies ยท ๐ 7 7 + Reply
kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation โข 2B โข Updated Jun 9 โข 1 โข 1