Collections
Discover the best community collections!
Collections including paper arxiv:2503.14456
-
FAN: Fourier Analysis Networks
Paper • 2410.02675 • Published • 26 -
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 85 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 21 -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 7
-
Trellis Networks for Sequence Modeling
Paper • 1810.06682 • Published • 1 -
ProSG: Using Prompt Synthetic Gradients to Alleviate Prompt Forgetting of RNN-like Language Models
Paper • 2311.01981 • Published • 1 -
Gated recurrent neural networks discover attention
Paper • 2309.01775 • Published • 10 -
Inverse Approximation Theory for Nonlinear Recurrent Neural Networks
Paper • 2305.19190 • Published • 1
-
RuCCoD: Towards Automated ICD Coding in Russian
Paper • 2502.21263 • Published • 122 -
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published • 105 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 43 -
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Paper • 2503.05592 • Published • 25
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper • 2409.02097 • Published • 33 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper • 2409.11406 • Published • 26 -
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 124 -
Segment Anything with Multiple Modalities
Paper • 2408.09085 • Published • 22