Mostafa

A7m0d

AI & ML interests

None yet

Recent Activity

liked a model about 10 hours ago

mistralai/Magistral-Small-2506

liked a Space about 13 hours ago

yonigozlan/GOT-OCR-Transformers

liked a model 15 days ago

ByteDance-Seed/BAGEL-7B-MoT

Organizations

A7m0d's activity

upvoted 11 papers 20 days ago

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published 28 days ago • 72

VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

Paper • 2505.14460 • Published 24 days ago • 30

This Time is Different: An Observability Perspective on Time Series Foundation Models

Paper • 2505.14766 • Published 24 days ago • 37

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published 25 days ago • 44

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published 24 days ago • 54

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published 25 days ago • 51

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 23 days ago • 88

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 25 days ago • 73

upvoted a paper 3 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 158

upvoted 2 articles 3 months ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Article • and 3 others • Mar 4 • 75

SigLIP 2: A better multilingual vision language encoder

Article • and 2 others • Feb 21 • 166

upvoted a paper 4 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

upvoted a collection 5 months ago

SmolVLM 256M & 500M

Collection

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated May 5 • 77

upvoted 3 papers 6 months ago

DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

Paper • 2412.15200 • Published Dec 19, 2024 • 9

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 55

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 111

upvoted an article 7 months ago

Fine-tuning Parler TTS on a Specific Language

Article • Sep 16, 2024 • 31