AdamF92's Collections
Sparse Query Attention (SQA) Research by Reactive AI

updated 4 days ago

Experimental models with Sparse Query Attention (SQA) layers. They reduce training time and cost by ~3-10% compared to GQA and MQA, at the same level of performance.

  • ReactiveAI/sSQAT-mm

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/SQAT-mm

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/xSQAT-mm

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/GQA-Ref-Micro

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/MQA-Ref-Micro

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/SQAT-m

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/xSQAT-m

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/sSQAT-m

    Text Generation • 0.0B • Updated 4 days ago

  • ReactiveAI/xSMQAT-m

    Text Generation • 0.0B • Updated 4 days ago

  • Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction

    Paper • 2510.01817 • Published 5 days ago • 12