Shawon Ashraf's picture

35 333

Shawon Ashraf

shawon

·

https://shawonashraf.github.io

AI & ML interests

Multi-Modal NLP, LLM and RAG

Recent Activity

liked a dataset 18 days ago

black-forest-labs/kontext-bench

liked a model 24 days ago

black-forest-labs/FLUX.1-Kontext-dev

liked a model 27 days ago

google/gemma-3n-E4B-it

View all activity

Organizations

upvoted an article about 2 months ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

By

•

Jun 4, 2024

• 79

upvoted a collection about 2 months ago

Any-to-Any Models, Datasets, Spaces

18 items • Updated Jun 20 • 23

upvoted 8 papers 2 months ago

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge

Paper • 2505.10468 • Published May 15 • 9

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

Paper • 2505.08581 • Published May 13 • 9

Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning

Paper • 2505.09738 • Published May 14 • 9

Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Paper • 2505.10558 • Published May 15 • 15

Depth Anything with Any Prior

Paper • 2505.10565 • Published May 15 • 11

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

Paper • 2505.09990 • Published May 15 • 12

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 23

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

upvoted an article 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 488

upvoted 2 collections 3 months ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 4 days ago • 52

upvoted an article 4 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 351

upvoted a collection 5 months ago

SigLIP2

36 items • Updated 15 days ago • 79

upvoted a paper 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 237

upvoted a collection 8 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 177

upvoted a paper 9 months ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted a collection 9 months ago

LongVU

7 items • Updated Oct 31, 2024 • 34

upvoted a paper 10 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 180