Shawon Ashraf
shawon
AI & ML interests
Multi-Modal NLP, LLM and RAG
Recent Activity
Liked a model about 7 hours ago
distilbert/distilbert-base-uncased-finetuned-sst-2-english
Upvoted a collection 4 days ago
Any-to-Any Models, Datasets, Spaces
Liked a model 14 days ago
google/medgemma-4b-pt
Collections (6)
- PointArena: Probing Multimodal Grounding Through Language-Guided Pointing
  Paper • 2505.09990 • Published • 11
- Style Customization of Text-to-Vector Generation with Image Diffusion Priors
  Paper • 2505.10558 • Published • 15
- Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
  Paper • 2505.10046 • Published • 9
- X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
  Paper • 2505.07096 • Published • 3