Mishig Davaadorj's picture

Mishig Davaadorj

mishig

·

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

upvoted a paper about 20 hours ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

new activity 6 days ago

deepseek-ai/DeepSeek-R1-0528:Summer or Winter?

updated a Space 6 days ago

huggingface/inference-playground

View all activity

Organizations

mishig's activity

upvoted a paper about 20 hours ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published 1 day ago • 50

upvoted an article 9 days ago

Article

Interactive Tools for machine learning, deep learning, and math

By

•

9 days ago

• 40

upvoted a paper 11 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 21 days ago • 62

upvoted a changelog 12 days ago

Changelog

AI-generated Abstract summaries on Hugging Face Papers

13 days ago

• 65

upvoted a changelog 13 days ago

Changelog

Filter by MCP compatibility available in HF Spaces

13 days ago

• 70

upvoted an article 20 days ago

Article

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

By

and 9 others •

20 days ago

• 32

upvoted a paper 29 days ago

RobustDexGrasp: Robust Dexterous Grasping of General Objects from Single-view Perception

Paper • 2504.05287 • Published Apr 7 • 6

upvoted 2 articles about 1 month ago

Article

How to Build an MCP Server with Gradio

By

and 1 other •

Apr 30

• 147

Article

The 4 Things Qwen-3's Chat Template Teaches Us

By

•

Apr 30

• 49

upvoted a paper about 1 month ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15 • 28

upvoted an article about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 267

upvoted a paper about 1 month ago

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Paper • 2201.11903 • Published Jan 28, 2022 • 13

upvoted an article about 1 month ago

Article

An Introduction to AI Model Optimization Techniques

By

and 1 other •

Apr 18

• 28

upvoted a paper 2 months ago

Universal Language Model Fine-tuning for Text Classification

Paper • 1801.06146 • Published Jan 18, 2018 • 7

upvoted a paper 3 months ago

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 15

upvoted 3 articles 3 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 185

Article

❤️ a love letter to the Open AI inference client

By

•

Feb 28

• 9

Article

Remote VAEs for decoding with HF endpoints 🤗

By

and 1 other •

Feb 24

• 39

upvoted 2 papers 3 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 189

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 159