Sunyoung Hwang's picture

Sunyoung Hwang PRO

sosoai

·

https://sosohajalab.com

AI & ML interests

llm, vision, transformers, megabytes

Recent Activity

liked a model about 23 hours ago

CohereLabs/command-a-reasoning-08-2025

liked a model 1 day ago

nasa-ibm-ai4science/Surya-1.0

liked a model 3 days ago

deepseek-ai/DeepSeek-V3.1-Base

View all activity

Organizations

upvoted a collection 7 days ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated about 23 hours ago • 216

upvoted an article 17 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

18 days ago

• 470

upvoted an article 21 days ago

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

22 days ago

• 63

upvoted an article 23 days ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

25 days ago

• 158

upvoted a paper 26 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 30 days ago • 289

upvoted a collection about 1 month ago

Skywork-R1V3

Advanced multimodal reasoning model • 7 items • Updated 15 days ago • 14

upvoted 2 collections about 2 months ago

GLM-4.1V-Thinking

5 items • Updated Jul 2 • 54

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 158

upvoted a paper 2 months ago

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published Jun 16 • 44

upvoted a collection 3 months ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 15 days ago • 72

upvoted an article 3 months ago

Article

Interactive Tools for machine learning, deep learning, and math

By

•

May 26

• 44

upvoted 3 collections 3 months ago

Perception Encoder

17 items • Updated Jul 11 • 65

Qwen3

84 items • Updated 16 days ago • 1.13k

INTELLECT-2

INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated Jul 14 • 24

upvoted 2 collections 4 months ago

CCI4.0

5 items • Updated Jun 10 • 11

Qwen3

21 items • Updated Apr 29 • 30

upvoted an article 4 months ago

Article

How to Build an MCP Server with Gradio

By

and 1 other •

Apr 30

• 189

upvoted 2 collections 4 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 1 day ago • 193

Perception LM

7 items • Updated Apr 17 • 61

upvoted an article 4 months ago

Article

Cohere on Hugging Face Inference Providers 🔥

By

and 6 others •

Apr 16

• 131