Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 3 items • Updated 1 day ago • 120
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 161
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 7 items • Updated Jul 21 • 156
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10 • 208
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 16 days ago • 49
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy By medmekk and 5 others • Sep 18, 2024 • 265
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 418