Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30, 2025 • 47
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3, 2025 • 88
PEMA: An Offsite-Tunable Plug-in External Memory Adaptation for Language Models Paper • 2311.08590 • Published Nov 14, 2023
Scalable and Efficient MoE Training for Multitask Multilingual Models Paper • 2109.10465 • Published Sep 22, 2021
Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness Paper • 2310.02410 • Published Oct 3, 2023 • 3
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs Paper • 2308.09723 • Published Aug 16, 2023 • 2
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation Paper • 2210.07535 • Published Oct 14, 2022 • 1
Taming Sparsely Activated Transformer with Stochastic Experts Paper • 2110.04260 • Published Oct 8, 2021 • 2
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper • 2401.08417 • Published Jan 16, 2024 • 37
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 32