Joonhyung Lee

joonhyung-lee-naver

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

You could have designed state of the art positional encoding

commented on an article 2 months ago

Accelerating LLM Inference with TGI on Intel Gaudi

upvoted an article 2 months ago

Accelerating LLM Inference with TGI on Intel Gaudi

View all activity

Organizations

joonhyung-lee-naver's activity

upvoted an article 13 days ago

Article

You could have designed state of the art positional encoding

•

Nov 25, 2024

• 287

commented on Accelerating LLM Inference with TGI on Intel Gaudi 2 months ago

Great work!

upvoted an article 2 months ago

Article

Accelerating LLM Inference with TGI on Intel Gaudi

and 4 others •

Mar 28

• 13

liked a Space 4 months ago

2.66k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

reacted to regisss's post with 🔥 4 months ago

Post

1755

Nice paper comparing the fp8 inference efficiency of Nvidia H100 and Intel Gaudi2: An Investigation of FP8 Across Accelerators for LLM Inference (2502.01070)

The conclusion is interesting: "Our findings highlight that the Gaudi 2, by leveraging FP8, achieves higher throughput-to-power efficiency during LLM inference"

One aspect of AI hardware accelerators that is often overlooked is how they consume less energy than GPUs. It's nice to see researchers starting carrying out experiments to measure this!

Gaudi3 results soon...

authored 2 papers 10 months ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 25

To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability

Paper • 2405.18710 • Published May 29, 2024

liked a Space about 1 year ago

959

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training