Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

upvoted an article about 16 hours ago

Fine-tuning Llama 2 70B using PyTorch FSDP

liked a model 6 days ago

deepseek-ai/DeepSeek-R1-0528

reacted to sayakpaul's post with 🚀 8 days ago

Diffusers supports a good variety of quantization backends. It can be challenging to navigate through them, given the complex nature of diffusion pipelines in general. So, @derekl35 set out to write a comprehensive guide that puts users in the front seat. Explore the different backends we support, learn the trade-offs they offer, and finally, check out the cool space we built that lets you compare quantization results. Give it a go here: https://lnkd.in/gf8Pi4-2

marcsun13's activity

published an article 14 days ago

Article: Exploring Quantization Backends in Diffusers

Co-authored with 2 others • 14 days ago • 31

published an article about 1 month ago

Article: Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Co-authored with 8 others • Apr 29 • 32

published an article 3 months ago

Article: LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

Co-authored with 1 other • Mar 7 • 60

published an article 7 months ago

Article: Introducing SynthID Text

Co-authored with 5 others • Oct 23, 2024 • 45

published an article 9 months ago

Article: Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Co-authored with 5 others • Sep 18, 2024 • 246

published an article 9 months ago

Article: Accelerate 1.0.0

Co-authored with 2 others • Sep 13, 2024 • 52

published an article 11 months ago

Article: Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Co-authored with 7 others • Jul 23, 2024 • 234

published an article about 1 year ago

Article: quanto: a pytorch quantization toolkit

Co-authored with 2 others • Mar 18, 2024 • 38

published an article over 1 year ago

Article: Overview of natively supported quantization schemes in 🤗 Transformers

Co-authored with 4 others • Sep 12, 2023 • 12

published an article almost 2 years ago

Article: Making LLMs lighter with AutoGPTQ and transformers

Co-authored with 5 others • Aug 23, 2023 • 54