Abhay Gupta's picture

1 10 2

Abhay Gupta

abhaygupta

·

AI & ML interests

LLM Training & Inference; Sparsity and Quantization

Recent Activity

updated a model about 2 months ago

abhaygupta/Qwen3-32B

published a model about 2 months ago

abhaygupta/Qwen3-32B

updated a model about 2 months ago

abhaygupta/Qwen3-14B

View all activity

Organizations

authored a paper 3 months ago

$μ$nit Scaling: Simple and Scalable FP8 LLM Training

Paper • 2502.05967 • Published Feb 9

authored a paper about 1 year ago

DAiSEE: Towards User Engagement Recognition in the Wild

Paper • 1609.01885 • Published Sep 7, 2016

authored 4 papers over 1 year ago

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Paper • 2303.10464 • Published Mar 18, 2023 • 1

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Paper • 2303.11525 • Published Mar 21, 2023 • 1

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7