Michael Goin

mgoin

mgoin_
mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a model 17 days ago

google/gemma-4-E4B-it-qat-mobile-ct

updated a model 17 days ago

google/gemma-4-E2B-it-qat-mobile-ct

published a model 19 days ago

google/gemma-4-E4B-it-qat-mobile-ct

View all activity

Organizations

Collections 1

Papers 4

spaces 5

redhatai-model-explorer

🐳

Browse and filter text models by RedHatAI

Generate text in a chat format

models 103

mgoin/Qwen3.6-35B-A3B-2Bit-GSQ-ct

Image-Text-to-Text • 35B • Updated 24 days ago • 26

mgoin/Qwen3-0.6B-MXFP8

0.6B • Updated Feb 16 • 29

mgoin/GLM-4.6-FP8-BLOCK

Text Generation • 357B • Updated Feb 10 • 11

mgoin/Qwen3-0.6B-NVFP4

0.6B • Updated Aug 26, 2025 • 3

mgoin/mlperf-inference-llama3.1-8b-data

Updated Jul 15, 2025

mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK

8B • Updated Jul 1, 2025 • 2

mgoin/SEMIKONG-70B-W4A16-G128

71B • Updated Jun 16, 2025 • 5

mgoin/llama-4-tiny-random

Text Generation • 6.69M • Updated May 14, 2025 • 3

mgoin/Qwen1.5-14B-Chat-GPTQ

Text Generation • Updated Mar 5, 2025 • 3

mgoin/pixtral-12b

Image-Text-to-Text • 13B • Updated Feb 7, 2025 • 381 • 1

View 103 models

datasets 4

mgoin/mlperf-inference-llama3.1-8b-data

Viewer • Updated Jul 15, 2025 • 13.4k • 26

mgoin/mlperf-inference-llama2-data

Viewer • Updated May 22, 2025 • 24.6k • 295

mgoin/mlperf-inference-llama3.1-405b-data

Viewer • Updated May 22, 2025 • 8.31k • 31

mgoin/ultrachat_2k

Viewer • Updated May 24, 2024 • 2.05k • 59

Michael Goin

AI & ML interests

Recent Activity

Organizations

Collections 1

mgoin/Nemotron-4-340B-Instruct-hf-FP8

mgoin/Nemotron-4-340B-Base-hf-FP8

mgoin/Nemotron-4-340B-Instruct-hf

mgoin/Nemotron-4-340B-Base-hf

mgoin/Nemotron-4-340B-Instruct-hf-FP8

mgoin/Nemotron-4-340B-Base-hf-FP8

mgoin/Nemotron-4-340B-Instruct-hf

mgoin/Nemotron-4-340B-Base-hf

Papers 4

spaces 5

redhatai-model-explorer

Convert Fp8

Hermes Mistral 7b Vllm

Sparse Llama Gsm8k

TinyStories DeepSparse

models 103

mgoin/Qwen3.6-35B-A3B-2Bit-GSQ-ct

mgoin/Qwen3-0.6B-MXFP8

mgoin/GLM-4.6-FP8-BLOCK

mgoin/Qwen3-0.6B-NVFP4

mgoin/mlperf-inference-llama3.1-8b-data

mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK

mgoin/SEMIKONG-70B-W4A16-G128

mgoin/llama-4-tiny-random

mgoin/Qwen1.5-14B-Chat-GPTQ

mgoin/pixtral-12b

datasets 4

mgoin/mlperf-inference-llama3.1-8b-data

mgoin/mlperf-inference-llama2-data

mgoin/mlperf-inference-llama3.1-405b-data

mgoin/ultrachat_2k

Michael Goin

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 4

spaces 5 Sort: Recently updated

redhatai-model-explorer

Convert Fp8

Hermes Mistral 7b Vllm

Sparse Llama Gsm8k

TinyStories DeepSparse

models 103 Sort: Recently updated

datasets 4 Sort: Recently updated

spaces 5

models 103

datasets 4