🔄 In a Training Loop

archit

archit11

archit-spec

AI & ML interests

small language models

Recent Activity

updated a model 25 days ago

archit11/glm-4.7-flash-hyperswitch-rl-step50

published a model 25 days ago

archit11/glm-4.7-flash-hyperswitch-rl-step50

updated a model 25 days ago

archit11/glm-4.7-flash-hyperswitch-rl-step20

View all activity

Organizations

upvoted an article about 2 months ago

Article

Forge: Scalable Agent RL Framework and Algorithm

MiniMax-AI

•

Feb 13

• 155

upvoted an article 6 months ago

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

ariG23498, aerdem4

•

Dec 23, 2024

• 51

upvoted an article 8 months ago

Article

Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement

TensorSlay

•

Nov 7, 2025

• 4

upvoted 2 articles 10 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk

•

Aug 18, 2025

• 104

Article

How to Run a Hugging Face Model in JAX (Part 1)

qihqi

•

Jul 20, 2025

• 31

upvoted a paper 11 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

upvoted 3 articles 12 months ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 487

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

rishiraj

•

Jun 26, 2025

• 50

Article

G2P Shrinks Speech Models

hexgrad

•

Feb 5, 2025

• 97

upvoted 3 articles about 1 year ago

Article

State of open video generation models in Diffusers

sayakpaul, a-r-r-o-w, dn6

•

Jan 27, 2025

• 71

Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

tngtech

•

Jun 12, 2025

• 13

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech

•

Apr 16, 2025

• 81

upvoted 2 papers about 1 year ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

upvoted an article about 1 year ago

Article

Enabling Long Context Training with Sequence Parallelism in Axolotl

axolotl-ai-co

•

Apr 4, 2025

• 17

upvoted 2 articles over 1 year ago

Article

SigLIP 2: A better multilingual vision language encoder

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 217

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

Pclanglais

•

Aug 4, 2024

• 30

upvoted 3 collections over 1 year ago

archit

AI & ML interests

Recent Activity

Organizations

archit11's activity

Forge: Scalable Agent RL Framework and Algorithm

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

How to Run a Hugging Face Model in JAX (Part 1)

You could have designed state of the art positional encoding

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

G2P Shrinks Speech Models

State of open video generation models in Diffusers

How Long Prompts Block Other Requests - Optimizing LLM Performance

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Enabling Long Context Training with Sequence Parallelism in Axolotl

SigLIP 2: A better multilingual vision language encoder

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks