Bit Transformer Dashboard

Initialize Model

d_model:
nhead:
num_layers:
dim_feedforward:
max_seq_len:
chunk_size:
overlap:
Reversible:
Gradient Checkpointing:
act_threshold:
c_floor:
s_floor:

Train Step

Bits (e.g. 0 1 0 1):
Upload file:

Scale Up

Width Mult:

Collapse Submodel

Cluster Bits (JSON array of arrays):

Target Params (JSON):

Width Scale:

Inference

Bits:
Upload file:

    

Long Inference

Bits:
ctx_bits:
overlap:

    

Text Inference

Text:

    

λ Weights

λK:
λC:
λS:

Diffusion LM

GPU Acceleration

Enable Compression

Ratio: 1.0

Quantization Aware Training

Model Status


    

Telemetry

Hugging Face Checkpoints

Repo ID:
Token: