Crystalite Balanced 100K (Production)

Crystalite checkpoint trained for 100K steps on a balanced 32K subset of Alex-MP-20 with 35% insulators (vs 2.1% in the full dataset). This is the production model for guided crystal generation.

Architecture: 67.8M-parameter Diffusion Transformer with subatomic tokenizer and GEM attention bias (Crystalite, Hadzi Veljkovic et al.).

Results at w=3 (production operating point)

Metric	Value
In-window rate (4-6 eV)	42.6%
Lattice validity	100%
Geometry validity	99.6%
Compositional uniqueness	78%
Metal fraction	0.2%

Formation energy probe AUROC: 0.990. Band gap probe AUROC: ~0.95.

Multi-constraint generation

Hybrid gradient steering + token masking produces: 100% refractory, 0% cobalt/nickel, 100% insulator, 30% in target window.

Usage

Requires the Crystalite codebase and probe-gradient-guidance scripts.

from scripts.train_probe import load_model
model = load_model("final.pt", device="cuda")

Used In

This checkpoint was used as an upstream generation asset in the open-world environment pipeline for Training Scientific Judgment with Verified Environments for Autonomous Science.

Scientific judgment blog post: Training Scientific Judgment
Public repo: Dynamical-Systems-Research/training-scientific-judgment
Paper PDF: Training Scientific Judgment with Verified Environments for Autonomous Science

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for Dynamical-Systems/crystalite-balanced

Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

Paper • 2604.02270 • Published 12 days ago • 1

Dynamical-Systems
/

crystalite-balanced