emperoresearch lab
Open weights03/06
EMPERO/02 — MODELShuggingface.co/empero-ai4 FLAGSHIPS · 4 VARIANTS

Pure PyTorch,
load it and go.

Frontier architectures re-implemented in vanilla PyTorch so the original weights load with bitsandbytes, train with QLoRA, and run on a single consumer GPU — and reasoning distilled from closed frontier models onto small open Qwen weights. Apache-2.0 or NVIDIA OML.

02Models4 ON HUGGING FACE · APACHE-2.0
FIG. 02 — Qwen3.5-9B-Claude-Opus-4.6-DistillApache-2.0
Qwen3.5-9B-Claude-Opus-4.6-Distill

Reasoning-focused fine-tune of Qwen3.5-9B trained to produce <think>-tagged chains before answering. Distilled from Claude Opus 4.6 + Qwen3.5 reasoning traces (~12.8k examples). QLoRA, single RTX 5090, ~4.5 hours.

Benchmarks
Token Acc
86.15
Eval Loss×100
48.09%
06 — Dispatch

First in line for claire.

One letter every other Tuesday — and a single dispatch on the day claire ships with the install line. What we shipped, what we read, the one thing we got wrong. No hype, no roadmap teasers. Cancel from any line.

1 readers · we never share addresses