`bunsen::kits::gpts`

Full GPT / LLM variants. Where bunsen::blocks provides reusable transformer sub-modules, gpts is for whole language-model architectures: end-to-end models, tokenizer wiring, and the training/inference surface around them.

A compact GPT in the spirit of the “nano” GPT lineage — small enough to train on modest hardware, opinionated enough to be a useful reference implementation.

The model lives in bunsen::kits::gpts::nanochat and is split into:

the per-layer MLP,
the transformer block (attention + MLP + norms),
the full model wrapper that stacks the blocks and adds embedding and head layers.

gpts is a work in progress; further GPT/LLM variants will land here as the port from upstream reference implementations progresses.

The Bunsen Book

`bunsen::kits::gpts`

Current models

`nanochat`

Keyboard shortcuts

The Bunsen Book

bunsen::kits::gpts

Current models

nanochat

`bunsen::kits::gpts`

`nanochat`