Glitch Token

Level 1 Probes — Solved 0

A vocabulary, full of tokens. One of them is cursed.

Inference output Tap a token to run inference.

How Glitch Token Works

Somewhere in this vocabulary lurks a SolidGoldMagikarp — a token so cursed it makes the model speak in tongues. It was never trained on, only tokenized, and now it sits in the embedding matrix like a landmine. Your job: find it before it ships to a billion users.

The grid is the model's token vocabulary. Exactly one token is glitched.
Tap a token to run inference on it. You'll see the output it produces.
Clean tokens decode cleanly. Tokens near the glitch come out increasingly garbled, looping, and unhinged.
You have a limited probe budget — every probe costs compute, so triangulate wisely.
When you've found the strongest corruption, tap the cursed token to select it, then Accuse it.

Reading the Corruption

Corruption strength is a heat signal: the closer a probed token sits to the glitch on the grid, the more broken its output. A token that returns pure noise is a direct neighbor. A token that decodes almost-cleanly is far away. The glitch itself is silent — it just breaks everything around it.

Slop Fact: Glitch tokens are real. " SolidGoldMagikarp", " petertodd", and friends were Reddit usernames that got tokenized but never trained, so the model treats them as cursed runes. Ask GPT to repeat them and it hallucinates, refuses, or insults you. Anomalous tokens: the closest thing an LLM has to a haunted house.