Phone browsers don't have enough GPU memory — the download will crash the tab. Open this URL on a desktop Chrome, Edge, or Safari TP.
The download happens once. After that, the model is cached locally — no server, no API key, no telemetry.
WebGPU required (Chrome / Edge / Safari TP) · ~2 GB GPU memory · 1–3 min first download
lm_head