Z
Chat with Phi-3-mini
q4f16_1 · 3.8 B
4 K context
~40 tok/s · M2 Pro
On-device only
Running locally on your GPU through 10 hand-written WGSL kernels. Nothing leaves this tab — prompts, tokens and KV cache all stay in your browser.