Zero-TVM Chat — Phi-3-mini on 10 hand-written WGSL shaders

Preparing Zero-TVM

Starting…

Details

Chat with Phi-3-mini

q4f16_1 · 3.8 B 4 K context ~40 tok/s · M2 Pro On-device only

Running locally on your GPU through 10 hand-written WGSL kernels. Nothing leaves this tab — prompts, tokens and KV cache all stay in your browser.

Enter to send · Shift+Enter for new line · Zero TVM · 10 WGSL kernels · 228 dispatches/token