/ browser ai sandbox

Free AI tools that run in your browser, on your GPU.

Local LLMs, speech-to-text, vision, depth, and image utilities. Each one loads its model once and runs inference on WebGPU or WebAssembly inside your tab. No upload, no API key, no server.

filter

// demos11 of 11

01 · spaLive
/bleepit Bleep words out of a video
Drop a video, name the words (or pick a profanity level), and get a clean cut back, with each match muted and beeped. Whisper finds the words and ffmpeg.wasm does the edit, entirely in your tab. Nothing uploaded.
Open demo
02 · nextLive
/browser-llm In-browser LLM (WebGPU)
Pick from Llama, Qwen, Gemma, Phi, SmolLM and more. Weights download once and cache to your browser; inference runs on WebGPU. Zero servers, zero API keys.
Open demo
03 · nextLive
/embeddings In-browser embeddings (MiniLM)
Index 5 public-domain novels (Austen, Shelley, Doyle, Carroll, Sun Tzu), then search them by meaning. Runs MiniLM-L6-v2 in your tab via Transformers.js.
Open demo
04 · nextLive
/vision In-browser webcam vision
Real-time object detection on your camera feed via YOLOs-tiny. Bounding boxes drawn on a canvas overlay; the video stream never leaves your tab.
Open demo
05 · nextLive
/bg-remove In-browser background remover (RMBG)
Drop any photo, get a transparent-PNG cutout in about a second. Briaai's RMBG-1.4 segmentation model running entirely client-side via Transformers.js. No upload, no API.
Open demo
06 · nextLive
/whisper In-browser speech-to-text (Whisper)
Record from your mic or drop an audio file; OpenAI's Whisper transcribes in your tab with word-level timestamps. Three model sizes from 25 MB; nothing uploaded.
Open demo
07 · nextLive
/depth In-browser depth estimation
Pixel-wise depth maps from a 25 MB Depth Anything v2. Run it on a static image or live webcam, choose a colormap, see what your GPU sees about the world.
Open demo
08 · nextLive
/particles WebGPU particle field
One million points pushed around by a compute shader on your GPU. Move the mouse to attract or repel. No model, no download. Pure WGSL.
Open demo
09 · nextLive
/translator In-browser translator (NLLB-200)
Translate between 23 languages with Meta's NLLB-200 distilled model running fully client-side via Transformers.js. Auto-detect source via a script + stopword heuristic. No second model.
Open demo
10 · nextLive
/tokenizer In-browser tokenizer playground
Paste text → see exactly how 6 different models (GPT-4o, GPT-4, Claude, Llama 3, Mistral, GPT-2) carve it into tokens. Color-coded chips, live char/token ratio.
Open demo
11 · nextLive
/stream In-browser streaming UI
Token-by-token text streaming UI. The chrome we drop in front of OpenAI, Anthropic, or local model calls.
Open demo

// about these tools

Every demo above is a real AI application that downloads its model weights once, caches them in your browser, and then runs inference on your own hardware, usually accelerated by your GPU through WebGPU. Nothing you type, upload, record, or capture ever leaves the tab.

They're built with Transformers.js, WebLLM, and raw WGSL compute shaders, the same primitives we use when we ship production AI features for clients. Treat the sandbox as a showroom: every tile is a thing the browser can already do, with no cloud cost and no privacy compromise.

$ Want to drop in your own SPA? Build it, copy the dist into public/sandbox/<slug>/, and add an entry in src/data/sandbox.ts.

Free AI tools that run in your browser, on your GPU.

/bleepit Bleep words out of a video

/browser-llm In-browser LLM (WebGPU)

/embeddings In-browser embeddings (MiniLM)

/vision In-browser webcam vision

/bg-remove In-browser background remover (RMBG)

/whisper In-browser speech-to-text (Whisper)

/depth In-browser depth estimation

/particles WebGPU particle field

/translator In-browser translator (NLLB-200)

/tokenizer In-browser tokenizer playground

/stream In-browser streaming UI

// about these tools