Paste a GitHub PR, an arXiv paper, a Hacker News story, or any article. Pick a tone, a voice, and a format — solo monologue or a two-character dialogue with sticker bodies reacting per line. Get back a render-ready vertical MP4 with karaoke captions over a looped gameplay clip. Built for builders who don't feel like editing in CapCut.
Free. No card. 2 renders to see if the output sounds like you before you commit.
GitHub PR, arXiv paper, Hacker News story, or article (free). YouTube, PDF, Google Doc, or X thread (Pro). We auto-detect the type from the URL, no picker. PDF uploads work too.
Tone: Educational, Hot Take, or Hype. Format: solo monologue (one voice, ~30-45s) or dialogue (two characters back and forth, ~50s, sticker bodies reacting per line). Same source, very different videos.
Script streams in about 10 seconds — fully editable line-by-line in dialogue mode. Pick voices (per-speaker in dialogue), pick a caption style, and render. 1080×1920 MP4 with karaoke captions over a gameplay loop, sticker characters in the bottom corners during dialogue, b-roll images popping in on cued phrases.
Free: GitHub PR diff, arXiv abstract, Hacker News story + comments, any article (Mozilla Readability). Pro: YouTube transcripts, PDFs (URL or upload), Google Docs, X threads. Auto-detected from the URL — no source-type picker.
Three male narrators (Liam, Drew, Adam), three female (Rachel, Aria, Charlotte), two character voices (gravelly Clyde, witchy Glinda). In dialogue mode you pick one voice for the asker and one for the expert independently, so the back-and-forth has real contrast.
Solo voiceover for explainers, or a 4-8 line back-and-forth between an asker (naive, hooks the viewer) and an expert (drops the punchline). Per-line reaction poses drive the sticker characters that animate during their lines.
Four asker bodies + four expert bodies (dev-bro, dev-girl, gamer, skater, anime-girl, founder, scientist, professor). The renderer pairs gender-to-voice automatically, or you pick the exact pair in the UI. Six reaction poses each.
Clean (Inter 72, amber karaoke fill) or Bold 3D (Inter 96, chunky black-stroke TikTok feel). B-roll images pop in for ~2s on cued phrases — Unsplash photos when the prompt is concrete, Flux Schnell when it's abstract. SRT export for CapCut.
1080×1920 vertical, captions burned over a looped 90-second Subway Surfers parkour clip — the brainrot template that holds retention on TikTok. Rendered server-side with ffmpeg, drop straight into TikTok or Shorts.
Monologue scripts stream token-by-token so you can read and edit while the model is still typing. Dialogue scripts come back as 4-8 inline-editable cards so you can polish individual exchanges before voicing.
Enough to see if the output is good enough to post.
For when one render isn't enough.
Plans are billed monthly via Stripe. Refund policy.
Two free renders. No card. Sign in with Google and ship something from your laptop in the next ten minutes.