Applications open now
A practical crash course on how to build voice AI: from pipeline fundamentals and architecture to your first working voice agent.
Who it is for
Outcomes
01
Understand the core voice pipeline: audio -> ASR -> LLM -> tools -> TTS.
02
Learn why latency, interruptions, and turn-taking are central in voice AI.
03
Know the main building blocks of a voice agent.
04
Understand the tradeoffs between local and API-based components.
05
Build a first working voice agent.
06
Develop vocabulary to discuss voice AI architecture credibly.
Curriculum
Session 1
Session 2
Session 3
Session 4
Included
Application
The application form helps assess each applicant's fit, technical background, and motivation, and determines the right founder-led follow-up.
Applications open now
$490
FAQ
Founders, engineers, PMs, and technical builders who want system-level understanding of voice AI.
No prior speech AI expertise is required. Technical curiosity and willingness to reason about architecture are expected.
Yes. Recordings are included so you can revisit the material after the live sessions.
It is practical and technical. The goal is to understand the stack and its constraints and to build a first agent, not just to click through no-code tools.
A simple working voice agent and a stronger architecture base for future prototypes.
The focus is voice-specific system understanding: latency, turn-taking, interruptions, ASR/TTS, realtime pipelines, and production tradeoffs, not generic LLM prompting.