Ivan Gabor • 2 months ago
Building Squared: How I Used Gemini Live API to Create an AI That Coaches You While You Speak
Hey everyone! I just submitted my project Squared to the Gemini Live Agent Challenge, and I wanted to share what I built.
The problem: Every presentation tool gives you feedback after the fact: record, upload, wait, repeat. That's like a GPS that tells you "you missed the turn" five minutes too late.
The solution: Squared is a real-time AI navigator for presentations. A coach evaluates your performance after the fact. A navigator rides with you, knows the road ahead, remembers the turns where you've gotten lost before, and guides you through them as they happen.
Key highlights:
Rehearsal Mode: The AI speaks to you in real time, tracking live metrics (pace, eye contact, posture, filler words) and remembering your weak spots across sessions.
Presentation Mode: On stage, the AI goes completely silent but surfaces a live HUD with micro-prompts and rescue text visible only to you.
Pattern Memory: Built with Gemini Embedding API + pgvector, it retains every rehearsal so each session builds on the last.
Dual-Agent Architecture: Two parallel Gemini Live sessions, one navigating your delivery and one monitoring audience engagement.
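To make the live metrics from Rehearsal Mode a bit more concrete, here is a minimal Python sketch of how pace and filler words could be measured over one window of transcribed speech. It is an illustration only, not code from the Squared repo: the `speech_metrics` function, the `FILLERS` set, and the idea of scoring a fixed transcript window are all assumptions.

```python
import re

# Hypothetical filler list; the post does not say which words Squared tracks.
FILLERS = {"um", "uh", "like", "basically", "actually"}


def speech_metrics(transcript: str, duration_seconds: float) -> dict:
    """Return word count, words per minute, and filler count for one window."""
    # Lowercase and keep only alphabetic tokens (apostrophes allowed).
    words = re.findall(r"[a-z']+", transcript.lower())
    filler_count = sum(1 for word in words if word in FILLERS)
    # Guard against a zero-length window to avoid dividing by zero.
    wpm = len(words) / duration_seconds * 60 if duration_seconds > 0 else 0.0
    return {"words": len(words), "wpm": wpm, "fillers": filler_count}
```

For example, `speech_metrics("Um, so this is, like, the main idea", 4.0)` returns `{"words": 8, "wpm": 120.0, "fillers": 2}`. A real pipeline would presumably feed this from streaming transcription rather than a finished string.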
Built with Gemini 2.5 Flash Live API, MediaPipe, and Google Cloud. Running in full production, not a localhost demo.
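For readers curious what the Pattern Memory idea looks like in practice, the sketch below shows similarity-based recall in plain Python. It is a stand-in under stated assumptions, not the project's implementation: the in-memory list replaces a pgvector table, the three-dimensional vectors replace real embeddings from the Gemini Embedding API, and `recall_weak_spots` is a hypothetical helper name. pgvector can do the same ranking in SQL with its cosine distance operator (`<=>`).

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recall_weak_spots(query, memory, top_k=2):
    """Return the notes whose stored vectors are most similar to the query.

    `memory` is a list of (note, vector) pairs; in a real system the vectors
    would come from an embedding model and live in a vector store.
    """
    scored = [(cosine_similarity(query, vector), note) for note, vector in memory]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [note for _, note in scored[:top_k]]


# Toy data: made-up notes and hand-written vectors, purely for illustration.
memory = [
    ("rushed the pricing slide", [1.0, 0.0, 0.0]),
    ("lost eye contact during demo", [0.0, 1.0, 0.0]),
    ("filler words in intro", [0.9, 0.1, 0.0]),
]
```

With this toy data, `recall_weak_spots([1.0, 0.05, 0.0], memory)` returns `["rushed the pricing slide", "filler words in intro"]`, which is the kind of lookup that would let a new session start from earlier trouble spots.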
Would love to hear your thoughts and feedback!
Links
Live Demo
https://squared-j2gx3ygtta-uc.a.run.app
Source Code
https://github.com/gaborishka/squared
Mac App
https://github.com/gaborishka/squared/releases/tag/v0.2.0
Technical Deep-Dive
https://ivanhabor.substack.com/p/building-squared-how-i-used-gemini
Devpost Submission
https://devpost.com/software/squared-ai-presentation-navigator