What We’re Building
VideoDB is the perception, memory, and action layer for AI agents operating on video and audio. We turn continuous media—files, live streams, and desktop capture—into structured context, searchable moments with playable evidence, and event triggers that drive automations. The core loop: See → Understand → Act- See: Ingest video and audio from files, RTSP streams, or desktop capture
- Understand: Build indexes that convert raw media into timestamped, searchable memory
- Act: Trigger alerts, automate workflows, and edit video programmatically
What You’ll Work On
Platform Engineering
Platform Engineering
- Backend systems in Python, Node.js, or Go
- APIs, serverless architecture, and scalable video infrastructure
- Real-time pipelines for live streams and desktop capture
AI & Vision Systems
AI & Vision Systems
- Build indexes that turn video into structured understanding
- Work on scene segmentation, frame sampling, and multimodal search
- Develop agent memory and retrieval systems
Open Source
Open Source
- Contribute to Director—our open source AI video agent
- Build with Capture SDK for screen and audio capture
- Create workflow templates for n8n, Zapier, and MCP-based tools
Developer Experience
Developer Experience
- Build intuitive interfaces and developer tools
- Create tutorials, demos, and documentation
- Grow our community of builders
Details
- Location: Remote-first
- Duration: 3-6 months, flexible start dates
- Compensation: Competitive stipend + mentorship