What We’re Building

VideoDB is the perception, memory, and action layer for AI agents operating on video and audio. We turn continuous media (files, live streams, and desktop capture) into structured context, searchable moments with playable evidence, and event triggers that drive automations.

The core loop is See → Understand → Act:
  • See: Ingest video and audio from files, RTSP streams, or desktop capture
  • Understand: Build indexes that convert raw media into timestamped, searchable memory
  • Act: Trigger alerts, automate workflows, and edit video programmatically
We’re building for developers who need to give their agents eyes and ears, whether for media archives, real-time monitoring, desktop agents, or automation workflows.

What You’ll Work On

  • Build backend systems in Python, Node.js, or Go
  • Work on APIs, serverless architecture, and scalable video infrastructure
  • Build real-time pipelines for live streams and desktop capture
  • Build indexes that turn video into structured understanding
  • Work on scene segmentation, frame sampling, and multimodal search
  • Develop agent memory and retrieval systems
  • Contribute to Director—our open source AI video agent
  • Build with Capture SDK for screen and audio capture
  • Create workflow templates for n8n, Zapier, and MCP-based tools
  • Build intuitive interfaces and developer tools
  • Create tutorials, demos, and documentation
  • Grow our community of builders

Details

  • Location: Remote-first
  • Duration: 3-6 months, flexible start dates
  • Compensation: Competitive stipend + mentorship

Apply

Send your application to [email protected] with the subject line “Internship Application”. Include links to your GitHub profile or relevant projects if you have them.