Get Your API Key
- Go to VideoDB Console
- Copy your API key (free tier: 50 uploads, no credit card)
- Set it in your environment:
Install the SDK
Real-time Perception (Desktop Capture)
Stream what your agent sees and hears. Get structured context back in real-time.Desktop capture currently supports macOS only. Windows support is coming soon.
What You Get
Your backend receives AI-ready events in real-time:Now, It’s Your Turn
Use the code below to connect to our OpenClaw’s live visual and audio feeds, get real-time context, define events, and create alerts. You’ll receive transcript updates and structured screen context in your WebSocket listener, plus you can attach event rules for alerts.Full Capture Guide
Deep dive: channels, permissions, client code, and event handling
Working with Video Files
Upload, index, and search existing recordings.Upload a video
Index spoken words
Create a searchable transcript:Search with natural language
Index Visual Scenes
For video where visuals matter (security footage, tutorials, presentations):Search Across Collections
Scale to thousands of videos:What’s Next
Core Concepts in 5 Min
The mental model: See → Understand → Act
Ingesting Files
Upload videos, audio, and images from URLs or local files
RTSP Ingest
Connect live camera streams and feeds
Create an Index
Make your media searchable with indexes