Video Search and Understanding
Turn video archives into searchable, understandable knowledge. Find exactly what you’re looking for in hours of video content using natural language queries. Extract insights from spoken words, visual content, and on-screen information.When to Use This
- You have meeting recordings and need to find “what did the client say about the budget?”
- You want to extract all clips where a specific person appears
- You’re building video Q&A for your content library
- You need to search across both what’s spoken AND what’s shown
- You want to extract slides, diagrams, or on-screen content from recordings
What You’ll Build
Keyword Compilations
Search for specific words/phrases and auto-generate highlight reels
Character Extraction
Find all scenes where a specific person appears using face recognition
Multimodal Search
Combine spoken word + visual search for comprehensive queries
NFL Game Analysis
Search and analyze sports footage with multimodal understanding
Conference Slide Scraper
Extract slides and on-screen content from conference recordings
How It Works
- Timestamps - Exact start/end times
- Playable URLs - Stream the matching segment instantly
- Relevance scores - Confidence in the match
- Text evidence - What was said/shown
Related Documentation
Natural Language Query
How semantic search works under the hood
Collection Search
Search across your entire video library
Indexes and Search
Core concepts behind video indexing
Timestamps & Clips
Working with search results