Ask questions in plain English. VideoDB uses semantic search to understand intent and return relevant video segments.

Quick Example

import videodb

conn = videodb.connect()
coll = conn.get_collection()
video = coll.get_video("m-xxx")

# Natural language query
results = video.search("when does the speaker discuss climate change?")

# Play matching segments
results.play()

How It Works

  1. Query Understanding - Your query is transformed into a vector embedding
  2. Similarity Matching - Embeddings are compared against indexed content
  3. Relevance Scoring - Results are ranked by semantic similarity
  4. Timestamp Retrieval - Matching segments are returned with timestamps
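The four steps above can be sketched with a toy cosine-similarity ranking. This is illustrative only — VideoDB performs embedding and matching server-side, and the vectors and timestamps below are made-up values:

```python
import math

def cosine(a, b):
    # Cosine similarity between two embedding vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Toy "index": segment embeddings with timestamps (hypothetical values)
index = [
    {"start": 0.0,  "end": 12.5, "vec": [0.9, 0.1, 0.0]},
    {"start": 12.5, "end": 30.0, "vec": [0.2, 0.8, 0.1]},
    {"start": 30.0, "end": 45.0, "vec": [0.1, 0.2, 0.9]},
]

query_vec = [0.85, 0.15, 0.05]  # step 1: query transformed into an embedding

# Steps 2-4: compare embeddings, rank by similarity, return timestamps
ranked = sorted(index, key=lambda s: cosine(s["vec"], query_vec), reverse=True)
best = ranked[0]
print((best["start"], best["end"]))  # most relevant segment: (0.0, 12.5)
```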

Search Types

Semantic Search (Default)

Understands meaning and intent, not just keywords.
from videodb import SearchType

# Semantic search (default)
results = video.search("How do I fix a leaky faucet?")

# Explicit semantic search
results = video.search(
    query="How do I fix a leaky faucet?",
    search_type=SearchType.semantic
)
Best for:
  • Questions (“What causes…?”, “How do you…?”)
  • Conceptual queries (“explain the theory”)
  • Fuzzy matching (“something about cars”)
Keyword Search

Exact substring matching. Finds literal occurrences of the query terms.
from videodb import SearchType

results = video.search(
    query="API",
    search_type=SearchType.keyword
)
Best for:
  • Technical terms
  • Proper nouns
  • Exact phrases

Comparison

Feature     Semantic Search              Keyword Search
Query       Natural language             Exact terms
Matching    By meaning                   By substring
Example     "How to repair pipes?"       "plumbing repair"
Scope       Single video or collection   Single video only

Index Types

Specify which index to search.
from videodb import IndexType

# Search spoken content (default)
results = video.search(
    query="discusses machine learning",
    index_type=IndexType.spoken_word
)

# Search visual content
results = video.search(
    query="person running through a park",
    index_type=IndexType.scene
)

# Search specific scene index
results = video.search(
    query="red car",
    index_type=IndexType.scene,
    index_id="scene-index-xxx"
)

Tuning Results

Result Threshold

Limit the number of results returned:
results = video.search(
    query="funny moments",
    result_threshold=10  # Return top 10 matches
)

Score Threshold

Filter out low-relevance results:
results = video.search(
    query="product demo",
    score_threshold=0.3  # Only results with score >= 0.3
)

Dynamic Score Percentage

Adaptive filtering based on score distribution:
results = video.search(
    query="key insights",
    dynamic_score_percentage=50  # Keep top 50% of score range
)
The dynamic threshold is calculated from the score range (max_score − min_score), with the percentage expressed as a fraction:
dynamic_threshold = max_score - (max_score - min_score) × (percentage / 100)
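A worked example of the formula (a plain-Python sketch, not the VideoDB implementation): with scores ranging from 0.9 down to 0.5 and dynamic_score_percentage=50, only results scoring at least 0.7 survive.

```python
def dynamic_threshold(scores, percentage):
    # Keep results within the top `percentage` of the score range
    max_score, min_score = max(scores), min(scores)
    score_range = max_score - min_score
    # round to suppress floating-point noise
    return round(max_score - score_range * percentage / 100, 6)

scores = [0.9, 0.8, 0.7, 0.6, 0.5]
threshold = dynamic_threshold(scores, 50)   # 0.9 - 0.4 * 0.5 = 0.7
kept = [s for s in scores if s >= threshold]
print(threshold, kept)  # 0.7 [0.9, 0.8, 0.7]
```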

Search Parameters Reference

Parameter                  Type        Default       Description
query                      str         required      Natural language query
search_type                SearchType  semantic      semantic or keyword
index_type                 IndexType   spoken_word   spoken_word or scene
result_threshold           int         5             Max results to return
score_threshold            float       0.2           Minimum relevance score
dynamic_score_percentage   float       20            Adaptive score filter
index_id                   str         None          Specific scene index ID
[Diagram: layers and parameters of semantic search — queries are transformed into vectors and matched against indexed content]

Query Examples

Spoken Content Queries

# Question format
video.search("What are the main benefits of solar energy?")

# Topic lookup
video.search("discussion about renewable energy")

# Speaker search
video.search("when the CEO mentions revenue")

Visual Content Queries

# Object detection
video.search("red car on the highway", index_type=IndexType.scene)

# Action detection
video.search("person running", index_type=IndexType.scene)

# Scene description
video.search("sunset over the ocean", index_type=IndexType.scene)

Multimodal Queries

Combine spoken and visual search for precise results:
from videodb import IndexType

# Search spoken content
spoken_results = video.search(
    query="talks about the solar system",
    index_type=IndexType.spoken_word
)

# Search visual content
visual_results = video.search(
    query="shows planets or galaxies",
    index_type=IndexType.scene
)

# Find intersection (both conditions met)
spoken_times = [(s.start, s.end) for s in spoken_results.get_shots()]
visual_times = [(s.start, s.end) for s in visual_results.get_shots()]
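One way to finish the intersection step is plain interval-overlap logic (a sketch, not a VideoDB API; the timestamp values below are hypothetical):

```python
def overlap(a, b):
    # Overlapping portion of two (start, end) intervals, or None
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start < end else None

# Hypothetical timestamps from the spoken and visual searches
spoken_times = [(10.0, 25.0), (40.0, 55.0)]
visual_times = [(20.0, 30.0), (50.0, 60.0)]

# Windows where the topic is both spoken about and shown on screen
both = [o for s in spoken_times for v in visual_times
        if (o := overlap(s, v))]
print(both)  # [(20.0, 25.0), (50.0, 55.0)]
```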
