Skip to main content
GET
/
audio
/
{audio_id}
/
transcription
Get audio transcription
curl --request GET \
  --url https://api.videodb.io/audio/{audio_id}/transcription/ \
  --header 'x-access-token: <api-key>'
{
  "success": true,
  "status": "completed",
  "data": {
    "transcript": [
      {
        "text": "Hello world",
        "start": 1.5,
        "end": 3.2
      }
    ]
  }
}

Documentation Index

Fetch the complete documentation index at: https://docs.videodb.io/llms.txt

Use this file to discover all available pages before exploring further.

Retrieve the transcription data for a specific audio file, including word-level timestamps.
import videodb

conn = videodb.connect(api_key="your_api_key")
coll = conn.get_collection()
audio = coll.get_audio("audio_id")

# Get full transcript with timestamps
transcript = audio.get_transcript()

# Get transcript for a time range
transcript = audio.get_transcript(start=10, end=60)

# Segment by sentence
transcript = audio.get_transcript(segmenter="sentence")

# Get plain text transcript
text = audio.get_transcript_text()
  • Use start and end parameters to retrieve a specific time range
  • Returns word-level timestamps for precise alignment
  • The engine parameter selects which transcription engine’s output to retrieve

Create Transcription

Generate a new transcription

Get Audio

Retrieve audio file details

Authorizations

x-access-token
string
header
required

API key for authentication (sk-xxx format)

Path Parameters

audio_id
string
required
Pattern: ^a-
Example:

"a-12345"

Query Parameters

engine
string
Example:

"default"

start
number
default:0
Example:

0

end
number
default:-1
Example:

60

Response

Audio transcription data

success
boolean
Example:

true

status
enum<string>
Available options:
completed,
processing,
failed
Example:

"completed"

data
object