> ## Documentation Index
> Fetch the complete documentation index at: https://docs.videodb.io/llms.txt
> Use this file to discover all available pages before exploring further.

# Create Video Transcription

> Generate a transcription for a video with speech recognition and speaker diarization

Generate a complete transcription of the video's audio content using automatic speech recognition.

<CodeGroup>
  ```python Python theme={null}
  import videodb

  conn = videodb.connect(api_key="your_api_key")
  coll = conn.get_collection()
  video = coll.get_videos()[0]

  # Generate transcript
  result = video.generate_transcript()

  if isinstance(result, dict) and result.get('success'):
      print("Transcript generated successfully")
  else:
      print("Transcript generation started or already exists")

  # Generate transcript with language hint
  video.generate_transcript(language_code="en")
  ```

  ```javascript Node.js theme={null}
  import { connect } from 'videodb';

  const conn = connect({ apiKey: 'your_api_key' });
  const coll = await conn.getCollection();
  const videos = await coll.getVideos();
  const video = videos[0];

  // Generate transcript
  const result = await video.generateTranscript();

  console.log(`Success: ${result.success}`);
  console.log(`Message: ${result.message}`);

  // Generate transcript with language hint
  await video.generateTranscript({ languageCode: 'en' });
  ```
</CodeGroup>

<Note>
  * Generates timestamped transcript with word-level timing information
  * Includes automatic speaker diarization when available
  * Returns success confirmation; retrieve actual transcript with get-transcription
  * Processing time depends on video duration
  * Transcript is cached after first generation (use force: true to regenerate)
  * Optional `language_code` parameter hints the transcription engine about the spoken language for better accuracy
</Note>


## OpenAPI

````yaml POST /video/{video_id}/transcription/
openapi: 3.0.3
info:
  title: VideoDB Server API
  description: >
    VideoDB Server API for video, audio, and image processing with AI
    capabilities.

    This API provides comprehensive video management, search, indexing, and
    AI-powered features.
  version: 1.0.0
  contact:
    name: VideoDB Support
    url: https://videodb.io
  license:
    name: MIT
    url: https://opensource.org/licenses/MIT
servers:
  - url: https://api.videodb.io
    description: Production server
  - url: https://staging-api.videodb.io
    description: Staging server
security:
  - ApiKeyAuth: []
tags:
  - name: Authentication
    description: User authentication and API key management
  - name: Collections
    description: Collection management operations
  - name: Videos
    description: Video upload, processing, and management
  - name: Audio
    description: Audio management operations
  - name: Images
    description: Image management operations
  - name: Search
    description: Content search and indexing
  - name: AI Generation
    description: AI-powered content generation
  - name: Billing
    description: Billing and usage management
  - name: RTStream
    description: Real-time streaming operations
  - name: Utilities
    description: Utility endpoints
  - name: Meeting
    description: Meeting recording and management
  - name: Capture
    description: Capture session management for recording streams
  - name: Editor
    description: Timeline editor operations
  - name: Transcode
    description: Media transcoding operations
  - name: Assets
    description: Cross-collection asset listing
paths:
  /video/{video_id}/transcription/:
    post:
      summary: Generate video transcription
      parameters:
        - name: video_id
          in: path
          required: true
          schema:
            type: string
            pattern: ^m-
            example: m-12345
      requestBody:
        required: true
        content:
          application/json:
            schema:
              type: object
              properties:
                engine:
                  type: string
                  default: default
                  example: default
                force:
                  type: boolean
                  example: false
                language_code:
                  type: string
                  example: en-US
                callback_url:
                  type: string
                  example: https://webhook.example.com/callback
                callback_data:
                  type: object
      responses:
        '200':
          description: Transcription job started
          content:
            application/json:
              schema:
                oneOf:
                  - $ref: '#/components/schemas/AsyncResponse'
                  - type: object
                    properties:
                      success:
                        type: boolean
                        example: true
                      message:
                        type: string
                        example: transcription already exists
      security:
        - ApiKeyAuth: []
components:
  schemas:
    AsyncResponse:
      type: object
      properties:
        success:
          type: boolean
          example: true
        status:
          type: string
          enum:
            - processing
            - done
            - failed
          example: processing
        data:
          type: object
          properties:
            id:
              type: string
              example: job-123
            output_url:
              type: string
              example: https://api.videodb.io/async-response/job-123
  securitySchemes:
    ApiKeyAuth:
      type: apiKey
      in: header
      name: x-access-token
      description: API key for authentication (sk-xxx format)

````