Skip to main content

AI-Powered Captions

Soku includes AI tools that can transcribe your video and generate a ready-to-use caption from the transcript. This feature helps you create engaging captions quickly, especially for video content where writing a summary from scratch can be time-consuming.

How It Works

The AI caption workflow has two steps:
  1. Transcribe — Soku uses OpenAI Whisper to convert the audio in your video into a text transcript.
  2. Generate caption — Soku sends the transcript to an AI model that writes a social media caption based on what was said in the video.
The generated caption is placed into your caption editor, where you can review, edit, and refine it before publishing.

Requirements

Before you can use AI-powered captions, the following conditions must be met:
  • A video must be uploaded. The transcription is based on the audio track of your video file.
  • At least one platform must be selected. The AI uses your selected platforms to tailor the caption style and length.
  • You must have available credits. Transcription and caption generation consume credits from your account balance.
AI-powered captions are only available for video posts. If you are creating a text or image post, these tools will not appear.

Step-by-Step: Generating an AI Caption

  1. On the Create Post page, select Video as your content type.
  2. Upload your video file. Wait for the upload to complete.
  3. Select at least one platform from the platform selector.
  4. Click the Transcribe button to generate a transcript from your video’s audio.
  5. Once the transcript is ready, click Generate Caption to create an AI-written caption.
  6. The generated caption will appear in the caption editor.
  7. Review and edit the caption as needed before publishing.
You can edit the generated caption freely. It is placed in the editor as a starting point — feel free to adjust the tone, add hashtags, or trim it to fit platform character limits.

Credit Usage

Both the transcription and caption generation steps consume credits from your account.
ActionCredits Used
Transcribe videoVaries by video length
Generate AI captionFixed cost per generation
Check your current credit balance in your account settings. If you do not have enough credits, the AI buttons will be disabled.

Tips for Best Results

  • Clear audio produces better transcripts. If your video has background noise, music, or overlapping speakers, the transcript may be less accurate.
  • Review the transcript before generating. If the transcript contains errors, the generated caption may inherit those mistakes.
  • Use platform-specific captions after generating. You can generate a base caption with AI and then enable platform-specific captions to tailor it for each network.