Does this work for private TikTok videos?

No. The API only works with publicly accessible TikTok videos.

How long does transcription take?

Our engine is blazing fast: we transcribe 1 hour of video in under 30 seconds. Short TikTok videos are processed almost instantly.

What languages are supported?

We support 99+ languages with high accuracy. The engine automatically detects the spoken language.

What's the maximum video length?

TikTok videos are typically short (up to 10 minutes). All standard TikTok video lengths are supported.

Which endpoint should I call — /v1/transcript or /v1/transcribe?

Use /v1/transcript when your input is a public TikTok URL; pricing scales up to as low as $0.000025 per transcript. Use /v1/transcribe when you are uploading your own audio or video file; pricing scales up to as low as $0.0042 per minute of audio.

Back to solutions

API Solution

TikTok Video Transcription API with Speech-to-Text

Transcribe any TikTok video using automatic speech recognition. Get accurate, timestamped text output in clean JSON format for LLMs and search applications.

What is the TikTok Transcription API?: The TikTok Transcription API returns clean, timestamped transcripts from any TikTok video in a consistent JSON schema, across 99+ languages — so developers can power search, moderation, and analytics over TikTok content at scale, whether the source clip ships with subtitles or not.

What is the TikTok Video Transcription API?

The TikTok Video Transcription API exposes two endpoints that return the same consistent JSON format with timestamps and metadata. Use /v1/transcript for any public TikTok URL — POST a link, receive a timestamped transcript. Use /v1/transcribe for audio or video files you upload yourself.

Two endpoints for TikTok videos

URL in

Transcript endpoint

POST a public TikTok video URL and receive a timestamped transcript in 99+ languages with rich video metadata in one JSON response.

POST /v1/transcript

File in

Transcribe endpoint

Upload your own audio or video file and receive a timestamped transcript in 99+ languages with the same JSON shape as the Transcript endpoint.

POST /v1/transcribe

What data is returned?

Full transcript with word-level timestamps (start/end times)
Video metadata: title, description, thumbnail URL
Creator information: username, profile URL
Video statistics: views, likes, shares, duration
Structured segments for easy integration
Audio quality indicators
Video URL and creation timestamp

Example use cases

Trend Analysis

Monitor TikTok content at scale by transcribing viral videos for trend detection and competitive intelligence.

Content Repurposing

Extract spoken content from TikTok videos to create blog posts, social captions, or searchable archives.

Influencer Research

Analyze creator content by transcribing their videos for topic analysis, keyword extraction, or sentiment tracking.

API workflow

1.Send POST request with TikTok URL
2.API downloads and processes audio
3.Speech-to-text generates transcript
4.Response returns transcript + metadata

Code examples

Integrate the TikTok Transcription API with these examples.

Transcript endpoint docs (URL in) →Transcribe endpoint docs (file in) →

bash

Response structure

{
  "status": "success",
  "data": {
    "video_info": {
      "title": "TikTok video by @creator",
      "description": "Check out this amazing content! #viral #fyp",
      "thumbnail": "https://p16-sign-va.tiktokcdn.com/...",
      "url": "https://www.tiktok.com/@user/video/1234567890",
      "channel": "@creator",
      "channel_url": "https://www.tiktok.com/@creator",
      "duration": 45.2,
      "views": 2500000,
      "likes": 150000
    },
    "transcript": [
      { "start": 0.0, "end": 2.5, "text": "Hey everyone, welcome back!" },
      { "start": 2.5, "end": 6.8, "text": "Today I'm going to show you something incredible." },
      { "start": 6.8, "end": 11.2, "text": "You won't believe how easy this is." }
    ]
  }
}

Why use VidNavigator for TikTok transcription?

Blazing fast

Our speech-to-text engine transcribes 1 hour of video in under 30 seconds. No waiting around.

99+ languages supported

Transcribe TikTok videos in over 99 languages with high accuracy.

Cost-effective at scale

Scale up to as low as $0.000025/transcript or $0.0042/min for speech-to-text. Even cheaper for Enterprise.

Accurate timestamps

Each transcript segment includes precise start and end times for citation and navigation.

Pricing

Start free. /v1/transcript scales up to as low as $0.000025 per transcript retrieval. /v1/transcribe scales up to as low as $0.0042 per minute of audio. Enterprise plans available at even lower rates.

View pricing Get API key

FAQ

Compare to other transcription APIs

See exactly where VidNavigator fits against Whisper, AssemblyAI, and Deepgram.

ComparisonVidNavigator vs. WhisperWhy VidNavigator is the fastest way to go from a video URL to a clean, timestamped transcript — without running your own ingestion pipeline and GPU.ComparisonVidNavigator vs. AssemblyAIPlatform-native URL ingestion, transparent per-transcript pricing, and one API for YouTube, TikTok, Instagram, Facebook and X.ComparisonVidNavigator vs. DeepgramStreaming ASR vs. video-native intelligence. Per-transcript pricing on popular platforms, cross-platform URL ingestion, and search on top of the transcript.