TikTokAPI Solution

TikTok Video Transcription API with Speech-to-Text

Transcribe any TikTok video using automatic speech recognition. Get accurate, timestamped text output in clean JSON format for LLMs and search applications.

What is the TikTok Transcription API?
The TikTok Transcription API returns clean, timestamped transcripts from any TikTok video in a consistent JSON schema, across 99+ languages — so developers can power search, moderation, and analytics over TikTok content at scale, whether the source clip ships with subtitles or not.

What is the TikTok Video Transcription API?

The TikTok Video Transcription API exposes two endpoints that return the same consistent JSON format with timestamps and metadata. Use /v1/transcript for any public TikTok URL — POST a link, receive a timestamped transcript. Use /v1/transcribe for audio or video files you upload yourself.

Two endpoints for TikTok videos

URL in

Transcript endpoint

POST a public TikTok video URL and receive a timestamped transcript in 99+ languages with rich video metadata in one JSON response.

POST /v1/transcript
File in

Transcribe endpoint

Upload your own audio or video file and receive a timestamped transcript in 99+ languages with the same JSON shape as the Transcript endpoint.

POST /v1/transcribe

What data is returned?

  • Full transcript with word-level timestamps (start/end times)
  • Video metadata: title, description, thumbnail URL
  • Creator information: username, profile URL
  • Video statistics: views, likes, shares, duration
  • Structured segments for easy integration
  • Audio quality indicators
  • Video URL and creation timestamp

Example use cases

Trend Analysis

Monitor TikTok content at scale by transcribing viral videos for trend detection and competitive intelligence.

Content Repurposing

Extract spoken content from TikTok videos to create blog posts, social captions, or searchable archives.

Influencer Research

Analyze creator content by transcribing their videos for topic analysis, keyword extraction, or sentiment tracking.

API workflow

  1. 1.Send POST request with TikTok URL
  2. 2.API downloads and processes audio
  3. 3.Speech-to-text generates transcript
  4. 4.Response returns transcript + metadata

Code examples

Integrate the TikTok Transcription API with these examples.

bash

Response structure

{
  "status": "success",
  "data": {
    "video_info": {
      "title": "TikTok video by @creator",
      "description": "Check out this amazing content! #viral #fyp",
      "thumbnail": "https://p16-sign-va.tiktokcdn.com/...",
      "url": "https://www.tiktok.com/@user/video/1234567890",
      "channel": "@creator",
      "channel_url": "https://www.tiktok.com/@creator",
      "duration": 45.2,
      "views": 2500000,
      "likes": 150000
    },
    "transcript": [
      { "start": 0.0, "end": 2.5, "text": "Hey everyone, welcome back!" },
      { "start": 2.5, "end": 6.8, "text": "Today I'm going to show you something incredible." },
      { "start": 6.8, "end": 11.2, "text": "You won't believe how easy this is." }
    ]
  }
}

Why use VidNavigator for TikTok transcription?

Blazing fast

Our speech-to-text engine transcribes 1 hour of video in under 30 seconds. No waiting around.

99+ languages supported

Transcribe TikTok videos in over 99 languages with high accuracy.

Cost-effective at scale

Scale up to as low as $0.000025/transcript or $0.0041/min for speech-to-text. Even cheaper for Enterprise.

Accurate timestamps

Each transcript segment includes precise start and end times for citation and navigation.

Pricing

Start free. /v1/transcript scales up to as low as $0.000025 per transcript retrieval. /v1/transcribe scales up to as low as $0.0041 per minute of audio. Enterprise plans available at even lower rates.

FAQ

Related solutions