Documentation Index
Fetch the complete documentation index at: https://docs.neolli.jocoding.io/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Transcription converts the spoken audio in your video into timestamped captions in the source language. Powered by ElevenLabs Scribe v2, it produces word-level timing and speaker diarization — the essential foundation for translation and dubbing.Starting a transcription
- Open your video from the dashboard
- Click Transcribe
- Select the spoken language, or choose Auto-detect
- Click Start
Supported languages
Neolli supports transcription in 10 languages (plus auto-detect):| Flag | Language | Code |
|---|---|---|
| 🇺🇸 | English | eng |
| 🇪🇸 | Spanish | spa |
| 🇫🇷 | French | fra |
| 🇩🇪 | German | deu |
| 🇮🇹 | Italian | ita |
| 🇧🇷 | Portuguese | por |
| 🇯🇵 | Japanese | jpn |
| 🇰🇷 | Korean | kor |
| 🇨🇳 | Chinese | zho |
| 🇳🇱 | Dutch | nld |
Features
- Speaker diarization — Automatically identifies and labels different speakers
- Word-level timing — Each word gets its own precise timestamp for accurate syncing
- Auto-detect — Identifies the spoken language automatically for common languages
Processing time
| Video length | Estimated time | Mode |
|---|---|---|
| Under 30 min | 1–3 minutes | Synchronous |
| Over 30 min | Proportional to length | Asynchronous |
File requirements
- Max file size: 3 GB
- Supported formats: MP4, MOV, MKV, AVI, WebM, and most common video/audio formats
- Audio: Must contain a detectable audio track with speech
After transcription
Once complete, you can:- Review and edit captions in the caption editor
- Add target languages for translation or dubbing
- Export the source captions as SRT