Supported Languages
Scriptix supports 40+ languages for speech-to-text transcription.
Language Codes
Use ISO 639-1 two-letter language codes in API requests.
All Supported Languages
European Languages
| Language | Code | Native Name |
|---|---|---|
| English | en | English |
| Dutch | nl | Nederlands |
| French | fr | Français |
| German | de | Deutsch |
| Spanish | es | Español |
| Italian | it | Italiano |
| Portuguese | pt | Português |
| Polish | pl | Polski |
| Czech | cs | Čeština |
| Danish | da | Dansk |
| Finnish | fi | Suomi |
| Greek | el | Ελληνικά |
| Hungarian | hu | Magyar |
| Norwegian | no | Norsk |
| Romanian | ro | Română |
| Slovak | sk | Slovenčina |
| Swedish | sv | Svenska |
| Russian | ru | Русский |
| Ukrainian | uk | Українська |
Middle Eastern Languages
| Language | Code | Native Name |
|---|---|---|
| Arabic | ar | العربية |
| Hebrew | he | עברית |
| Turkish | tr | Türkçe |
| Persian/Farsi | fa | فارسی |
Asian Languages
| Language | Code | Native Name |
|---|---|---|
| Chinese | zh | 中文 |
| Japanese | ja | 日本語 |
| Korean | ko | 한국어 |
| Hindi | hi | हिन्दी |
| Thai | th | ไทย |
| Indonesian | id | Bahasa Indonesia |
| Vietnamese | vi | Tiếng Việt |
| Malay | ms | Bahasa Melayu |
| Tamil | ta | தமிழ் |
| Telugu | te | తెలుగు |
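A client can catch a mistyped `language` value before any audio is uploaded. The sketch below collects the codes from the three tables above into a set and checks a value against it. `SUPPORTED_CODES` and `validate_language` are illustrative names, not part of the Scriptix API, and the set covers only the codes listed on this page.

```python
# Codes copied from the tables above; this set is a local convenience,
# not something the Scriptix API provides.
EUROPEAN = {"en", "nl", "fr", "de", "es", "it", "pt", "pl", "cs", "da",
            "fi", "el", "hu", "no", "ro", "sk", "sv", "ru", "uk"}
MIDDLE_EASTERN = {"ar", "he", "tr", "fa"}
ASIAN = {"zh", "ja", "ko", "hi", "th", "id", "vi", "ms", "ta", "te"}
SUPPORTED_CODES = EUROPEAN | MIDDLE_EASTERN | ASIAN


def validate_language(code: str) -> str:
    """Return the normalized code, or raise ValueError if it is not listed."""
    normalized = code.strip().lower()
    # "auto" is accepted as well (see Automatic Language Detection).
    if normalized != "auto" and normalized not in SUPPORTED_CODES:
        raise ValueError(f"Unsupported language code: {code!r}")
    return normalized
```

Calling `validate_language(" NL ")` returns the normalized code, while an unlisted value raises `ValueError` locally instead of costing a failed API round trip.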
Usage Examples
Specify Language
curl -X POST https://api.scriptix.io/api/v3/stt \
-H "Authorization: Bearer YOUR_API_KEY" \
-F "language=en" \
-F "audio_file=@audio.mp3"
Python
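import requests

# Open the audio in binary mode; the context manager closes it after the upload
with open('audio.mp3', 'rb') as audio:
    response = requests.post(
        'https://api.scriptix.io/api/v3/stt',
        headers={'Authorization': 'Bearer YOUR_API_KEY'},
        files={'audio_file': audio},
        data={'language': 'en'},
    )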
Automatic Language Detection
To let Scriptix detect the language automatically, set the language parameter to "auto":
curl -X POST https://api.scriptix.io/api/v3/stt \
-H "Authorization: Bearer YOUR_API_KEY" \
-F "language=auto" \
-F "audio_file=@audio.mp3"
Note: Language detection works best with 10+ seconds of clear speech.
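One way to apply the note above on the client side is a small helper that sends the known language when there is one and falls back to "auto" otherwise, warning about clips shorter than 10 seconds. The function name, the `speech_seconds` parameter, and the warning are assumptions made for illustration; the API itself only sees the final `language` value.

```python
import warnings
from typing import Optional

MIN_DETECTION_SECONDS = 10.0  # from the note above: 10+ seconds of clear speech


def choose_language(known_language: Optional[str], speech_seconds: float) -> str:
    """Pick the value to send in the `language` form field."""
    if known_language:
        # An explicit code skips detection entirely.
        return known_language
    if speech_seconds < MIN_DETECTION_SECONDS:
        warnings.warn(
            f"Only {speech_seconds:.1f}s of audio; auto-detection may be unreliable."
        )
    return "auto"
```

The helper does not measure the audio itself; the caller supplies the duration from whatever metadata it already has.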
Language-Specific Features
Custom Models
Custom models can be trained for any supported language:
curl -X POST https://api.scriptix.io/api/v3/custom_models \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"name": "Medical Dutch",
"language": "nl",
"base_model": "medical"
}'
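The same request can be prepared from Python. This sketch uses only the standard library and mirrors the curl call above. The helper name `build_custom_model_request` is hypothetical, and the request is built but not sent, so nothing is assumed about the response format.

```python
import json
import urllib.request

API_URL = "https://api.scriptix.io/api/v3/custom_models"  # endpoint from the curl example


def build_custom_model_request(api_key: str, name: str, language: str,
                               base_model: str) -> urllib.request.Request:
    """Prepare (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"name": name, "language": language, "base_model": base_model})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_custom_model_request("YOUR_API_KEY", "Medical Dutch", "nl", "medical")
# To send it: urllib.request.urlopen(request)
```

Building the request separately from sending it makes the payload easy to inspect or log before any training job is created.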
Translation
Translate transcripts between supported languages:
curl -X POST https://api.scriptix.io/api/v3/documents/123/translate \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"target_language": "fr"}'
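When one transcript needs several translations, the request can be repeated once per target language. The sketch below only builds the URL and JSON body for each call. It assumes the endpoint lives on the same `api.scriptix.io` host as the other examples on this page, and `translation_requests` is an illustrative name rather than an SDK function.

```python
import json
from typing import Iterable, List, Tuple

# Assumed base URL, taken from the other examples on this page.
BASE_URL = "https://api.scriptix.io/api/v3"


def translation_requests(document_id: int,
                         targets: Iterable[str]) -> List[Tuple[str, str]]:
    """Return (url, json_body) pairs, one per target language."""
    url = f"{BASE_URL}/documents/{document_id}/translate"
    return [(url, json.dumps({"target_language": code})) for code in targets]


for url, body in translation_requests(123, ["fr", "de"]):
    print("POST", url, body)
```

Translation covers fewer languages than transcription (see Language Support by Feature), so a target code that is valid for transcription may still be rejected here.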
Best Practices
1. Always Specify Language
For best accuracy, specify the exact language:
# ✅ Specify the language explicitly
data = {'language': 'en'}
# ⚠️ Auto-detection (slower, less accurate)
data = {'language': 'auto'}
# In both cases the audio is uploaded separately via files={'audio_file': file}
2. Use Correct Dialect
For languages with dialects, use the closest code:
- English (US/UK/AU) → en
- Portuguese (BR/PT) → pt
- Spanish (ES/LATAM) → es
- Chinese (Simplified/Traditional) → zh
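Applications often store locale tags such as `en-US` or `pt_BR` rather than bare language codes. Given the mapping above, a tag can be reduced to its base code before it is sent. `base_language` is an illustrative helper, and it assumes the tag begins with a two-letter language subtag.

```python
def base_language(tag: str) -> str:
    """Reduce a locale tag such as 'en-US' or 'pt_BR' to its base language code."""
    # Treat '_' like '-', then keep only the part before the first separator.
    primary = tag.strip().replace("_", "-").split("-", 1)[0]
    return primary.lower()


for tag in ("en-US", "pt_BR", "es-419", "zh-Hant"):
    print(tag, "->", base_language(tag))
```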
3. Custom Models for Accents
For heavy accents or regional variations, train a custom model.
Language Support by Feature
| Feature | Supported Languages |
|---|---|
| Transcription | All 40+ languages |
| Custom Models | All 40+ languages |
| Glossaries | All 40+ languages |
| Translation | 30+ languages |
| Real-time | All 40+ languages |
| Speaker Diarization | All 40+ languages |
Accuracy by Language
Accuracy varies by language based on training data:
| Tier | Languages | WER Range |
|---|---|---|
| Tier 1 (Highest) | en, nl, de, fr, es | 5-10% |
| Tier 2 | it, pt, pl, ru, zh, ja | 8-15% |
| Tier 3 | Other languages | 10-20% |
WER = Word Error Rate (lower is better)
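To compare your own audio against these ranges, WER can be computed from a reference transcript and the API output. It is the number of word substitutions, deletions, and insertions divided by the number of words in the reference. The sketch below is a plain word-level edit distance. It splits on whitespace only, so casing and punctuation should be normalized first, and it is not necessarily how the figures in the table were measured.

```python
from typing import List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref: List[str] = reference.split()
    hyp: List[str] = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # `previous` is the edit-distance row for the preceding reference prefix;
    # it starts as the cost of inserting each hypothesis prefix from nothing.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current[j] = min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # match or substitution
            )
        previous = current
    return previous[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```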
Requesting New Languages
Don't see your language? Contact us:
- Email: support@scriptix.io
- Subject: "Language Request: [Language Name]"
Include:
- Language name and ISO code
- Use case description
- Expected monthly volume