Training Process
Learn how to train your custom language model with audio and transcript data.
Training Overview
Training teaches your custom model to recognize speech patterns and vocabulary specific to your domain. The process requires:
- Training Data - Audio and/or transcript files
- Data Upload - Adding files to the model
- Validation - System validates datasets
- Training Execution - Running the training process
Training Status States
Models progress through five training states:
1. Not Running (Zinc Badge)
What it means:
- Model created but no training started
- No datasets uploaded yet
What to do:
- Upload datasets (audio and/or transcripts)
- Status changes once datasets are uploaded
2. Ready to Run (Amber Badge)
What it means:
- Datasets have been uploaded
- Model ready to begin training
- Training queued to start
What appears:
- "Train the Language" button not shown
- Training will begin automatically when ready
3. Running (Sky Badge)
What it means:
- Training actively in progress
- System processing your data
- Model learning from datasets
What to do:
- Wait for training to complete
- Do not delete model during training
- Monitor status periodically
What appears:
- "Train the Language" button hidden
- Cannot modify model during training
4. Success (Emerald Badge)
What it means:
- Training completed successfully
- Model ready to use
- Can be selected for transcription
What to do:
- Use model in STT Session
- Select from language dropdown when creating transcripts
What appears:
- "Train the Language" button hidden
- Model available for use
5. Failed (Red Badge)
What it means:
- Training encountered an error
- Model not ready to use
What to do:
- Check error messages
- Upload new datasets
- Try training again
What appears:
- "Train the Language" button visible
- Can upload new datasets and retry
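The five states above can be summarized in a small lookup. This is only an illustrative sketch: the numeric codes and badge colors come from the headings above, while the names TrainingStatus, BADGE_COLOR, and is_usable_for_transcription are hypothetical and not part of the product.

```python
from enum import IntEnum


class TrainingStatus(IntEnum):
    """Training states, numbered as in the list above (hypothetical type)."""
    NOT_RUNNING = 1
    READY_TO_RUN = 2
    RUNNING = 3
    SUCCESS = 4
    FAILED = 5


# Badge color shown for each state, taken from the state headings.
BADGE_COLOR = {
    TrainingStatus.NOT_RUNNING: "zinc",
    TrainingStatus.READY_TO_RUN: "amber",
    TrainingStatus.RUNNING: "sky",
    TrainingStatus.SUCCESS: "emerald",
    TrainingStatus.FAILED: "red",
}


def is_usable_for_transcription(status: TrainingStatus) -> bool:
    """Only a successfully trained model can be selected for transcription."""
    return status == TrainingStatus.SUCCESS


if __name__ == "__main__":
    for status in TrainingStatus:
        print(status.value, status.name, BADGE_COLOR[status],
              is_usable_for_transcription(status))
```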
Uploading Datasets
Access Model Details
- Navigate to Custom Models page
- Click model name or Edit action
- Model details page opens
- Datasets section displays at bottom
Upload New Dataset
- Click "New Dataset" button
- Modal opens
- Select dataset type:
  - Auto-detect (default)
  - TRANSCRIPT (.vtt, .srt, .txt)
  - AUDIO (.wav, .mp3, .m4a, .flac)
- Upload files:
  - Click upload area or drag-and-drop
  - Select 1-10 files
  - Maximum 10GB per file
- Click "Create"
- Files upload and appear in datasets list
Dataset Types
TRANSCRIPT (1):
- Format: .vtt, .srt, .txt
- Text transcripts with or without timestamps
- Used for training text patterns
TEST (2):
- Format: .vtt, .srt, .txt
- Test transcripts for validation
- Verifies training quality
AUDIO (3):
- Format: .wav, .mp3, .m4a, .flac
- Audio files for training
- Used for acoustic model training
MANIFEST (4):
- Format: .jsonl
- Organizes datasets
- Links audio and transcripts
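The four dataset types and their file formats can be written down as a simple table. The sketch below is illustrative only: the type codes and extensions are the ones listed above, but DatasetType, ALLOWED_EXTENSIONS, and extension_matches are hypothetical names, and treating extensions as case-insensitive is an assumption.

```python
from enum import IntEnum
from pathlib import PurePath


class DatasetType(IntEnum):
    """Dataset types, numbered as in the list above (hypothetical type)."""
    TRANSCRIPT = 1
    TEST = 2
    AUDIO = 3
    MANIFEST = 4


# File extensions accepted for each dataset type, as documented above.
ALLOWED_EXTENSIONS = {
    DatasetType.TRANSCRIPT: {".vtt", ".srt", ".txt"},
    DatasetType.TEST: {".vtt", ".srt", ".txt"},
    DatasetType.AUDIO: {".wav", ".mp3", ".m4a", ".flac"},
    DatasetType.MANIFEST: {".jsonl"},
}


def extension_matches(filename: str, dataset_type: DatasetType) -> bool:
    """Check a file name against the formats listed for a dataset type.

    Assumption: the comparison ignores letter case (".WAV" == ".wav").
    """
    suffix = PurePath(filename).suffix.lower()
    return suffix in ALLOWED_EXTENSIONS[dataset_type]


if __name__ == "__main__":
    print(extension_matches("meeting.srt", DatasetType.TRANSCRIPT))
    print(extension_matches("meeting.srt", DatasetType.AUDIO))
```

Note that TRANSCRIPT and TEST share the same extensions, so the extension alone cannot tell them apart.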
Upload Validation
Files validated on upload:
- File count: 1-10 files per upload
- File size: Maximum 10GB per file
- Format validation: Must match selected type
- Auto-detection: Determines type from extension if "auto" selected
Validation errors:
- validation_failed - Datasets failed validation
- invalid_audio_format - Audio files invalid format
- invalid_transcript_format - Transcript files invalid format
- File size exceeds 10GB limit
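The upload checks above can be approximated on the client side before sending files. This is a minimal sketch under stated assumptions, not the system's actual validator: validate_upload and detect_type are hypothetical helpers, "10GB" is assumed to mean 10 * 1024^3 bytes, auto-detection is assumed to map transcript extensions to TRANSCRIPT, and the mapping from each failure to an error code is a guess based on the code names.

```python
import os

MAX_FILES = 10
MAX_FILE_BYTES = 10 * 1024**3  # assumption: "10GB" means 10 GiB

TRANSCRIPT_EXTS = {".vtt", ".srt", ".txt"}
AUDIO_EXTS = {".wav", ".mp3", ".m4a", ".flac"}


def detect_type(filename):
    """Guess the dataset type from the file extension, or None if unknown."""
    ext = os.path.splitext(filename)[1].lower()
    if ext in AUDIO_EXTS:
        return "AUDIO"
    if ext in TRANSCRIPT_EXTS:
        return "TRANSCRIPT"
    return None


def validate_upload(files, selected_type="auto"):
    """Return a list of error strings for (filename, size_bytes) pairs.

    An empty list means every documented check passed.
    """
    errors = []
    if not 1 <= len(files) <= MAX_FILES:
        errors.append("validation_failed")
    for name, size in files:
        if size > MAX_FILE_BYTES:
            errors.append("File size exceeds 10GB limit")
        detected = detect_type(name)
        if selected_type == "auto":
            if detected is None:
                errors.append("validation_failed")
        elif detected != selected_type:
            # Assumed mapping: the error code names the type that was expected.
            errors.append("invalid_audio_format" if selected_type == "AUDIO"
                          else "invalid_transcript_format")
    return errors


if __name__ == "__main__":
    print(validate_upload([("call.wav", 5_000_000)]))
    print(validate_upload([("notes.txt", 2_000)], selected_type="AUDIO"))
```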
Starting Training
Train Button Visibility
"Train the Language" button visibility depends on training status:
- Visible only when status is Not Running (1) or Failed (5)
- Hidden for Ready to Run (2), Running (3), and Success (4)
Button Location:
- Model details page header
- Next to "Update" button
- Only visible for applicable statuses
Training Process
- Upload datasets if not already uploaded
- Click "Train the Language" button (if status is 1 or 5)
- Confirmation dialog appears:
  - Title: "Train the language confirmation"
  - Message: "Are you sure you want to start the training process for [model name]? This process may take a long time to complete and it will use your credits, and you will not be able to modify the custom model until it is finished."
- Click "Train" to confirm
- Validation runs - System validates datasets
- Training starts if validation passes
- Status changes to "Running" (3)
- Success toast: "Training started successfully"
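The start-training steps above can be modeled as a small state transition. This is a sketch, not the product's implementation: start_training and train_button_visible are hypothetical names, the status codes and the success toast text come from this page, and the other messages, as well as leaving the status unchanged when validation fails, are assumptions.

```python
from enum import IntEnum


class TrainingStatus(IntEnum):
    NOT_RUNNING = 1
    READY_TO_RUN = 2
    RUNNING = 3
    SUCCESS = 4
    FAILED = 5


def train_button_visible(status: TrainingStatus) -> bool:
    """The button is shown only for Not Running (1) or Failed (5)."""
    return status in (TrainingStatus.NOT_RUNNING, TrainingStatus.FAILED)


def start_training(status, confirmed, datasets_valid):
    """Return (new_status, message) after a click on "Train the Language".

    confirmed: whether the user clicked "Train" in the confirmation dialog.
    datasets_valid: result of the dataset validation step.
    """
    if not train_button_visible(status):
        return status, "Training cannot be started from this status"  # hypothetical text
    if not confirmed:
        return status, "Cancelled"  # hypothetical text
    if not datasets_valid:
        # Assumption: a failed validation leaves the status unchanged.
        return status, "validation_failed"
    return TrainingStatus.RUNNING, "Training started successfully"


if __name__ == "__main__":
    print(start_training(TrainingStatus.NOT_RUNNING, True, True))
    print(start_training(TrainingStatus.SUCCESS, True, True))
```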