Training Process
Learn how to train your custom language model with audio and transcript data.
Training Overview
Training teaches your custom model to recognize speech patterns and vocabulary specific to your domain. The process requires:
- Training Data - Audio and/or transcript files
- Data Upload - Adding files to the model
- Validation - System validates datasets
- Training Execution - Running the training process
Training Status States
Models progress through five training states:
1. Not Running (Zinc Badge)
What it means:
- Model created but no training started
- No datasets uploaded yet
What to do:
- Upload datasets (audio and/or transcripts)
- Status will change when datasets uploaded
2. Ready to Run (Amber Badge)
What it means:
- Datasets have been uploaded
- Model ready to begin training
- Training queued to start
What appears:
- "Train the Language" button not shown
- Training will begin automatically when ready
3. Running (Sky Badge)
What it means:
- Training actively in progress
- System processing your data
- Model learning from datasets
What to do:
- Wait for training to complete
- Do not delete model during training
- Monitor status periodically
What appears:
- "Train the Language" button hidden
- Cannot modify model during training
4. Success (Emerald Badge)
What it means:
- Training completed successfully
- Model ready to use
- Can be selected for transcription
What to do:
- Use model in STT Session
- Select from language dropdown when creating transcripts
What appears:
- "Train the Language" button hidden
- Model available for use
5. Failed (Red Badge)
What it means:
- Training encountered an error
- Model not ready to use
What to do:
- Upload new datasets
- Try training again
- Check error messages
What appears:
- "Train the Language" button visible
- Can upload new datasets and retry
Uploading Datasets
Access Model Details
- Navigate to Custom Models page
- Click model name or Edit action
- Model details page opens
- Datasets section displays at bottom
Upload New Dataset
- Click "New Dataset" button
- Modal opens
- Select dataset type:
- Auto-detect (default)
- TRANSCRIPT (.vtt, .srt, .txt)
- AUDIO (.wav, .mp3, .m4a, .flac)
- Upload files:
- Click upload area or drag-and-drop
- Select 1-10 files
- Maximum 10GB per file
- Click "Create"
- Files upload and appear in datasets list
Dataset Types
TRANSCRIPT (1):
- Format: .vtt, .srt, .txt
- Text transcripts with or without timestamps
- Used for training text patterns
TEST (2):
- Format: .vtt, .srt, .txt
- Test transcripts for validation
- Verifies training quality
AUDIO (3):
- Format: .wav, .mp3, .m4a, .flac
- Audio files for training
- Used for acoustic model training
MANIFEST (4):
- Format: .jsonl
- Organizes datasets
- Links audio and transcripts
Upload Validation
Files validated on upload:
- File count: 1-10 files per upload
- File size: Maximum 10GB per file
- Format validation: Must match selected type
- Auto-detection: Determines type from extension if "auto" selected
Validation errors:
validation_failed- Datasets failed validationinvalid_audio_format- Audio files invalid formatinvalid_transcript_format- Transcript files invalid format- File size exceeds 10GB limit
Starting Training
Train Button Visibility
"Train the Language" button appears only when:
- Training status is Not Running (1) OR Failed (5)
- Button hidden for Ready to Run (2), Running (3), Success (4)
Button Location:
- Model details page header
- Next to "Update" button
- Only visible for applicable statuses
Training Process
- Upload datasets if not already uploaded
- Click "Train the Language" button (if status is 1 or 5)
- Confirmation dialog appears:
- Title: "Train the language confirmation"
- Message: "Are you sure you want to start the training process for [model name]? This process may take a long time to complete and it will use your credits, and you will not be able to modify the custom model until it is finished."
- Click "Train" to confirm
- Validation runs - System validates datasets
- Training starts if validation passes
- Status changes to "Running" (3)
- Success toast: "Training started successfully"
Training Requirements
Before training can start:
- Datasets uploaded to model
- Training credits available in organization
- Model validation passes
- Model not currently training
- Model status allows training (1 or 5)
Error Codes:
validation_failed- Datasets validation failedinvalid_audio_format- Invalid audio file formatinvalid_transcript_format- Invalid transcript formatno-claims-available- No training credits available
During Training
What Happens
The system:
- Validates all uploaded datasets
- Processes audio files
- Analyzes transcripts
- Trains the model
- Updates training status
Monitoring Progress
Status Badge:
- Shows "Running" (sky blue badge)
- Located in model information cards
What You Can Do:
- View model details
- Monitor training status
- Continue other work
- Cannot modify model during training
- Cannot delete model during training
What You Cannot Do:
- Start training again (button hidden)
- Upload new datasets (modal unavailable)
- Modify model name during training
- Delete model while training
Training Confirmation
Confirmation modal warns:
- Training may take a long time
- Will use training credits
- Cannot modify model until finished
- Requires user confirmation before starting
After Training
Successful Training (Status 4)
When training completes successfully:
Model Details Page:
- Status badge shows "Success" (emerald green)
- "Train the Language" button hidden
- Info banner shows success message with model name
- Model ready for use
Status Help Message:
- "Training complete! Select '[model name]' from the language dropdown when uploading files to use this model."
Next Steps:
- Navigate to workspace (Home page)
- Click "STT Session"
- Select language
- Select custom model from dropdown
- Upload audio
- Start transcription
Failed Training (Status 5)
If training fails:
Model Details Page:
- Status badge shows "Failed" (red)
- "Train the Language" button appears
- Can upload new datasets
- Can retry training
Status Help Message:
- "Upload datasets to get started. Add audio files and/or transcripts. Status will change to Ready to Run."
What to Do:
- Check error messages
- Review dataset quality
- Upload new/corrected datasets
- Click "Train the Language" to retry
- Confirm training start
Force Alignment for Training Data
Use Force Alignment to prepare training data:
What It Does:
- Adds timestamps to plain text transcripts
- Creates properly formatted training files
- Uses audio + existing transcript
How to Use:
- Navigate to workspace
- Click "STT Session"
- Select Force Alignment option
- Upload audio file
- Provide transcript text
- Process alignment
- Download result with timestamps
- Upload to custom model as training data
Dataset Management
View Datasets
Datasets List:
- Displayed on model details page
- Shows all uploaded datasets
- Columns: Name, Type, Duration, URL, Start Time, End Time
Dataset Types Shown:
- TRANSCRIPT
- TEST
- AUDIO
- MANIFEST
Delete Datasets
- Locate dataset in list
- Click three-dot menu on row
- Select "Delete"
- Confirmation dialog appears
- Confirm deletion
- Dataset removed from model
Dataset Deletion:
- Available for all datasets
- Permanent action
- Cannot undo
- Can delete during any training status
Training Credits
Training requires credits:
Credits System:
- Organization must have training credits
- Credits consumed when training starts
- Error if no credits available: "No training credits available"
Checking Credits:
- Organization settings or billing page
- Contact organization administrator
- Error message displays if insufficient
Info Banner Messages
Model details page shows status-specific help:
Status 1 (Not Running):
- "Upload your training datasets (audio + transcripts), then click 'Train the language' to start."
Status 2 (Ready to Run):
- "Training is queued and will begin shortly."
Status 3 (Running):
- "Training in progress. This may take several hours depending on dataset size."
Status 4 (Success):
- "Training complete! Select '[model name]' from the language dropdown when uploading files to use this model."
Status 5 (Failed):
- Same as Status 1 message
Banner Features:
- Located below model information cards
- Expands by default when status is Success (4)
- Collapsed by default for other statuses
- Includes link to Force Alignment feature
Limitations
Cannot Train When:
- Status is Ready to Run (2)
- Status is Running (3)
- Status is Success (4)
- No training credits available
- Datasets fail validation
Training Process:
- Cannot stop once started
- Cannot modify model during training
- Uses training credits
- Requires validation to pass
Button Visibility:
- Only visible for Not Running (1) or Failed (5)
- Hidden for all other statuses
Training complete! Your custom model is ready to improve transcription accuracy.
Next Steps
After successful training:
- Use in Transcription - Apply your trained model
- Test with sample audio
- Compare accuracy improvements
- Deploy to production use