Create Custom Model
Custom models improve transcription accuracy for specialized vocabularies, industry terms, or unique audio conditions.
What Are Custom Models?
Custom models are trained language models that enhance speech recognition:
- Built on base language models
- Trained with your specific audio and transcripts
- Improve accuracy for domain-specific terminology
- Require training data (audio files and/or transcripts)
- Must be trained before use
Requirements
To Create a Custom Model:
- Name for the model
- Base language selection (from trainable languages)
- Organization membership
To Train a Custom Model:
- Dataset files uploaded (audio and/or transcripts)
- Training credits available
- Model validation passes
Creating a New Custom Model
From Custom Models Page:
- Navigate to Custom Models page
- Click "New Custom Model" button
- Modal opens with creation form
- Enter model name (required)
- Select base language from dropdown (required)
- Only trainable languages shown
- Language determines base model used
- Click "Create"
- Model created and appears in list
After Creation:
- Model shows in custom models list
- Training status: "Not Running"
- Model cannot be used until trained
- Must upload datasets and train
Custom Model List
Columns Displayed:
| Column | Description | Info Tooltip |
|---|---|---|
| Name | Model name | - |
| Training Status | Current training state with badge | Training progress indicator |
| Base Language | Language name (e.g., "English") | Base language model used |
| Last Modified | Last update date | - |
Training Status Values:
- Not Running (gray badge) - Not started or failed
- Ready to Run (amber badge) - Datasets uploaded, ready to train
- Running (blue badge) - Currently training
- Success (green badge) - Training completed successfully
- Failed (red badge) - Training failed
Actions Available:
- Edit - Opens model details page
- Delete - Remove model (ORGADMIN or SYSOP only)
Pagination:
- 25 items per page (default)
- Options: 25, 50, 75, 100 per page
- Manual pagination controls at bottom
Model Details Page
Access model details by clicking model name or Edit action.
Page Sections:
Model Information Cards:
- Name - Editable input field
- Training Status - Badge with status
- Base Language Model - Language name (read-only)
- Last Modified - Date (read-only)
Header Actions:
- Train the Language - Starts training (only when status allows)
- Update - Saves name changes
Uploading Datasets
Datasets are training files used to improve the model.
Dataset Types:
- TRANSCRIPT - Text transcripts (.vtt, .srt, .txt)
- TEST - Test transcripts for validation (.vtt, .srt, .txt)
- AUDIO - Audio files (.wav, .mp3, .m4a, .flac)
- MANIFEST - Manifest files (.jsonl)
Supported File Formats:
- Audio: .wav, .mp3, .m4a, .flac
- Transcripts: .vtt, .srt, .txt
- Manifest: .jsonl
Upload New Dataset:
- Click "New Dataset" button on model details page
- Modal opens
- Select dataset type:
- Auto-detect (default)
- TRANSCRIPT
- AUDIO
- Upload files (1-10 files):
- Click upload area or drag-and-drop
- Select 1 to 10 files
- Files validated against selected type
- Maximum 10GB per file
- Click "Create"
- Files uploaded and added to datasets list
File Validation:
- Minimum: 1 file required
- Maximum: 10 files per upload
- File size limit: 10GB per file
- Format validation based on selected type
Auto-Detection:
- Detects type from file extension
- Uses first file's extension
- Falls back to TRANSCRIPT for unknown types
Dataset Management
Dataset List Columns:
- Name
- Type (TRANSCRIPT, TEST, AUDIO, MANIFEST)
- Duration (if available)
- URL
- Start Time
- End Time
Dataset Actions:
- Delete - Remove dataset file (via three-dot menu)
Pagination:
- 25 items per page (default)
- Options: 25, 50, 75, 100 per page
Training the Model
Training Process:
- Upload datasets (audio and/or transcripts)
- Check status - Model shows "Ready to Run" when datasets uploaded
- Click "Train the Language" button
- Confirmation dialog appears
- Click "Train" to confirm
- Validation runs - Model validates datasets
- Training starts - Status changes to "Running"
- Wait for completion - Status changes to "Success" or "Failed"
Training Status Messages:
Not Running:
- Upload datasets to get started
- Add audio files and/or transcripts
- Status will change to "Ready to Run"
Ready to Run:
- Datasets uploaded
- Click "Train the Language" to start
- Training credits required
Running:
- Training in progress
- Wait for completion
- Can take time depending on dataset size
Success:
- Training completed successfully
- Model ready to use
- Select model when creating transcripts
Failed:
- Training failed
- Check datasets and try again
- Upload new datasets if needed
Training Requirements:
- Training credits available in organization
- Valid datasets uploaded
- Datasets pass validation
- Cannot train while already training
- Cannot train if training succeeded (status 4)
Training Errors:
- validation_failed - Datasets failed validation
- invalid_audio_format - Audio files invalid
- invalid_transcript_format - Transcript files invalid
- no-claims-available - No training credits available
Using Trained Models
After training succeeds:
- Navigate to Home (workspace) page
- Click "STT Session" to create new transcript
- Select language from dropdown
- Select custom model from model dropdown
- Trained custom models appear in list
- Model name shown with base language
- Upload or select audio file
- Start transcription
- Model used for improved accuracy
Force Alignment:
- Use Force Alignment mode in STT Session
- Provide audio and existing transcript
- Custom model improves alignment accuracy
- Creates time-synced transcript
Updating Models
Update Model Name:
- Open model details page
- Edit name in Name field
- Click "Update" button
- Name saved
Language and Organization:
- Base language cannot be changed after creation
- Organization ID fixed at creation
- To change language, create new model
Deleting Models
Delete Custom Model:
- Locate model in custom models list
- Click three-dot menu
- Select "Delete"
- Confirmation dialog appears
- Confirm deletion
- Model permanently removed
Delete Restrictions:
- Only ORGADMIN or SYSOP can delete
- Delete action not available for other roles
- Deletion is permanent
Delete Datasets:
- Open model details page
- Locate dataset in datasets list
- Click three-dot menu on dataset row
- Select "Delete"
- Confirmation dialog appears
- Confirm deletion
- Dataset removed from model
Info Banner
Information banner on custom models page explains:
- What custom models are
- How they work with training data
- Link to Force Alignment feature for creating transcripts
- How they improve accuracy
- Instructions to select model after training
Info Banner Sections:
- What are custom models
- Easy setup with training data
- Use Force Alignment feature (link to Home page)
- Improves accuracy over time
- Select after training in STT Session
Getting Started Guide
First-time modal appears on first visit to custom models page.
Guide Sections:
- What are custom models - Explanation of functionality
- What you need - Requirements for training
- Only have audio files - Information about Force Alignment
- After training - How to use trained model
Access Guide:
- Click "Getting Started" button in header
- Modal opens with guide information
- Click "Got It" to close
Guide Tracking:
- Shown automatically on first visit
- Stored in localStorage:
customModelsGuideShown - Can reopen manually via button
Training Status Details
Banner on model details page shows current status and help:
Banner Contents:
- Title with current training status
- Help message appropriate for status
- Link to Force Alignment feature
- File requirements note
Banner Behavior:
- Always visible on model details page
- Expands by default when status is "Success" (4)
- Collapsed by default for other statuses
- Click to expand/collapse
Status-Specific Help:
- Status 1 (Not Running): Upload datasets to start
- Status 2 (Ready to Run): Click Train button
- Status 3 (Running): Wait for completion
- Status 4 (Success): Model ready to use
- Status 5 (Failed): Try again with new datasets
Best Practices
Creating Models:
- Use descriptive names
- Select correct base language
- One model per use case or domain
Uploading Datasets:
- Upload quality audio files
- Include accurate transcripts
- Use domain-specific content
- Upload multiple files for better training
- Stay within 10GB per file limit
Training:
- Upload all datasets before training
- Verify datasets are correct
- Ensure training credits available
- Wait for training to complete before using
Using Models:
- Select trained model in STT Session
- Use for appropriate language only
- Test accuracy improvements
- Retrain if needed with more data
Model created! Now proceed to upload training data and start the training process.
Next Steps
- Training Process - Learn how to train your model
- Use in Transcription - Apply your trained model