Create Custom Model

Custom models improve transcription accuracy for specialized vocabularies, industry terms, or unique audio conditions.

What Are Custom Models?

Custom models are trained language models that enhance speech recognition:

Built on base language models
Trained with your specific audio and transcripts
Improve accuracy for domain-specific terminology
Require training data (audio files and/or transcripts)
Must be trained before use

Requirements

To Create a Custom Model:

Name for the model
Base language selection (from trainable languages)
Organization membership

To Train a Custom Model:

Dataset files uploaded (audio and/or transcripts)
Training credits available
Model validation passes

Creating a New Custom Model

From Custom Models Page:

Navigate to Custom Models page
Click "New Custom Model" button
Modal opens with creation form
Enter model name (required)
Select base language from dropdown (required)
- Only trainable languages shown
- Language determines base model used
Click "Create"
Model created and appears in list

After Creation:

Model shows in custom models list
Training status: "Not Running"
Model cannot be used until trained
Must upload datasets and train

Custom Model List

Columns Displayed:

Column	Description	Info Tooltip
Name	Model name	-
Training Status	Current training state with badge	Training progress indicator
Base Language	Language name (e.g., "English")	Base language model used
Last Modified	Last update date	-

Training Status Values:

Not Running (gray badge) - Not started or failed
Ready to Run (amber badge) - Datasets uploaded, ready to train
Running (blue badge) - Currently training
Success (green badge) - Training completed successfully
Failed (red badge) - Training failed

Actions Available:

Edit - Opens model details page
Delete - Remove model (ORGADMIN or SYSOP only)

Pagination:

25 items per page (default)
Options: 25, 50, 75, 100 per page
Manual pagination controls at bottom

Model Details Page

Access model details by clicking model name or Edit action.

Page Sections:

Model Information Cards:

Name - Editable input field
Training Status - Badge with status
Base Language Model - Language name (read-only)
Last Modified - Date (read-only)

Header Actions:

Train the Language - Starts training (only when status allows)
Update - Saves name changes

Uploading Datasets

Datasets are training files used to improve the model.

Dataset Types:

TRANSCRIPT - Text transcripts (.vtt, .srt, .txt)
TEST - Test transcripts for validation (.vtt, .srt, .txt)
AUDIO - Audio files (.wav, .mp3, .m4a, .flac)
MANIFEST - Manifest files (.jsonl)

Supported File Formats:

Audio: .wav, .mp3, .m4a, .flac
Transcripts: .vtt, .srt, .txt
Manifest: .jsonl

Upload New Dataset:

Click "New Dataset" button on model details page
Modal opens
Select dataset type:
- Auto-detect (default)
- TRANSCRIPT
- AUDIO
Upload files (1-10 files):
- Click upload area or drag-and-drop
- Select 1 to 10 files
- Files validated against selected type
- Maximum 10GB per file
Click "Create"
Files uploaded and added to datasets list

File Validation:

Minimum: 1 file required
Maximum: 10 files per upload
File size limit: 10GB per file
Format validation based on selected type

Auto-Detection:

Detects type from file extension
Uses first file's extension
Falls back to TRANSCRIPT for unknown types

Dataset Management

Dataset List Columns:

Name
Type (TRANSCRIPT, TEST, AUDIO, MANIFEST)
Duration (if available)
URL
Start Time
End Time

Dataset Actions:

Delete - Remove dataset file (via three-dot menu)

Pagination:

25 items per page (default)
Options: 25, 50, 75, 100 per page

Training the Model

Training Process:

Upload datasets (audio and/or transcripts)
Check status - Model shows "Ready to Run" when datasets uploaded
Click "Train the Language" button
Confirmation dialog appears
Click "Train" to confirm
Validation runs - Model validates datasets
Training starts - Status changes to "Running"
Wait for completion - Status changes to "Success" or "Failed"

Training Status Messages:

Not Running:

Upload datasets to get started
Add audio files and/or transcripts
Status will change to "Ready to Run"

Ready to Run:

Datasets uploaded
Click "Train the Language" to start
Training credits required

Running:

Training in progress
Wait for completion
Can take time depending on dataset size

Success:

Training completed successfully
Model ready to use
Select model when creating transcripts

Failed:

Training failed
Check datasets and try again
Upload new datasets if needed

Training Requirements:

Training credits available in organization
Valid datasets uploaded
Datasets pass validation
Cannot train while already training
Cannot train if training succeeded (status 4)

Training Errors:

validation_failed - Datasets failed validation
invalid_audio_format - Audio files invalid
invalid_transcript_format - Transcript files invalid
no-claims-available - No training credits available

Using Trained Models

After training succeeds:

Navigate to Home (workspace) page
Click "STT Session" to create new transcript
Select language from dropdown
Select custom model from model dropdown
- Trained custom models appear in list
- Model name shown with base language
Upload or select audio file
Start transcription
Model used for improved accuracy

Force Alignment:

Use Force Alignment mode in STT Session
Provide audio and existing transcript
Custom model improves alignment accuracy
Creates time-synced transcript

Updating Models

Update Model Name:

Open model details page
Edit name in Name field
Click "Update" button
Name saved

Language and Organization:

Base language cannot be changed after creation
Organization ID fixed at creation
To change language, create new model

Deleting Models

Delete Custom Model:

Locate model in custom models list
Click three-dot menu
Select "Delete"
Confirmation dialog appears
Confirm deletion
Model permanently removed

Delete Restrictions:

Only ORGADMIN or SYSOP can delete
Delete action not available for other roles
Deletion is permanent

Delete Datasets:

Open model details page
Locate dataset in datasets list
Click three-dot menu on dataset row
Select "Delete"
Confirmation dialog appears
Confirm deletion
Dataset removed from model

Information banner on custom models page explains:

What custom models are
How they work with training data
Link to Force Alignment feature for creating transcripts
How they improve accuracy
Instructions to select model after training

Info Banner Sections:

What are custom models
Easy setup with training data
Use Force Alignment feature (link to Home page)
Improves accuracy over time
Select after training in STT Session

Getting Started Guide

First-time modal appears on first visit to custom models page.

Guide Sections:

What are custom models - Explanation of functionality
What you need - Requirements for training
Only have audio files - Information about Force Alignment
After training - How to use trained model

Access Guide:

Click "Getting Started" button in header
Modal opens with guide information
Click "Got It" to close

Guide Tracking:

Shown automatically on first visit
Stored in localStorage: customModelsGuideShown
Can reopen manually via button

Training Status Details

Banner on model details page shows current status and help:

Banner Contents:

Title with current training status
Help message appropriate for status
Link to Force Alignment feature
File requirements note

Banner Behavior:

Always visible on model details page
Expands by default when status is "Success" (4)
Collapsed by default for other statuses
Click to expand/collapse

Status-Specific Help:

Status 1 (Not Running): Upload datasets to start
Status 2 (Ready to Run): Click Train button
Status 3 (Running): Wait for completion
Status 4 (Success): Model ready to use
Status 5 (Failed): Try again with new datasets

Best Practices

Creating Models:

Use descriptive names
Select correct base language
One model per use case or domain

Uploading Datasets:

Upload quality audio files
Include accurate transcripts
Use domain-specific content
Upload multiple files for better training
Stay within 10GB per file limit

Training:

Upload all datasets before training
Verify datasets are correct
Ensure training credits available
Wait for training to complete before using

Using Models:

Select trained model in STT Session
Use for appropriate language only
Test accuracy improvements
Retrain if needed with more data

Model created! Now proceed to upload training data and start the training process.

Next Steps

Training Process - Learn how to train your model
Use in Transcription - Apply your trained model

What Are Custom Models?​

Requirements​

Creating a New Custom Model​

Custom Model List​

Model Details Page​

Uploading Datasets​

Dataset Management​

Training the Model​

Using Trained Models​

Updating Models​

Deleting Models​

Info Banner​

Getting Started Guide​

Training Status Details​

Best Practices​

Next Steps​