Getting Started with Transcription
This guide covers the essentials of creating transcripts from audio and video files.
What is Transcription?
Transcription converts spoken audio into written text using speech recognition technology, providing:
- Timestamped text - Words linked to their position in the audio
- Speaker identification - Automatic detection and labeling of different speakers (diarization)
- Editable transcripts - Powerful editor for reviewing and refining
- Multiple export formats - Various formats for different use cases
- Collaboration tools - Sharing and review workflows
Basic Workflow
- Upload - Upload media files or record directly
- Configure - Select language and transcription options
- Process - Automatic speech-to-text conversion
- Edit - Review and refine in the transcript editor
- Export - Download in your preferred format
Quick Start
Step 1: Upload Your File
- Click Transcript or Create
- Choose your upload method:
- Drag and drop a file into the upload area
- Click to browse and select from your computer
- Record using webcam, microphone, or screen capture
- OneDrive - Import from Microsoft OneDrive
- URL - Import from a public media URL
Supported formats:
- Audio: MP3, WAV, M4A, AAC, FLAC, OGG, WMA, AIFF
- Video: MP4, MOV, AVI, MKV, WebM, M4V, 3GP, FLV, WMV, TS
Step 2: Configure Settings
Language (Required)
- Select the spoken language from the dropdown
- Accurate language selection improves results
Diarization (Optional)
- Enable to automatically identify different speakers
- Speakers will be labeled as "Speaker 1", "Speaker 2", etc.
- You can rename speakers in the editor
Additional Options
- Keep source - Retain uploaded media file
- Folder - Organize into a specific folder
Step 3: Start Transcription
- Click Upload
- Your file enters the processing queue
- Monitor progress in the "Workspace" section
Step 4: Review & Edit
Once processing completes:
-
Navigate to your completed session
-
Click to open the transcript editor
-
The editor displays:
- Audio/video player
- Editable transcript text
- Speaker labels (if diarization was enabled)
- Timestamps
-
Edit the transcript:
- Click any text to edit
- Click timestamps to jump in audio
- Rename speakers by clicking their labels
- Changes auto-save
Step 5: Export
When ready:
- Click Export button
- Choose your format:
- DOCX - Microsoft Word
- TXT - Plain text
- PDF - Non-editable document
- JSON - Structured data
- CSV - Spreadsheet format
- HTML - Web format
- Configure export options (timestamps, speaker labels, etc.)
- Download your transcript
Understanding File Uploads
File Size Limits
Upload files up to 40 GB (audio or video).
Note: Depending on your plan, different limits may apply. Contact your administrator if you need to upload larger files.
How Uploads Work
Scriptix uses smart upload technology:
- Large files - Automatically broken into smaller chunks and uploaded piece by piece
- Interrupted uploads - Can resume from where they left off if your connection drops
- Progress tracking - See real-time progress as your file uploads
- Reliable delivery - Automatic retries ensure your files arrive safely
Transcription Settings Explained
Language Selection
Selecting the correct language is crucial for accuracy.
How to choose:
- Select the language dropdown during upload
- Choose the language that matches the spoken audio
- If you're unsure, check the available languages in the dropdown menu
Tip: The more accurately you select the language, the better your transcription results will be.
Speaker Diarization
Diarization automatically identifies and separates different speakers in your audio.
How it works:
- Enable "diarization" checkbox during upload
- The system analyzes voice characteristics
- Speakers are labeled sequentially (Speaker 1, Speaker 2, etc.)
- You can rename speakers in the editor
When to use:
- Interviews and conversations
- Panel discussions
- Meetings with multiple participants
- Podcasts with hosts and guests
Best results:
- Clear audio with distinct voices
- Minimal speaker overlap
- Good microphone quality