Performance Tips

This guide outlines best practices for optimizing the responsiveness and transcription quality when using the Scriptix Real-time API.

Performance is influenced by how you encode and stream audio. The size, frequency, and format of your audio chunks all affect both latency and accuracy.

Recommended Chunk Size

To achieve the best balance between speed and quality:

Audio Format	Recommended Chunk Size	Reason
PCM 16kHz (default)	8 KB – 64 KB	Maintains low latency while preserving context
PCM 8kHz (call center)	Adjust accordingly	Smaller bandwidth, slower response—requires tuning

With 16kHz 16-bit PCM audio, 1 second of audio ≈ 32 KB of data. So, sending 256 ms of audio ≈ 8 KB.

Chunk Size vs. Performance

Chunk Size	Latency	Accuracy
~4 KB	Very fast	May reduce contextual quality
8–32 KB	Fast	Good balance
64 KB+	Slower	High contextual accuracy

Tip: Test with your actual audio source. Some streams benefit more from context than others.

Why Size Matters

Smaller chunks result in:

Faster responses
Less contextual information for the model

Larger chunks result in:

Slower responses
More accurate transcriptions due to richer context

Special Case: 8kHz Audio Models

Scriptix offers 8kHz private models for specific use cases like call center transcriptions.

If you're using an 8kHz model:

Use 16-bit little-endian PCM audio
Adjust chunk size to match the lower sample rate (e.g., 1 second ≈ 16 KB)
Expect slightly higher latency but optimized for narrow-band audio

Contact Scriptix support if you're interested in using 8kHz models.

Final Best Practices

Stream regularly – Avoid sending large bursts or long gaps
Maintain audio rate – Consistent format = consistent results
Monitor round-trip time (RTT) – Latency spikes may indicate buffer or network issues
Test different chunk sizes – Depending on your use case, smaller or larger blocks may yield better trade-offs

Audio Encoding – Accepted formats and conversion tips
WebSocket Connection – Streaming setup and lifecycle
Protocol – How transcript results are returned

Recommended Chunk Size​

Chunk Size vs. Performance​

Why Size Matters​

Special Case: 8kHz Audio Models​

Final Best Practices​

Related Topics​

Recommended Chunk Size

Chunk Size vs. Performance

Why Size Matters

Special Case: 8kHz Audio Models

Final Best Practices

Related Topics