Process Pipeline
All-in-one audio processing pipeline. Select steps via boolean flags. Returns a download URL for the processed audio.
Authorizations
Bearer token (API key). Format: Bearer {your_api_key}
Body
The all-in-one audio processing pipeline request payload.
Audio file URL (http/https).
"https://example.com/audio.mp3"
Enable Speech Enhancement step (noise reduction).
Enable Remove Mouth Sounds step.
Enable Remove Long Silences step.
Enable Super Resolution step (8 kHz to 48 kHz).
Enable Filler Words Removal step.
Enable Stuttering Removal step.
Enable Audio Normalization step.
Speech enhancement model: MossFormer2_SE_48K, FRCRN_SE_16K, or MossFormerGAN_SE_16K.
Enable VAD preprocessing for speech enhancement.
Silence detection threshold in dBFS.
Silences longer than this value (ms) are trimmed.
Silence padding to preserve at boundaries (ms).
Use Whisper for filler word detection.
Language code for filler word detection (e.g. en, zh).
Comma-separated filler words to remove. Empty = built-in defaults.
Minimum similarity for stutter detection.
0.5 <= x <= 1Normalization method: peak, rms, or both.
Target peak level in dBFS.
Target RMS level in dBFS.
Output format: wav, mp3, flac, or m4a.
Response
Processed audio download URL returned.
The response is of type object.