Speech Enhancement
Reduce background noise and enhance speech quality. Supports WAV/MP3/FLAC/AAC/OGG/OPUS/M4A/WEBM.
Authorizations
Bearer token (API key). Format: Bearer {your_api_key}
Body
The speech enhancement request payload.
Audio file URL (http/https). Supports WAV/MP3/FLAC/AAC/OGG/OPUS/M4A/WEBM.
"https://example.com/audio.mp3"
Enhancement model. Supported: MossFormer2_SE_48K, FRCRN_SE_16K, MossFormerGAN_SE_16K.
"MossFormer2_SE_48K"
Enable VAD (Voice Activity Detection) preprocessing.
Enable audio normalization after enhancement.
Normalization method: peak, rms, or both.
Target peak level in dBFS.
Target RMS level in dBFS.
Output format: wav, mp3, flac, or m4a.
Response
Enhanced audio returned.
The response is of type object.