Filler Words — Detect
Detect filler words (um, uh, er, hmm) in audio and return their timestamps.
Authorizations
Bearer token (API key). Format: Bearer {your_api_key}
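As an illustration of the format above, here is a minimal Python sketch that builds the request headers. The helper name `auth_headers` is hypothetical, and the `Content-Type` value assumes the body is sent as JSON, which this page does not state.

```python
def auth_headers(api_key: str) -> dict[str, str]:
    """Build request headers carrying the API key as a Bearer token."""
    if not api_key or api_key != api_key.strip():
        raise ValueError("api_key must be non-empty with no surrounding whitespace")
    return {
        # Format from this page: Bearer {your_api_key}
        "Authorization": f"Bearer {api_key}",
        # Assumption: the request body is JSON.
        "Content-Type": "application/json",
    }
```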
Body
Request payload for filler word detection or removal.
Audio URL (http or https). Example: "https://example.com/audio.mp3"
Output format: wav, mp3, flac, or m4a.
Set use_whisper to true to use Whisper ASR for accurate, language-aware filler word detection.
Whisper model size: tiny, base, small, or medium. Used when use_whisper is true.
Language code for Whisper (e.g. en, zh, ja, fr). Used when use_whisper is true.
Comma-separated filler words to detect/remove. Empty = built-in defaults.
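The fields above could be assembled into a request body along these lines. This is only a sketch: `use_whisper` is the one key name this page confirms, and every other key (`audio_url`, `output_format`, `whisper_model`, `language`, `filler_words`) is a hypothetical placeholder to be replaced with the real field names. Omitting the filler word list to get the built-in defaults is also an assumption.

```python
import json
from urllib.parse import urlparse

OUTPUT_FORMATS = {"wav", "mp3", "flac", "m4a"}
WHISPER_MODELS = {"tiny", "base", "small", "medium"}


def build_payload(audio_url, output_format=None, use_whisper=None,
                  whisper_model=None, language=None, filler_words=None):
    """Assemble a request body. Key names other than use_whisper are hypothetical."""
    if urlparse(audio_url).scheme not in ("http", "https"):
        raise ValueError("audio URL must use http or https")
    payload = {"audio_url": audio_url}  # hypothetical key name

    if output_format is not None:
        if output_format not in OUTPUT_FORMATS:
            raise ValueError(f"unsupported output format: {output_format}")
        payload["output_format"] = output_format  # hypothetical key name

    if use_whisper is not None:
        payload["use_whisper"] = bool(use_whisper)

    if whisper_model is not None:
        if whisper_model not in WHISPER_MODELS:
            raise ValueError(f"unsupported Whisper model: {whisper_model}")
        payload["whisper_model"] = whisper_model  # hypothetical key name

    if language is not None:
        payload["language"] = language  # hypothetical key name

    if filler_words:
        # The API expects a comma-separated string; an empty list is
        # omitted here on the assumption that absence means defaults.
        payload["filler_words"] = ",".join(w.strip() for w in filler_words)

    return payload


body = json.dumps(build_payload(
    "https://example.com/audio.mp3",
    use_whisper=True,
    whisper_model="small",
    language="en",
    filler_words=["um", "uh"],
))
```

Validating the enumerated values on the client side catches a misspelled format or model size before the request is sent.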
Response
An object containing the timestamps of the detected filler words.