POST /v1/audio/speech-synthesis
Text to Speech
curl --request POST \
  --url https://apis.finevoice.ai/v1/audio/speech-synthesis \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "voice": "james",
  "text": "[happy] Hello! Welcome to FineVoice."
}
'
{
  "status": 123,
  "url": "<string>",
  "taskId": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "error": {
    "code": "<string>",
    "message": "<string>"
  },
  "urls": [
    "<string>"
  ],
  "service": "<string>",
  "port": "<string>",
  "timestamp": "<string>"
}

Authorizations

Authorization
string
header
required

Bearer token (API key). Format: Bearer {your_api_key}

Body

application/json

The text-to-speech request payload.

voice
string

The target voice model name. Retrieve available voices from the List AI Voices API.

Example:

"james"

text
string

The text content to synthesize. Supports emotion tags such as [happy], [sad], [breathe].

Example:

"[happy] Hello! Welcome to FineVoice."
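
The request described above can also be built outside of curl. The following is a minimal sketch using only the Python standard library; the function names `build_tts_request` and `synthesize` are hypothetical helpers introduced for illustration, not part of any FineVoice SDK. The URL, headers, and body fields are taken from the curl example.

```python
import json
import urllib.request

# Endpoint as shown in the curl example.
API_URL = "https://apis.finevoice.ai/v1/audio/speech-synthesis"


def build_tts_request(api_key: str, voice: str, text: str) -> urllib.request.Request:
    """Build (but do not send) the POST request. Hypothetical helper."""
    payload = json.dumps({"voice": voice, "text": text}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        method="POST",
        headers={
            # Format documented above: Bearer {your_api_key}
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def synthesize(api_key: str, voice: str, text: str, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON body. Hypothetical helper."""
    request = build_tts_request(api_key, voice, text)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Splitting request construction from sending keeps the payload and headers inspectable without a network call. A call might look like `synthesize(api_key, "james", "[happy] Hello! Welcome to FineVoice.")`.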

Response

Task accepted. The response contains either a taskId for asynchronous polling or the result URL directly.

Standard response for audio processing tasks.
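
Because the same response shape can carry a finished result, a pending task, or an error, a client usually needs to branch on it. The sketch below shows one way to do that with the fields documented for this response. It assumes that `error` is absent or empty on success and that a pending task carries a `taskId` but no URL; neither assumption is confirmed by this page, and `interpret_response` is a hypothetical helper.

```python
from typing import Any, Tuple


def interpret_response(body: dict) -> Tuple[str, Any]:
    """Classify a response body. Hypothetical helper.

    Returns ("error", message), ("done", list_of_urls) or ("pending", task_id).
    """
    error = body.get("error")
    if error:  # assumption: absent or empty when the call succeeded
        return ("error", f"{error.get('code')}: {error.get('message')}")

    # Collect the single url first, then any additional urls.
    urls = list(body.get("urls") or [])
    if body.get("url"):
        urls.insert(0, body["url"])
    if urls:
        return ("done", urls)

    # Assumption: no URL yet means the task is still running.
    if body.get("taskId"):
        return ("pending", body["taskId"])

    return ("error", "response has neither url nor taskId")
```

The check relies on which fields are present rather than on the exact `status` value, which keeps the sketch independent of status codes not listed here.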

status
integer<int32>

HTTP-style status code (200 for success, 202 for in-progress).

url
string

Download URL of the generated audio file (available when completed).

taskId
string

Task identifier for async polling. Use with GET /v1/task/{task_id}.

Example:

"a1b2c3d4-e5f6-7890-abcd-ef1234567890"

error
object

Error details, with string fields code and message.

urls
string[]

Multiple output URLs (e.g. for separation stems).

service
string

port
string

timestamp
string
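
When the response carries a taskId instead of a URL, the task can be polled with GET /v1/task/{task_id}. The following is a rough sketch of such a loop. It assumes the task endpoint lives on the same host as the synthesis endpoint, accepts the same Bearer token, and returns the same response shape; none of this is stated on this page. `fetch_task` and `wait_for_audio` are hypothetical names, and the interval and attempt limit are arbitrary.

```python
import json
import time
import urllib.request
from typing import Callable

# Assumption: same host as the synthesis endpoint.
TASK_URL = "https://apis.finevoice.ai/v1/task/{task_id}"


def fetch_task(api_key: str, task_id: str) -> dict:
    """GET the task once and return the decoded JSON body. Hypothetical helper."""
    request = urllib.request.Request(
        TASK_URL.format(task_id=task_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def wait_for_audio(
    fetch: Callable[[], dict],
    interval: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch() until the body contains a url, then return it."""
    for attempt in range(max_attempts):
        body = fetch()
        error = body.get("error")
        if error:  # assumption: error is absent or empty unless the task failed
            raise RuntimeError(f"{error.get('code')}: {error.get('message')}")
        if body.get("url"):
            return body["url"]
        if attempt < max_attempts - 1:
            sleep(interval)  # wait before asking again
    raise TimeoutError(f"no url after {max_attempts} attempts")
```

A call might look like `wait_for_audio(lambda: fetch_task(api_key, task_id))`. Passing the fetch function in as a parameter makes the loop easy to exercise with canned responses.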