For developers looking to integrate Deepgram's text-to-speech (TTS) capabilities within applications, it can be essential to programmatically list all available voice models. Deepgram provides API endpoints to meet this requirement, allowing you to retrieve metadata about public and project-specific voice models.
Deepgram offers two primary API endpoints to access information about available TTS voice models:
Public Models Endpoint
https://developers.deepgram.com/reference/get-models
Project-Specific Models Endpoint
https://developers.deepgram.com/reference/get-project-models
The response from these endpoints includes a tts
array with model metadata. Below is a sample structure of the data you will receive:
{
"name": "angus",
"canonical_name": "aura-angus-en",
"architecture": "aura",
"languages": [
"en",
"en-IE"
],
"version": "2024-11-19.0",
"uuid": "11111111-2222-3333-4444-555555555555",
"metadata": {
"accent": "Irish",
"color": "#BA80F5",
"image": "https://static.deepgram.com/examples/avatars/angus.jpg",
"sample": "https://static.deepgram.com/examples/voices/angus.wav",
"tags": [
"masculine"
]
}
}
This payload provides essential information such as the model's name, languages supported, version, UUID, and additional metadata like accent, image, and sample audio.
To integrate this functionality into your dashboard or any application:
Note: Authentication and proper request headers will be necessary to access these endpoints.
Leveraging these endpoints allows developers to offer dynamic voice model selection options within applications, enabling tailored TTS experiences. For assistance or persistent issues, please reach out to our community for support.