When using Deepgram's transcription service, some users have encountered an issue where random words in the transcription output have their end
field set to -1. This can be problematic, especially when calculating timing data based on transcription results.
In the output transcription data, each word typically includes start and end timestamps. When a word's end
timestamp is -1, it suggests a potential data processing error or misconfiguration in the API request.
Example Data
[
{
"word": "Hello",
"start": 10.809999,
"end": -1,
"confidence": 0.99121094,
"speaker": 0,
"punctuated_word": "Hello"
}
]
While the -1 end
field might seem unrelated to specific API settings like keywords, it's critical to ensure correct API request formatting and parameters. For instance, misconfigured query parameters, particularly keywords, could contribute indirectly to transcription anomalies.
When including keywords in your API request, ensure they are URL-encoded properly:
?keywords=Santa%20Ana:0.25
?keywords=Santa%20Ana:0.25&keywords=Shokes:0.25
Refer to Deepgram's documentation on keywords for further details.
encoding
(e.g., linear16
) and sample_rate
(e.g., 48000
).keywords
, are correctly formatted and encoded.Facing such issues could be indicative of parameter misconfigurations or internal processing errors. However, ensuring proper formatting and configuration of API requests, especially with non-trivial parameters like keywords
, can mitigate many common issues.
If issues persist or the system behavior seems inconsistent, reach out to your Deepgram support representative (if you have one) or visit our community for assistance: Deepgram Community