Speech Recognition for short audio files (sync)
1. Introduction
This speech recognition API converts short, user-supplied speech data (30 seconds or less) into text. It operates synchronously: the transcript is returned in the response to the same request. You can use it to build applications or an automatic speech-to-text conversion service on top of the text obtained from speech data.
WARNING
⚠ Songs or audio with loud background music are not supported.
2. Example
Below is an example of how to use the API. In this example, an audio file called 'sample.wav' is sent to the API, which responds with the transcribed text.
POST
curl 'https://apis.daglo.ai/stt/v1/sync/transcripts' \
--header 'Authorization: Bearer <API_TOKEN>' \
--form 'file=@"sample.wav"'
{
  "rid": "12345678-abcd-efgh-1234-abcdefghijkl",
  "sttResult": {
    "transcript": "Hello, I'm Action Power. See the art of action power, the leader in speech recognition."
  }
}
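The same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names (`build_multipart`, `extract_transcript`, `transcribe`) are hypothetical, and sending the file part as `application/octet-stream` is an assumption about what the server accepts. The endpoint, the `file` form field, the Bearer header, and the `sttResult.transcript` field are taken from the example above.

```python
import json
import os
import urllib.request
import uuid

# Endpoint as given in this document.
API_URL = "https://apis.daglo.ai/stt/v1/sync/transcripts"


def build_multipart(field_name, file_name, payload):
    """Encode a single file as multipart/form-data; returns (body, content_type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{file_name}"\r\n'
        # Assumption: a generic content type is accepted for the file part.
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def extract_transcript(response_text):
    """Read sttResult.transcript from a response shaped like the example above."""
    return json.loads(response_text)["sttResult"]["transcript"]


def transcribe(path, api_token):
    """Send the file at `path` to the API and return the transcript (network call)."""
    with open(path, "rb") as f:
        body, content_type = build_multipart("file", os.path.basename(path), f.read())
    request = urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": content_type,
        },
    )
    with urllib.request.urlopen(request) as response:
        if response.status == 204:
            # Transcription finished but produced no text.
            return ""
        return extract_transcript(response.read().decode("utf-8"))
```

Calling `transcribe("sample.wav", "<API_TOKEN>")` would perform the request. Note that `urllib` raises `urllib.error.HTTPError` for 4xx and 5xx responses, so callers may want to catch it.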
3. Instructions
1) Getting an API Key
- Create an account in the API console.
- Go to the token menu and issue a new token.
- Copy the issued token and send it as the Bearer token in the Authorization header of each request.
2) Sending a request
- Send a request to the specified endpoint with the required parameters.
- For more detailed API information, please refer to the API Reference.
a. POST
Transcribe short audio
- Endpoint
POST https://apis.daglo.ai/stt/v1/sync/transcripts
3) Verifying audio format is supported
- Audio duration: 30 seconds or less
- Supported File Formats
🔊 audio
.3gp, .3gpp, .ac3, .aac, .aiff, .amr, .au, .flac, .m4a, .mp3, .mxf, .opus, .ra, .wav, .weba
📹 video
.asx, .avi, .ogm, .ogv, .m4v, .mov, .mp4, .mpeg, .mpg, .wmv
WARNING
⚠ Even if the file extension is supported, transcription may fail if the actual content (encoding) does not match that format.
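A client can check these constraints locally before uploading. The following is a rough sketch under stated assumptions: the function names are hypothetical, the extension sets are copied from the lists above, and the duration check only handles WAV files (via Python's `wave` module). As the warning notes, a matching extension does not guarantee that the encoding is accepted, so this check can only rule files out, not confirm them.

```python
import os
import wave

# Extensions copied from the supported-format lists in this document.
AUDIO_EXTENSIONS = {
    ".3gp", ".3gpp", ".ac3", ".aac", ".aiff", ".amr", ".au", ".flac",
    ".m4a", ".mp3", ".mxf", ".opus", ".ra", ".wav", ".weba",
}
VIDEO_EXTENSIONS = {
    ".asx", ".avi", ".ogm", ".ogv", ".m4v", ".mov", ".mp4", ".mpeg",
    ".mpg", ".wmv",
}
MAX_SECONDS = 30.0  # "Audio duration: 30 seconds or less"


def has_supported_extension(file_name):
    """Return True if the file extension appears in either supported list."""
    ext = os.path.splitext(file_name)[1].lower()
    return ext in AUDIO_EXTENSIONS or ext in VIDEO_EXTENSIONS


def wav_duration_seconds(wav_file):
    """Duration of a WAV file (path or binary file object) in seconds."""
    with wave.open(wav_file, "rb") as w:
        return w.getnframes() / w.getframerate()


def is_short_enough(seconds):
    """Return True if the duration is within the 30-second limit."""
    return seconds <= MAX_SECONDS
```

For example, `has_supported_extension("sample.wav") and is_short_enough(wav_duration_seconds("sample.wav"))` could gate the upload. Other formats would need a different way to measure duration.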
4) Getting a response
- Success
200
: OK.
204
: No Content. The request was successful, but no result is returned.
- For transcription, this means transcription is complete, but the result is empty.
- Error
400
: Bad Request. Invalid format.
401
: Unauthorized.
403
: Forbidden. Unauthorized access.
413
: Payload Too Large. Request too large.
415
: Unsupported Media Type.
429
: Too Many Requests.
500
: Internal Server Error.
503
: We are processing so many requests that we are temporarily unable to respond. Please try again in a moment.
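One way a client might act on these codes is sketched below. The function names are hypothetical. The document only asks callers to retry on 503; treating 429 as retryable and using an exponential backoff schedule are assumptions of this sketch, not documented API behavior.

```python
import time

# Assumption: 429 is retried in the same way as 503.
RETRYABLE_CODES = {429, 503}


def classify_status(code):
    """Map a response code to a coarse client-side action."""
    if code == 200:
        return "transcript"  # result is in the response body
    if code == 204:
        return "empty"       # transcription finished with no text
    if code in RETRYABLE_CODES:
        return "retry"
    return "error"           # 400, 401, 403, 413, 415, 500, ...


def backoff_delays(count, base_seconds=1.0):
    """Exponential delays: base, 2*base, 4*base, ... (an assumed schedule)."""
    return [base_seconds * 2 ** i for i in range(count)]


def call_with_retry(send, attempts=3, base_seconds=1.0, sleep=time.sleep):
    """Call send() -> (status_code, body), retrying retryable codes.

    Makes at most `attempts` calls (attempts >= 1) and returns the last result.
    """
    delays = backoff_delays(attempts - 1, base_seconds)
    for attempt in range(attempts):
        status, body = send()
        is_last = attempt == attempts - 1
        if classify_status(status) != "retry" or is_last:
            return status, body
        sleep(delays[attempt])
```

Here `send` is any callable that performs the request and returns the status code together with the body, which keeps the retry logic independent of the HTTP library in use.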
Update history
- 20240902 ver1.0 API document has been created.