Skip to content

Speech Recognition for short audio files (sync)

1. Introduction

A speech recognition API that converts user-supplied short speech data (less than 30 seconds) into text. This API operates in a synchronous manner, which allows the development of applications or the establishment of an automatic speech-to-text conversion service using text obtained from speech data.

WARNING

⚠ Songs or audio with loud background music are not supported.

2. Example

Below is an example of how to use the API. In this example, a voice file called 'sample.wav' is sent to the API, which responds by converting the voice into text.

POST

shell
curl 'https://apis.daglo.ai/stt/v1/sync/transcripts>' \
  --header 'Authorization: Bearer <API_TOKEN>' \
  --form 'file=@"sample.wav"'
text
{
    "rid": "12345678-abcd-efgh-1234-abcdefghijkl",
    "sttResult": {
        "transcript": "Hello, I'm Action Power. See the art of action power, the leader in speech recognition."
    }
}

3. Instructions

1) Getting an API Key

  • Create an account in the API console.
  • Go to the token menu and issue a new token.
  • Copy the issued token information and use it as an authentication token when requested.

2) Sending a request

  • Send a request to the specified endpoint with the required parameters.
  • For more detailed API information, please refer to the API Reference .

a. POST Transcribe short audio

  • Endpoint
shell
POST https://apis.daglo.ai/stt/v1/sync/transcripts

3) Verifying audio format is supported

  • Audio duration: 30 seconds or less
  • Supported File Formats

🔊 audio

.3gp, .3gpp .ac3, .aac, .aiff, .amr, .au, .flac, .m4a, .mp3, .mxf, .opus, .ra, .wav, .weba

📹 video

.asx, .avi, .ogm, .ogv, .m4v, .mov, .mp4, .mpeg, .mpg, .wmv

WARNING

⚠ Even if the format is the same, transcription may not proceed if the actual content (encoding) is different.

4) Getting a response

  • Success
View response code
  • 200
  • 204
    • No Content. The request was successful, but no result is returned.
    • For transcription, transcription is complete, but the result is empty. :::
  • Error
View response code
  • 400: Bad Request. Invalid format.
  • 401: Unauthorized.
  • 403: Forbidden. Unauthorized access.
  • 413: Payload Too Large. Request too large.
  • 415: Unsupported Media Type.
  • 429: Too Many Requests.
  • 500: Internal Server Error.
  • 503: We are processing so many requests that we are temporarily unable to respond. Please try again in a moment.

Update history

  • 20240902 ver1.0 API document has been created.