You are viewing a section of Superface that will be deprecated in Q4 2023. Until then, you can continue to use it as before, but we recommend migrating to our latest version soon. For more details, visit superface.ai/docs/migrate.

Speech to text conversion

speech/recognize@1.0.0

4 providers

Speech recognition

Real-time speech recognition.

Input

Audio content

Language code

Audio encoding

Maximum alternatives

Result

Results

1. Choose a provider

Mock implementation
assemblyai
azure-speech-to-text
google-apis-speech
ibm-cloud-speech-to-text

2. Use Recognize with mock in your code

The instructions below are for our Node.js SDK. Use OneService for other languages.

npm i @superfaceai/one-sdk
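
Once the package is installed, calling the use case might look like the sketch below. It is written against a small structural interface (ClientLike, a hypothetical name) instead of importing the SDK, so it can be tried with a stub. The assumptions are that SuperfaceClient from @superfaceai/one-sdk exposes getProfile, getUseCase, perform and unwrap in roughly this shape, and that getProfile accepts the profile ID together with its version; check both against the SDK version you install.

```typescript
// Minimal structural types for the parts of the client used here.
// Assumption: SuperfaceClient from @superfaceai/one-sdk fits this shape
// (a cast may be needed, depending on the SDK version).
interface PerformResultLike {
  unwrap(): unknown; // returns the result value, or throws if the call failed
}
interface UseCaseLike {
  perform(
    input: Record<string, unknown>,
    options?: { provider?: string }
  ): Promise<PerformResultLike>;
}
interface ProfileLike {
  getUseCase(name: string): UseCaseLike;
}
interface ClientLike {
  getProfile(profileId: string): Promise<ProfileLike>;
}

// Runs the Recognize use case of speech/recognize@1.0.0 with the mock provider.
async function recognizeWithMock(
  client: ClientLike,
  input: Record<string, unknown>
): Promise<unknown> {
  const profile = await client.getProfile("speech/recognize@1.0.0");
  const result = await profile
    .getUseCase("Recognize")
    .perform(input, { provider: "mock" });
  return result.unwrap();
}

// With the real SDK (not executed in this sketch):
//   import { SuperfaceClient } from "@superfaceai/one-sdk";
//   const data = await recognizeWithMock(new SuperfaceClient(), {
//     audioContent: audioBytes,   // placeholder value
//     languageCode: "en-US",
//   });
```

Passing the client in as a parameter keeps the call site easy to test; swapping "mock" for one of the other provider names is the only change the sketch would need to target a real provider, assuming that provider is configured.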


Structure details

Input (object)

audioContent

required

Audio data in the encoding specified by the audioEncoding input parameter.

languageCode

required

The language (and potentially also the region) of the speech expressed as a BCP-47 language tag, e.g. 'en-US'.

string

audioEncoding

optional

Encoding of audio data sent. This input is optional for WAV audio files and required for other audio formats.

enum

wav

maxAlternatives

optional

Maximum number of recognition hypotheses to be returned. The server may return fewer than maxAlternatives. Valid values are 0-30. Default value is 1.

number

Example
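
A minimal sketch of an input object and the checks the field descriptions above imply. The RecognizeInput type and the validateInput and looksLikeWav helpers are hypothetical names, not part of the SDK, and representing audioContent as raw bytes is an assumption, since the field's type is not listed here. WAV detection relies on the standard RIFF/WAVE file header.

```typescript
// Hypothetical type mirroring the documented Input fields.
interface RecognizeInput {
  audioContent: Uint8Array; // assumption: raw audio bytes
  languageCode: string; // BCP-47 tag, e.g. "en-US"
  audioEncoding?: string; // optional for WAV, required for other formats
  maxAlternatives?: number; // 0-30, default 1
}

// WAV files start with "RIFF" and carry "WAVE" at byte offset 8.
function looksLikeWav(bytes: Uint8Array): boolean {
  if (bytes.length < 12) return false;
  const tag = (start: number): string => {
    let s = "";
    for (let i = start; i < start + 4; i++) s += String.fromCharCode(bytes[i]);
    return s;
  };
  return tag(0) === "RIFF" && tag(8) === "WAVE";
}

// Returns a list of problems; an empty list means the input passed these checks.
function validateInput(input: RecognizeInput): string[] {
  const problems: string[] = [];
  if (input.audioContent.length === 0) {
    problems.push("audioContent is required");
  }
  if (input.languageCode.trim() === "") {
    problems.push("languageCode is required");
  }
  if (input.audioEncoding === undefined && !looksLikeWav(input.audioContent)) {
    problems.push("audioEncoding is required for non-WAV audio");
  }
  const max = input.maxAlternatives;
  if (max !== undefined && (!Number.isInteger(max) || max < 0 || max > 30)) {
    problems.push("maxAlternatives must be an integer from 0 to 30");
  }
  return problems;
}

// Usage: a 12-byte buffer holding only the RIFF/WAVE header markers.
const wavHeader = new Uint8Array([
  0x52, 0x49, 0x46, 0x46, 0, 0, 0, 0, 0x57, 0x41, 0x56, 0x45,
]);
const exampleInput: RecognizeInput = {
  audioContent: wavHeader,
  languageCode: "en-US",
  maxAlternatives: 2,
};
console.log(validateInput(exampleInput)); // prints []
```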


Result (object)

results

required

Sequential list of transcription results corresponding to sequential portions of audio.

list

alternatives

required

Alternative hypotheses.

list

transcript

required

Transcript text representing the words recognized in audio input.

string

confidence

optional

The confidence estimate between 0.0 and 1.0. A higher number indicates an estimated greater likelihood that the recognized words are correct. The default of 0.0 is a sentinel value indicating confidence was not set.

number

Example
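
A minimal sketch of reading the Result structure described above: it picks one alternative per result and joins the transcripts in order. The type and function names are hypothetical. A confidence of 0.0 is treated as "not set", as documented; keeping the first alternative when no confidence is known is an assumption about provider ordering, not something this page states.

```typescript
// Hypothetical types mirroring the documented Result fields.
interface Alternative {
  transcript: string;
  confidence?: number; // 0.0 is a sentinel meaning "not set"
}
interface RecognizeResult {
  results: { alternatives: Alternative[] }[];
}

// Returns the confidence only when it was actually set.
function knownConfidence(alt: Alternative): number | undefined {
  return alt.confidence !== undefined && alt.confidence > 0
    ? alt.confidence
    : undefined;
}

// Picks the alternative with the highest known confidence.
// Assumption: when no confidence is known, the first alternative is kept.
function bestAlternative(alts: Alternative[]): Alternative | undefined {
  let best: Alternative | undefined;
  for (const alt of alts) {
    if (best === undefined) {
      best = alt;
      continue;
    }
    const current = knownConfidence(alt) ?? -1;
    const leading = knownConfidence(best) ?? -1;
    if (current > leading) best = alt;
  }
  return best;
}

// Joins the chosen transcript of each sequential result into one string.
function fullTranscript(result: RecognizeResult): string {
  return result.results
    .map((r) => bestAlternative(r.alternatives)?.transcript ?? "")
    .filter((t) => t !== "")
    .join(" ");
}

// Usage with made-up data in the documented shape.
const sampleResult: RecognizeResult = {
  results: [
    {
      alternatives: [
        { transcript: "hello word", confidence: 0.4 },
        { transcript: "hello world", confidence: 0.9 },
      ],
    },
    {
      alternatives: [
        { transcript: "how are you", confidence: 0.0 },
        { transcript: "how art you" },
      ],
    },
  ],
};
console.log(fullTranscript(sampleResult)); // prints "hello world how are you"
```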


Implementation details

Provider

mock

Use case

Recognize

Author

@superface

Source

View source

Verified