Also known as speech-to-text (STT), transcription is the process of converting speech to text. We have built this endpoint with strong support for African languages.
If Python and JavaScript aren’t your vibe, try using the REST API directly; see the API Reference section. We’re working to add SDKs for more languages.
The transcribe() function can be used to transcribe audio. Pass either a url or content to the transcribe() function.
Examples are provided below as a guide for you.
If you provide the url, we will download the file from the specified location.
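The choice between url and content can be sketched in Python as follows. This is a minimal sketch, not SDK code: build_transcribe_args and has_supported_format are hypothetical helpers introduced here for illustration, the language keyword name is an assumption, and the format set mirrors the file formats this page lists.

```python
from pathlib import Path
from typing import Optional
from urllib.parse import urlparse

# File formats this page lists as supported.
SUPPORTED_FORMATS = {"mp3", "wav", "m4a", "ogg"}


def has_supported_format(name: str) -> bool:
    """Return True if a file name or URL ends in a supported extension."""
    # urlparse drops any query string, so "clip.mp3?token=1" still matches.
    suffix = Path(urlparse(name).path).suffix.lstrip(".").lower()
    return suffix in SUPPORTED_FORMATS


def build_transcribe_args(language: str,
                          url: Optional[str] = None,
                          content: Optional[bytes] = None) -> dict:
    """Build keyword arguments for a transcribe() call (hypothetical helper)."""
    # Exactly one of url / content must be supplied.
    if (url is None) == (content is None):
        raise ValueError("Provide either url or content, but not both.")
    args = {"language": language}  # "language" keyword name is an assumption
    if url is not None:
        args["url"] = url
    else:
        args["content"] = content
    return args


# Usage sketch; how transcribe() is imported or reached depends on the SDK:
# args = build_transcribe_args("yo", url="https://example.com/audio/clip.wav")
# result = transcribe(**args)
```

Checking the arguments before the request is a design choice that surfaces the "not both" rule locally instead of as an API error.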
Provide either content (file) or url (str), but do not provide both.
We support mp3, wav, m4a, and ogg file formats.
If you provide a url, ensure that access to the file is not blocked by authentication.
The transcription response is returned as application/json.
A request_id is returned for issue resolution with our support team.
Below is an example of a response from the transcription endpoint.
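Handling such a response in Python might look like the sketch below. This page confirms only that the body is application/json and that it carries a request_id; the text field name and the sample values are assumptions made for illustration, and parse_transcription is a hypothetical helper.

```python
import json
from typing import Optional, Tuple


def parse_transcription(body: str) -> Tuple[Optional[str], str]:
    """Extract (request_id, text) from a JSON response body."""
    data = json.loads(body)
    # request_id is documented on this page; keep it for support tickets.
    request_id = data.get("request_id")
    # "text" is an assumed field name, not confirmed by this page.
    text = data.get("text", "")
    return request_id, text


# Illustrative body only; real responses may contain different fields.
sample_body = '{"request_id": "example-request-id", "text": "example transcript"}'
request_id, text = parse_transcription(sample_body)
```

Logging the request_id alongside each transcript makes it easier to quote when contacting support.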
Our speech-to-text model supports the following languages:
ha
ig
yo
am
en
More info on languages can be found on the Languages page.
When transcribing, use the language code (e.g. en, yo, ig) and not the full language name.
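To guard against passing a full language name by mistake, a small lookup can normalize input to one of the supported codes. This is a sketch under stated assumptions: to_language_code is a hypothetical helper, and the language names are the standard ISO 639-1 names for these codes, not taken from this page.

```python
# Codes supported by the speech-to-text model, with their ISO 639-1 names.
SUPPORTED_LANGUAGES = {
    "ha": "Hausa",
    "ig": "Igbo",
    "yo": "Yoruba",
    "am": "Amharic",
    "en": "English",
}


def to_language_code(value: str) -> str:
    """Return a supported language code for a code or a full language name."""
    key = value.strip().lower()
    if key in SUPPORTED_LANGUAGES:
        return key
    # Fall back to matching the full name, e.g. "Yoruba" -> "yo".
    for code, name in SUPPORTED_LANGUAGES.items():
        if name.lower() == key:
            return code
    raise ValueError(f"Unsupported language: {value!r}")
```

For example, to_language_code("Yoruba") yields "yo", which can then be passed as the language code when transcribing.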