
AI-powered, automated, and super-accurate speech recognition infrastructure tool for all of your audio and video data.
Made with ❤️ and ☕ in Bucharest
All the available models for automatic speech-to-text recognition using static files (i.e. files that you upload via our web platform or API, and you get the transcript after the file is processed)
ID
LANGUAGE
STATUS
DOMAIN
SPEAKER RECOGNITION
MULTIPLE CHANNELS
PUNCTUATION
CAPITALIZATION
ENTITIES RECOGNITION
NUMERALS CONVERSION
CUSTOM VOCABULARY
PROFANITY FILTER
ro_RO
Romanian (RO)
LIVE
Multi-domain
en_GB
English (GB)
LIVE
General
de_DE
German (DE)
LIVE
General
es_ES
Spanish (ES)
LIVE
General
fr_FR
French (FR)
LIVE
General
All the available models for automatic speech-to-text recognition using live streams (i.e. you connect your microphone viw our web platform, and you get the transcript in real-time, while you record yourself).
ID
LANGUAGE
STATUS
DOMAIN
SPEAKER RECOGNITION
MULTIPLE CHANNELS
PUNCTUATION
CAPITALIZATION
ENTITIES RECOGNITION
NUMERALS CONVERSION
CUSTOM VOCABULARY
PROFANITY FILTER
ro_RO
Romanian (Romania)
ONLY ON REQUEST
General
All the available formats currently supported by the platform.
AUDIO FORMATS
MP3
FLAC
OGG
WAV
WMA
AIFF
AAC
M4A
VIDEO FORMATS
MP4
MPEG
QT
WMV
AVI
FLV
WEBM
TS
ASF
MKV