Automatic Speech Recognition (ASR) is a subfield of computational linguistics that enables machines to recognise speech and convert the speech data into textual form.
In ASR, an audio file or speech spoken to a microphone is processed and converted to text, therefore it is also known as Speech-to-Text (STT). Then this text is fed to a Natural Language Processing/Understanding (NLP/NLU) to understand and extract key information (such as intentions, sentiments), and then appropriate action is taken. There are also stand-alone applications of ASR, e.g. transcribing dictation, or producing real-time subtitles for videos.
We're frequently asked by our customers, "How does Slang Retail Assistant compare to Google Voice Search?" so we ran a benchmarking test where we compared their performances. Read this blog post for a quick summary!
How to mitigate challenges in designing software at an early-stage startup
Slang Labs becomes one of the first players in the In-App Voice Assistant space globally to be backed by Google