AI Term · circa 1960 · Added Jun 8, 2026
Speech-to-Text (STT)
Speech-to-text technology converts spoken language into written text.
Speech-to-text (STT) technology, also known as automatic speech recognition (ASR), translates spoken language into written text. The process involves capturing sound waves as a digital audio signal, extracting acoustic features from that signal, and using acoustic and language models to map those features to the most likely sequence of words. STT is used in automatic transcription services, real-time captioning, and interactive voice response systems, where it improves accessibility and productivity.
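The steps above (capture audio, interpret it, match it to known words) can be illustrated with a deliberately tiny sketch. Everything in it is an assumption made for illustration: the "audio" is a synthetic sine tone, the only acoustic feature is the zero-crossing rate, and the two-word vocabulary in `TEMPLATES` is hypothetical. Production STT systems rely on far richer features and on statistical or neural models, so read this as a picture of the pipeline's shape rather than of how any real system works.

```python
import math

SAMPLE_RATE = 8000  # samples per second (assumed value for this toy example)


def synth_tone(freq_hz, duration_s=0.1):
    """Stand-in for captured audio: a pure sine tone as a list of samples."""
    n = int(SAMPLE_RATE * duration_s)
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def zero_crossing_rate(samples):
    """Crude acoustic feature: share of adjacent sample pairs that change sign."""
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    return crossings / (len(samples) - 1)


# Hypothetical "known word patterns": one reference feature value per word.
TEMPLATES = {
    "yes": zero_crossing_rate(synth_tone(300)),
    "no": zero_crossing_rate(synth_tone(900)),
}


def transcribe(samples):
    """Return the vocabulary word whose template is closest to the input feature."""
    feature = zero_crossing_rate(samples)
    return min(TEMPLATES, key=lambda word: abs(TEMPLATES[word] - feature))


if __name__ == "__main__":
    print(transcribe(synth_tone(320)))  # a tone close to the "yes" template
    print(transcribe(synth_tone(850)))  # a tone close to the "no" template
```

Nearest-template matching keeps the example short. It also shows why inputs that drift away from the stored patterns, such as unfamiliar accents or noisy recordings, are harder to transcribe correctly.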
Examples
- Using the Google Cloud Speech-to-Text API to transcribe meetings automatically.
- Live captioning during webinars or live streams for accessibility.
- Voice-dictation features in smartphones for texting without typing.
Common misconceptions
- STT can perfectly transcribe any voice input. In reality, accuracy varies with accents and noise levels.
- STT understands context. In reality, it converts speech into words without understanding what they mean.
- STT systems require no training for accurate results. In reality, they often need customization, for example to handle specialized vocabulary.
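Transcription accuracy is commonly reported as word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the system's output into a reference transcript, divided by the number of words in the reference. The sketch below is one minimal way to compute it, assuming whitespace tokenization and case-insensitive comparison. The function name `word_error_rate` and the sample sentences are illustrative, not taken from any particular STT toolkit.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance between two transcripts, divided by reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words (standard Levenshtein recurrence).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # reference word dropped (deletion)
                    curr[j - 1] + 1,                  # extra hypothesis word (insertion)
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "turn on the kitchen lights"
    print(word_error_rate(reference, "turn on the kitchen lights"))
    print(word_error_rate(reference, "turn on a kitchen light"))
```

A WER of 0 means the output matches the reference exactly. Because insertions count as errors, WER can exceed 1 when the output is much longer than the reference.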