AI Term · circa 1960 · Added Jun 8, 2026

Speech-to-Text (STT)

Speech-to-text technology converts spoken language into written text.

Speech-to-text (STT), also known as automatic speech recognition (ASR), translates spoken language into written text. A system captures the sound wave as digital audio, extracts acoustic features from it, and uses algorithms to match those features to the most likely sequence of words. STT is used in automatic transcription services, real-time captioning, and interactive voice response systems, where it improves accessibility and productivity.
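As a rough illustration of the "capturing sound waves" step, the sketch below splits a digitized signal into short overlapping frames and measures the loudness of each one. It is a minimal sketch and not how any particular STT product works: the function names, the 25 ms frame and 10 ms hop sizes, and the synthetic test signal are all assumptions chosen for the example.

```python
import math

def frame_signal(samples, sample_rate, frame_ms=25, hop_ms=10):
    """Split a list of audio samples into overlapping fixed-length frames.

    frame_ms and hop_ms are illustrative defaults, not values from the article.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    hop_len = int(sample_rate * hop_ms / 1000)
    frames = []
    # Stop once a full frame no longer fits in the remaining samples.
    for start in range(0, len(samples) - frame_len + 1, hop_len):
        frames.append(samples[start:start + frame_len])
    return frames

def frame_energy(frame):
    """Mean squared amplitude of one frame, a crude loudness measure."""
    return sum(x * x for x in frame) / len(frame)

# Synthetic "recording" (assumed for the demo): 0.5 s of silence,
# then 0.5 s of a 440 Hz tone, sampled at 16 kHz.
SAMPLE_RATE = 16000
silence = [0.0] * (SAMPLE_RATE // 2)
tone = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
        for n in range(SAMPLE_RATE // 2)]
samples = silence + tone

frames = frame_signal(samples, SAMPLE_RATE)
energies = [frame_energy(f) for f in frames]

print(len(frames), "frames")
print("first frame energy:", energies[0])
print("last frame energy:", round(energies[-1], 3))
```

A real system would go on to turn each frame into richer features and pass them to a trained model; this sketch stops at the framing stage, which is enough to show how a continuous sound wave becomes a sequence of small units that an algorithm can interpret.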

Examples

Using the Google Cloud Speech-to-Text API to transcribe meetings automatically.
Live captioning during webinars or live streams for accessibility.
Voice-dictation features in smartphones for texting without typing.

Common misconceptions

STT can perfectly transcribe any voice input; in reality, accuracy varies with accent, background noise, and audio quality.
STT understands what is said; in reality, it transcribes words without interpreting their meaning.
STT systems require no training for accurate results; in reality, they often need customization, such as adding domain-specific vocabulary.
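Because accuracy varies, it helps to measure it. A common metric is word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the system's output into a correct reference transcript, divided by the number of words in the reference. The following is a minimal sketch, assuming simple lowercase whitespace tokenization; the function name and the sample sentences are made up for illustration.

```python
def word_error_rate(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)

# Hypothetical transcript pair: two of the five reference words are wrong.
wer = word_error_rate("turn on the kitchen lights",
                      "turn on the chicken light")
print(wer)  # 0.4
```

A WER of 0.4 means 40% of the reference words were transcribed incorrectly. Comparing WER on clean recordings against noisy or accented ones is one way to see how much a given system's accuracy actually varies.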
