Speech recognition

Speech recognition enables machines to transcribe spoken language into text. Watch Ming Yan explain more.

Speech recognition, a key field within digital media, has advanced significantly and is changing how we interact with machines by voice.

Definition of speech recognition

Speech recognition is the technology enabling machines to interpret and transcribe human speech into text, facilitating communication through voice signals.

Fundamentals of speech recognition

Speech recognition technology aims to convert speech signals into text and commands. It works at two levels: basic recognition of the words spoken, and more advanced understanding of what they mean. It is a significant research direction within digital speech signal processing.

Historical progression

The field began with early experimental systems such as Audrey in 1952. Over the following decades, the focus moved from small-vocabulary recognition of isolated words to large-vocabulary recognition of continuous speech, and methods shifted from template matching to statistical models.

Recognition process

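Recognition relies on an acoustic model and a language model, which must be acquired first. The system then detects speech in the incoming audio, splits it into frames, extracts features from each frame, and decodes those features against the two models to produce the recognition result.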
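The stages named above can be illustrated with a minimal Python sketch. It is not how any particular recogniser is built: the function names, the 25 ms frame and 10 ms hop, the energy threshold, and the toy scores passed to `decode` are all assumptions chosen for illustration, and a synthetic tone stands in for speech. Real systems use richer features and search over whole word sequences rather than a handful of candidate words.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def log_energy(frame):
    """Log of the mean squared amplitude; the small constant avoids log(0)."""
    return math.log(sum(x * x for x in frame) / len(frame) + 1e-10)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return crossings / (len(frame) - 1)

def detect_speech(frames, threshold):
    """Flag a frame as speech when its log energy exceeds the threshold."""
    return [log_energy(f) > threshold for f in frames]

def extract_features(frames):
    """Toy feature vector per frame: (log energy, zero-crossing rate)."""
    return [(log_energy(f), zero_crossing_rate(f)) for f in frames]

def decode(acoustic_logp, language_logp):
    """Pick the candidate word with the best combined log score.

    Both arguments are hypothetical dictionaries mapping a word to a log
    probability; in a real system they would come from trained models.
    """
    return max(acoustic_logp, key=lambda w: acoustic_logp[w] + language_logp[w])

# Synthetic input at 8 kHz: 0.1 s silence, 0.1 s of a 200 Hz tone, 0.1 s silence.
SAMPLE_RATE = 8000
silence = [0.0] * 800
tone = [0.5 * math.sin(2 * math.pi * 200 * n / SAMPLE_RATE) for n in range(800)]
signal = silence + tone + silence

frames = frame_signal(signal, frame_len=200, hop=80)   # 25 ms frames, 10 ms hop
speech_flags = detect_speech(frames, threshold=-5.0)   # assumed threshold
features = extract_features([f for f, is_speech in zip(frames, speech_flags)
                             if is_speech])

# Toy decoding step with made-up scores for two candidate words.
best_word = decode({"yes": -1.0, "no": -2.5}, {"yes": -0.7, "no": -0.4})
```

Overlapping frames are used because speech changes quickly, so each short frame can be treated as roughly stable while the overlap avoids losing information at frame boundaries. Adding log scores in `decode` corresponds to multiplying the acoustic and language model probabilities.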

Technical challenges

Challenges include handling dialects and accents, coping with background noise, and dealing with the variability of spoken language, which often departs from standard grammar.

Applications

Speech recognition is widely applied in human-computer interaction, gaming, control systems for various devices, and automated customer service, highlighting its versatility in digital signal processing.

Impact of deep learning

Deep learning has transformed speech recognition, changing both its language models and its processing pipeline. It has also pushed the technology into commercial products and reshuffled the industry around it.

In summary, speech recognition stands as a pivotal technology in digital media, significantly enhanced by deep learning and with broad applications across various fields.

This article is from the free online course Introduction to Digital Media

Created by
FutureLearn - Learning For Life
