Incorporating Knowledge Sources into Statistical Speech Recognition

by ; ; ;
Format: Hardcover
Pub. Date: 2009-03-01
Publisher(s): Springer Verlag
  • Free Shipping Icon

    This Item Qualifies for Free Shipping!*

    *Excludes marketplace orders.

List Price: $209.99

Rent Textbook

Select for Price
There was a problem. Please try again later.

Rent Digital

Rent Digital Options
Online:30 Days access
Downloadable:30 Days
$57.24
Online:60 Days access
Downloadable:60 Days
$76.32
Online:90 Days access
Downloadable:90 Days
$95.40
Online:120 Days access
Downloadable:120 Days
$114.48
Online:180 Days access
Downloadable:180 Days
$124.02
Online:1825 Days access
Downloadable:Lifetime Access
$190.80
$124.02

New Textbook

We're Sorry
Sold Out

Used Textbook

We're Sorry
Sold Out

How Marketplace Works:

  • This item is offered by an independent seller and not shipped from our warehouse
  • Item details like edition and cover design may differ from our description; see seller's comments before ordering.
  • Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
  • Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
  • Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.

Summary

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible.

Table of Contents

Introduction and Book Overviewp. 1
Automatic Speech Recognition - A Way of Human-Machine Communicationp. 1
Approaches to Speech Recognitionp. 4
Knowledge-based Approachesp. 4
Corpus-based Approachesp. 6
State-of-the-art ASR Performancep. 7
Studies on Incorporating Knowledge Sourcesp. 10
Sources of Variability in Speechp. 10
Existing Ways of Incorporating Knowledge Sourcesp. 12
Major Challenges to Overcomep. 15
Book Outlinep. 16
Statistical Speech Recognitionp. 19
Pattern Recognition Overviewp. 19
Theory of Hidden Markov Modelsp. 22
Markov Chainp. 22
General form of an HMMp. 23
Principle Cases of HMMp. 25
Pattern Recognition for HMM-Based ASR Systemsp. 35
Front-end Feature Extractionp. 36
HMM-Based Acoustic Modelp. 43
Pronunciation Lexiconp. 49
Language Modelp. 50
Search Algorithmp. 51
Graphical Framework to Incorporate Knowledge Sourcesp. 55
Graphical Model Representationp. 56
Probability Theoryp. 56
Graphical Modelp. 59
Junction Tree Algorithmp. 63
Procedure of GFIKSp. 68
Causal Relationship between Information Sourcesp. 70
Direct Inference on Bayesian Networkp. 71
Junction Tree Decompositionp. 72
Junction Tree Inferencep. 75
Practical Issues of GFIKSp. 75
Types of Knowledge Sourcesp. 75
Different Levels of Incorporationp. 76
Speech Recognition Using GFIKSp. 79
Applying GFIKS at the HMM State Levelp. 79
Causal Relationship Between Information Sourcesp. 80
Inferencep. 81
Enhancing Model Reliabilityp. 81
Training and Recognition Issuesp. 82
Applying GFIKS at the HMM Phonetic-unit Levelp. 83
Causal Relationship between Information Sourcesp. 83
Inferencep. 85
Enhancing the Model Reliabilityp. 85
Deleted Interpolationp. 86
Training and Recognition Issuesp. 86
Experiments with Various Knowledge Sourcesp. 87
Incorporating Knowledge at the HMM State Levelp. 87
Incorporating Knowledge at the HMM Phonetic-unit Levelp. 116
Experiments Summary and Discussionp. 132
Conclusions and Future Directionsp. 139
Conclusionsp. 139
Theoretical Issuesp. 139
Application Issuesp. 140
Experimental Issuesp. 141
Future Directions: A Roadmap to a Spoken Language Dialog Systemp. 142
Speech Materialsp. 145
AURORA TIDigit Corpusp. 145
TIMIT Acoustic-Phonetic Speech Corpusp. 146
Wall Street Journal Corpusp. 148
ATR Basic Travel Expression Corpusp. 150
ATR English Database Corpusp. 150
ATR Software Toolsp. 153
Generic Properties of ATRASRp. 153
Data Preparationp. 153
SSS Data Generating Toolsp. 155
Acoustic Model Training Toolsp. 155
Language Model Training Toolsp. 157
Recognition Toolsp. 157
Composition of Bayesian Wide-phonetic Contextp. 163
Proof using Bayes's Rulep. 163
Variants of Bayesian Wide-phonetic Context Modelp. 164
Statistical Significance Testingp. 169
Statistical Hypothesis Testingp. 169
The Use of the Sign Test for ASRp. 172
Referencesp. 175
Indexp. 189
Table of Contents provided by Ingram. All Rights Reserved.

An electronic version of this book is available through VitalSource.

This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.

By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.

Digital License

You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.

More details can be found here.

A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.

Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.

Please view the compatibility matrix prior to purchase.