
Speech and Audio Signal Processing Processing and Perception of Speech and Music
by Gold, Ben; Morgan, Nelson; Ellis, Dan-
This Item Qualifies for Free Shipping!*
*Excludes marketplace orders.
Buy New
Rent Textbook
Used Textbook
We're Sorry
Sold Out
eTextbook
We're Sorry
Not Available
How Marketplace Works:
- This item is offered by an independent seller and not shipped from our warehouse
- Item details like edition and cover design may differ from our description; see seller's comments before ordering.
- Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
- Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
- Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.
Summary
Author Biography
Nelson Morgan is the Director of the International Computer Science Institute, an independent, not-for profit research laboratory affiliated with the University of California at Berkeley. Dr. Morgan is also Professor-in-Residence in the Electrical Engineering and Computer Sciences Department at UC Berkeley. Dr. Morgan is an IEEE Fellow.
Dan Ellis is Associate Professor in the Electrical Engineering Department of Columbia University. Dr. Ellis's Laboratory for Recognition and Organization of Speech and Audio (LabROSA) investigates how to extract high-level information from audio, including speech recognition, music description, and environmental sound processing.
Table of Contents
Preface To The 2011 Edition | p. xxi |
Introduction | p. 1 |
Historical Background | |
Synthetic A Udio: A Brief History | p. 9 |
Speech Analysis And Synthesis Overview | p. 21 |
Brief History Of Automatic Speech Recognition | p. 40 |
Speech-Recognition Overview | p. 59 |
Mathematical Background | |
Digital Signal Processing | p. 73 |
Digital Filtersand Discrete Fourier Transform | p. 87 |
Pattern Classification | p. 105 |
Statistical Pattern Classification | p. 124 |
Acoustics | |
Wave Basics | p. 141 |
Acoustic Tube Modeling Of Speech Production | p. 152 |
Musical Instrument Acoustics | p. 158 |
Room Acoustics | p. 179 |
Auditory Perception | |
Ear Physiology | p. 193 |
Psychoacoustics | p. 209 |
Models Of Pitch Perception | p. 218 |
Speech Perception | p. 232 |
Human Speech Recognition | p. 250 |
Speech Features | |
The Auditory System As A Filter Bank | p. 263 |
The Cepstrum As A Spectral Analyzer | p. 277 |
Linear Prediction | p. 286 |
A Utomatic Speech Recognition | |
Feature Extraction For Asr | p. 301 |
Linguistic Categories For Speech Recognition | p. 319 |
Deterministic Sequence Recognition For Asr | p. 337 |
Statistical Sequence Recognition | p. 350 |
Statistical Model Training | p. 364 |
Discriminant Acoustic Probability Estimation | p. 381 |
Acoustic Model Training: Further Topics | p. 394 |
Speech Recognition And Understanding | p. 416 |
Synthesis And Coding | |
Speech Synthesis | p. 431 |
Pitch Detection | p. 455 |
Vocoders | p. 473 |
Low-Rate Vocoders | p. 493 |
Medium-Rate And High-Rate Vocoders | p. 505 |
Perceptual A Udio Coding | p. 531 |
Other Applications | |
Some Aspects Of Computer Music Synthesis | p. 553 |
Music Signal Analysis | p. 567 |
Music Retrieval | p. 581 |
Source Separation | p. 59 |
Speech Transformations | p. 617 |
Speaker Verification | p. 633 |
Speaker Diarization | p. 644 |
Table of Contents provided by Publisher. All Rights Reserved. |
An electronic version of this book is available through VitalSource.
This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.
By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.
Digital License
You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.
More details can be found here.
A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.
Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.
Please view the compatibility matrix prior to purchase.