Speech Enhancement

Speech Enhancement: Theory and Practice, Second Edition

4.16 (6 ratings by Goodreads)
By (author) Philipos C. Loizou



With the proliferation of mobile devices and hearing devices, including hearing aids and cochlear implants, there is a growing and pressing need to design algorithms that can improve speech intelligibility without sacrificing quality. Responding to this need, Speech Enhancement: Theory and Practice, Second Edition introduces readers to the basic problems of speech enhancement and the various algorithms proposed to solve these problems. Updated and expanded, this second edition of the bestselling textbook broadens its scope to include evaluation measures and enhancement algorithms aimed at improving speech intelligibility.
Fundamentals, Algorithms, Evaluation, and Future Steps
Organized into four parts, the book begins with a review of the fundamentals needed to understand and design better speech enhancement algorithms. The second part describes all the major enhancement algorithms and, because these require an estimate of the noise spectrum, also covers noise estimation algorithms. The third part of the book looks at the measures used to assess the performance, in terms of speech quality and intelligibility, of speech enhancement methods. It also evaluates and compares several of the algorithms. The fourth part presents binary mask algorithms for improving speech intelligibility under ideal conditions. In addition, it suggests steps that can be taken to realize the full potential of these algorithms under realistic conditions.
What's New in This Edition
  • Updates in every chapter
  • A new chapter on objective speech intelligibility measures
  • A new chapter on algorithms for improving speech intelligibility
  • Real-world noise recordings (on accompanying CD)
  • MATLAB® code for the implementation of intelligibility measures (on accompanying CD)
  • MATLAB and C/C++ code for the implementation of algorithms to improve speech intelligibility (on accompanying CD)
Valuable Insights from a Pioneer in Speech Enhancement
Clear and concise, this book explores how human listeners compensate for acoustic noise in noisy environments. Written by a pioneer in speech enhancement and noise reduction in cochlear implants, it is an essential resource for anyone who wants to implement or incorporate the latest speech enhancement algorithms to improve the quality and intelligibility of speech degraded by noise.
Includes a CD with Code and Recordings
The accompanying CD provides MATLAB implementations of representative speech enhancement algorithms as well as speech and noise databases for the evaluation of enhancement algorithms.

Product details

  • Format: Hardback | 711 pages
  • Dimensions: 160.02 x 233.68 x 38.1mm | 1,111.3g
  • Publisher: Taylor & Francis Inc
  • Imprint: CRC Press Inc
  • Publication city/country: Boca Raton, United States
  • Language: English
  • Edition: New edition
  • Edition statement: 2nd New edition
  • Illustrations note: 24 Tables, black and white; 207 Illustrations, black and white
  • ISBN-10: 1466504218
  • ISBN-13: 9781466504219
  • Bestsellers rank: 1,105,674

Review quote

"... indispensable for anyone trying to further his or her understanding in the field of digital speech processing. This book is critical in helping the professional address the growing demand to design algorithms that can improve speech intelligibility, in the presence of noise, without sacrificing quality for hearing aids and cochlear implants and address the equally important growing need to design rooms in which we can hear better naturally. Loizou's clarity of presentation of the mathematical foundation of different algorithms for speech enhancement has a comprehensibility that can only come with the level of expertise possessed by Loizou and serves well for all professionals from acoustical engineers to audiologists. This book provides an exceptional foundation and insight into past, present and future innovative processing techniques. This book is valuable for students and professionals of all experience levels."-Bonnie Schnitta, SoundSense, LLC, Wainscott, New York, USA, from Noise Control Engineering Journal, January-February 2015
"... by far the most comprehensive treatment of speech enhancement available. All the most important techniques in the broad field of speech enhancement are covered, yet the author at the same time manages to treat each topic in great detail. ... The algorithms are complex, but Loizou's exposition is outstanding. The second edition brings the material right up to date, covering recent significant breakthroughs in binary masking algorithms ... . One of the great strengths of this text is the availability of code, allowing readers to better understand, deploy, and extend existing algorithms for speech enhancement. ... This volume is in reality far more than a textbook on speech enhancement. It is also one of the most important works on the effect of noise on speech perception, and as such will make a huge contribution to the education of the next generation of auditory scientists, and feed technological developments in all aspects of speech communication, particularly for individuals with hearing impairment."-Prof. Martin Cooke, Ikerbasque and University of the Basque Country, Vitoria, Spain
"This textbook offers outstanding reference material for teaching the clinical application of spectral enhancement to the audiology community. Dr. Loizou offers the reader tremendous insight into the fundamentals of digital signal processing, speech production and perception, and the characteristics of various noise sources. ... The textbook is essential for engineers, audiologists, and other professionals who seek to improve the listener's ability to hear a target signal against a background filled with competing noise using the spectral enhancement technique."-Amyn M. Amlani, Ph.D., University of North Texas, Denton, USA
"... a highly informative presentation of the fundamentals, seminal and current algorithms, evaluation metrics, and future work that is desirable for any new or experienced students and researchers in the exciting area of speech enhancement. ... I greatly appreciate the excellent organization of dividing the book into Fundamentals, Algorithms, Evaluation, and Future Steps, which can allow instructors and researchers to quickly decide on the material they want to teach their students, or learn or review themselves ... Dr. Loizou takes students and researchers with a range of experiences on an amazing journey through the exciting field of speech enhancement."-Marek Trawicki, Marquette University, Wauwatosa, Wisconsin, USA
"The first edition of this book established itself as the best reference for single-channel speech enhancement. Amazingly, this new edition is even better, and could be the most authoritative work in the area of modern single-channel techniques for speech enhancement to date. ... This is a unique book, combining both thorough theoretical developments and practical implementations. I highly recommend it to those interested in speech enhancement, as well as applied signal processing."-Vladimir Botchev, Analog Devices, Wilmington, Massachusetts, USA, from Association for Computing Machinery (ACM) Computing Reviews, July 2013

About Philipos C. Loizou

Philipos C. Loizou earned his bachelor's, master's, and doctorate degrees in electrical engineering from Arizona State University in Tempe. A pioneer in the field of speech enhancement and noise reduction in cochlear implants, Dr. Loizou was one of the first to develop specific enhancement algorithms that directly improve intelligibility. He was a postdoctoral fellow in the Department of Speech and Hearing Science at Arizona State University, an assistant professor at the University of Arkansas in Little Rock, and Cecil and Ida Green Professor in the Department of Electrical Engineering at the University of Texas at Dallas. Dr. Loizou was a fellow of the Acoustical Society of America. He was an associate editor of the International Journal of Audiology (2010-2012), IEEE Transactions on Biomedical Engineering (2009-2011), IEEE Transactions on Speech and Audio Processing (1999-2002), and IEEE Signal Processing Letters (2006-2009) and a member of the Speech Technical Committee (2008-2010) of the IEEE Signal Processing Society. He authored or coauthored numerous publications, including three textbooks.
For more information, see Dr. Loizou's profile at the University of Texas at Dallas.
Watch a video of Dr. Loizou talking about technology that would allow cochlear implant users to easily adjust settings on their hearing devices through a smartphone.

Table of contents

Introduction
  • Understanding the Enemy: Noise
  • Classes of Speech Enhancement Algorithms
  • Book Organization
  • References

Part I Fundamentals

Discrete-Time Signal Processing and Short-Time Fourier Analysis
  • Discrete-Time Signals
  • Linear Time-Invariant Discrete-Time Systems
  • z-Transform
  • Discrete-Time Fourier Transform
  • Short-Time Fourier Transform
  • Spectrographic Analysis of Speech Signals
  • Summary
  • References

Speech Production and Perception
  • Speech Signal
  • Speech Production Process
  • Engineering Model of Speech Production
  • Classes of Speech Sounds
  • Acoustic Cues in Speech Perception
  • Summary
  • References

Noise Compensation by Human Listeners
  • Intelligibility of Speech in Multiple-Talker Conditions
  • Acoustic Properties of Speech Contributing to Robustness
  • Perceptual Strategies for Listening in Noise
  • Summary
  • References

Part II Algorithms

Spectral-Subtractive Algorithms
  • Basic Principles of Spectral Subtraction
  • Geometric View of Spectral Subtraction
  • Shortcomings of the Spectral Subtraction Method
  • Spectral Subtraction Using Oversubtraction
  • Nonlinear Spectral Subtraction
  • Multiband Spectral Subtraction
  • MMSE Spectral Subtraction Algorithm
  • Extended Spectral Subtraction
  • Spectral Subtraction Using Adaptive Gain Averaging
  • Selective Spectral Subtraction
  • Spectral Subtraction Based on Perceptual Properties
  • Performance of Spectral Subtraction Algorithms
  • Summary
  • References

Wiener Filtering
  • Introduction to Wiener Filter Theory
  • Wiener Filters in the Time Domain
  • Wiener Filters in the Frequency Domain
  • Wiener Filters and Linear Prediction
  • Wiener Filters for Noise Reduction
  • Iterative Wiener Filtering
  • Imposing Constraints on Iterative Wiener Filtering
  • Constrained Iterative Wiener Filtering
  • Constrained Wiener Filtering
  • Estimating the Wiener Gain Function
  • Incorporating Psychoacoustic Constraints in Wiener Filtering
  • Codebook-Driven Wiener Filtering
  • Audible Noise Suppression Algorithm
  • Summary
  • References

Statistical-Model-Based Methods
  • Maximum-Likelihood Estimators
  • Bayesian Estimators
  • MMSE Estimator
  • Improvements to the Decision-Directed Approach
  • Implementation and Evaluation of the MMSE Estimator
  • Elimination of Musical Noise
  • Log-MMSE Estimator
  • MMSE Estimation of the pth-Power Spectrum
  • MMSE Estimators Based on Non-Gaussian Distributions
  • Maximum A Posteriori (MAP) Estimators
  • General Bayesian Estimators
  • Perceptually Motivated Bayesian Estimators
  • Incorporating Speech Absence Probability in Speech Enhancement
  • Methods for Estimating the A Priori Probability of Speech Absence
  • Summary
  • References

Subspace Algorithms
  • Introduction
  • Using SVD for Noise Reduction: Theory
  • SVD-Based Algorithms: White Noise
  • SVD-Based Algorithms: Colored Noise
  • SVD-Based Methods: A Unified View
  • EVD-Based Methods: White Noise
  • EVD-Based Methods: Colored Noise
  • EVD-Based Methods: A Unified View
  • Perceptually Motivated Subspace Algorithms
  • Subspace-Tracking Algorithms
  • Summary
  • References

Noise-Estimation Algorithms
  • Voice Activity Detection vs. Noise Estimation
  • Introduction to Noise-Estimation Algorithms
  • Minimal-Tracking Algorithms
  • Time-Recursive Averaging Algorithms for Noise Estimation
  • Histogram-Based Techniques
  • Other Noise-Estimation Algorithms
  • Objective Comparison of Noise-Estimation Algorithms
  • Summary
  • References

Part III Evaluation

Evaluating Performance of Speech Enhancement Algorithms
  • Quality vs. Intelligibility
  • Evaluating Intelligibility of Processed Speech
  • Evaluating Quality of Processed Speech
  • Evaluating Reliability of Quality Judgments: Recommended Practice
  • Summary
  • References

Objective Quality and Intelligibility Measures
  • Objective Quality Measures
  • Evaluation of Objective Quality Measures
  • Quality Measures: Summary of Findings and Future Directions
  • Speech Intelligibility Measures
  • Evaluation of Intelligibility Measures
  • Intelligibility Measures: Summary of Findings and Future Directions
  • Summary
  • References

Comparison of Speech Enhancement Algorithms
  • NOIZEUS: A Noisy Speech Corpus for Quality Evaluation of Speech Enhancement Algorithms
  • Comparison of Enhancement Algorithms: Speech Quality
  • Comparison of Enhancement Algorithms: Speech Intelligibility
  • Summary
  • References

Part IV Future Steps

Algorithms That Can Improve Speech Intelligibility
  • Reasons for the Absence of Intelligibility Improvement with Existing Noise-Reduction Algorithms
  • Algorithms Based on Channel Selection: A Different Paradigm for Noise Reduction
  • Channel-Selection Criteria
  • Intelligibility Evaluation of Channel-Selection-Based Algorithms: Ideal Conditions
  • Implementation of Channel-Selection-Based Algorithms in Realistic Conditions
  • Evaluating Binary Mask Estimation Algorithms
  • Channel Selection and Auditory Scene Analysis
  • Summary
  • References

Appendices
  • Appendix A: Special Functions and Integrals
  • Appendix B: Derivation of the MMSE Estimator
  • Appendix C: MATLAB® Code and Speech/Noise Databases

Index

Rating details

6 ratings
4.16 out of 5 stars
  • 5 stars: 33% (2)
  • 4 stars: 50% (3)
  • 3 stars: 17% (1)
  • 2 stars: 0% (0)
  • 1 star: 0% (0)
Book ratings by Goodreads