Skip to main content

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments

  • 1st Edition - September 4, 2024
  • Latest edition
  • Author: Xiao-Lei Zhang
  • Language: English

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applic… Read more

Early spring sale

Nurture your knowledge

Grow your expertise with up to 25% off trusted resources.

Description

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applications. The book begins by looking at the basics of deep learning and common deep network models, followed by front-end algorithms for deep learning-based speech denoising, speech detection, single-channel speech enhancement multi-channel speech enhancement, multi-speaker speech separation, and the applications of deep learning-based speech denoising in speaker verification and speech recognition.

Key features

  • Provides a comprehensive introduction to the development of deep learning-based robust speech processing
  • Covers speech detection, speech enhancement, dereverberation, multi-speaker speech separation, robust speaker verification, and robust speech recognition
  • Focuses on a historical overview and then covers methods that demonstrate outstanding performance in practical applications

Readership

Senior undergraduate students, graduate students, and professionals with a solid foundation in speech signal processing and machine learning who are engaged in intelligent speech processing

Table of contents

1. Introduction

2. Fundamentals of Deep Learning

3. Voice Activity Detection

4. Single-Channel Speech Enhancement

5. Multi-Channel Speech Enhancement

6. Multi-Speaker Speech Separation

7. Speaker Recognition

8. Speech Recognition

Product details

  • Edition: 1
  • Latest edition
  • Published: September 9, 2024
  • Language: English

About the author

XZ

Xiao-Lei Zhang

Xiao-Lei Zhang received his Ph.D. degree with honors from Tsinghua University, China, in 2012. He was a postdoctoral researcher with the Department of Electronic Engineering at Tsinghua University from 2012 to 2014. He was a visiting scholar at The Ohio State University, USA, from 2013 to 2014 and a postdoctoral researcher with the Department of Computer Science and Engineering, The Ohio State University, from 2014 to 2016. Since 2016 he has been a full professor at the Northwestern Polytechnical University, Xi'an, China.

His research interests are the topics in speech processing, machine learning, statistical signal processing, and artificial intelligence.

Affiliations and expertise
Northwestern Polytechnical University, Xi'an, China

View book on ScienceDirect

Read Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments on ScienceDirect