Speech enhancement based on adaptive wavelet denoising on multitaper spectrum

Tai Chiu Hsung, Pak Kong Lun

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

2 Citations (Scopus)


Classical speech enhancement algorithms often require a good estimation of the short-time power spectrum using, for instance, the periodogram methods. However, it is well known that traditional periodogram methods are prone to induce large variance, hence produces the "musical noise" after enhancement. To alleviate this problem, multitaper spectrum (MTS) estimators with wavelet denoising were proposed. In this paper, we investigate the properties of the MTS of noisy speech signals. We find that, in the log MTS domain, the variance of noise varies according to the magnitude of the underlying speech spectrum. It implies that when applying wavelet denoising to the log MTS, the constant threshold used in the traditional methods is not appropriate. Based on this observation, we further develop a wavelet denoising method with adaptive threshold for estimating power spectrum using multitaper. Simulation results show that the spectrum estimated using the proposed method is consistently more accurate than the traditional uniform thresholding methods. Hence, it further improves the current speech enhancement algorithms using the MTS approaches.
Original languageEnglish
Title of host publication2008 IEEE International Symposium on Circuits and Systems, ISCAS 2008
Number of pages4
Publication statusPublished - 19 Sep 2008
Event2008 IEEE International Symposium on Circuits and Systems, ISCAS 2008 - Seattle, WA, United States
Duration: 18 May 200821 May 2008


Conference2008 IEEE International Symposium on Circuits and Systems, ISCAS 2008
Country/TerritoryUnited States
CitySeattle, WA

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Electrical and Electronic Engineering

Cite this