Although a number of estimation methods working in different domains have been put forward, there are still demands for improvement, especially for noisy speech. In this paper, we present iPEEH, a general technique to raise performance of pitch estimators by enhancing harmonics. By analysis and experiments, it is found that missing and submerged harmonics are the root causes for failures of many pitch detectors. Hence, we propose to enhance the harmonics in spectrum before implementing the pitch detection. One enhancement algorithm that mainly applies the square operation to regenerate harmonics is presented in detail, including the theoretical analysis and implementation. Four speech databases with 11 types of additive noise and 5 noise levels are utilized in assessment. We compare the performance of algorithms before and after using iPEEH. Experimental results indicate that the proposed iPEEH can effectively reduce the detection errors. In some cases, the error rate reductions are higher than 20%. In addition, the advantage of iPEEH is manifold since it is demonstrated in experiments that the iPEEH is effective for various noise types, noise levels, multiple basic frequency-based estimators, and two audio types. Through this work, we investigated the underlying reasons for pitch detection failures and presented a novel direction for pitch detection. Besides, this approach, a preprocessing step in essence, indicates the significance of preprocessing for any intelligent systems.
- Fundamental frequency detection
ASJC Scopus subject areas
- Computer Science Applications
- Artificial Intelligence