|Affiliations:||Mathematical Institute of the Serbian Academy of Sciences and Arts||Title:||The impact of the pitch on the estimation of MFCC||Other Titles:||Uticaj osnovne učestanosti na estimaciju MFCC koeficijenata||Journal:||2011 19th Telecommunications Forum, TELFOR 2011 - Proceedings of Papers||First page:||651||Last page:||654||Conference:||19th Telecommunications Forum, TELFOR 2011; Belgrade; Serbia; 22 November 2011 through 24 November 2011||Issue Date:||1-Dec-2011||ISBN:||978-1-457-71498-6||DOI:||10.1109/TELFOR.2011.6143631||Abstract:||
In this paper, the impact of the pitch on the variability of MFCC, and their influence on the performance of the automatic speech recognition system, is analyzed. In case that a speaker has a high pitch, the distance between adjacent harmonics in the spectrum of voiced phonemes is larger, which results in poorer description of the spectral envelope. Additional problem arises in the case that a band-pass filter from the analysis filter bank covers the range between two harmonics, capturing the part of spectrum without energy, which consequently leads to the detection of sudden, non-existing changes in the spectral envelope. The reduction of these variations is analyzed by using a lower number of MFCC, by expending the bandwidths of the band-pass filters from the filter bank at lower frequencies, and also by low-pass filtering of the filter bank output. Some of the results are somewhat different from the similar results presented in the literature, for which the adequate explanations are offered.
Show full item record
checked on Nov 28, 2023
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.