Authors: Jakovljević, Nikša
Popović, Branislav
Janev, Marko 
Delić, Vlado
Affiliations: Mathematical Institute of the Serbian Academy of Sciences and Arts 
Title: The impact of the pitch on the estimation of MFCC
Other Titles: Uticaj osnovne učestanosti na estimaciju MFCC koeficijenata
Journal: 2011 19th Telecommunications Forum, TELFOR 2011 - Proceedings of Papers
First page: 651
Last page: 654
Conference: 19th Telecommunications Forum, TELFOR 2011; Belgrade; Serbia; 22 November 2011 through 24 November 2011
Issue Date: 1-Dec-2011
ISBN: 978-1-457-71498-6
DOI: 10.1109/TELFOR.2011.6143631
In this paper, the impact of the pitch on the variability of MFCC, and their influence on the performance of the automatic speech recognition system, is analyzed. In case that a speaker has a high pitch, the distance between adjacent harmonics in the spectrum of voiced phonemes is larger, which results in poorer description of the spectral envelope. Additional problem arises in the case that a band-pass filter from the analysis filter bank covers the range between two harmonics, capturing the part of spectrum without energy, which consequently leads to the detection of sudden, non-existing changes in the spectral envelope. The reduction of these variations is analyzed by using a lower number of MFCC, by expending the bandwidths of the band-pass filters from the filter bank at lower frequencies, and also by low-pass filtering of the filter bank output. Some of the results are somewhat different from the similar results presented in the literature, for which the adequate explanations are offered.
Publisher: IEEE

Show full item record

Page view(s)

checked on May 9, 2024

Google ScholarTM




Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.