Abstract
Several mel-band-based metrics and a single MFCC-based error metric were evaluated for best correspondence with human discrimination of single tones resynthesized from similar musical instrument time-varying spectra. Results show high levels of correspondence that are very close and often nearly identical to those found previously for harmonic and critical-band error metrics. The number of spectrum-related terms in the metrics required to achieve 85% R2 correspondence is about five for harmonics, ten for mel bands, and ten for MFCCs, leading to the conjecture that subjects discriminate more on the basis of the first few harmonics than on the broad spectral envelope.
| Original language | English |
|---|---|
| Pages (from-to) | 290-303 |
| Number of pages | 14 |
| Journal | AES: Journal of the Audio Engineering Society |
| Volume | 59 |
| Issue number | 5 |
| Publication status | Published - May 2011 |