Multilingual I-vector based statistical modeling for music genre classification

Jia Dai, Wei Xue, Wenju Liu

Research output: Contribution to journalConference article published in journalpeer-review

4 Citations (Scopus)

Abstract

For music signal processing, compared with the strategy which models each short-time frame independently, when the long-time features are considered, the time-series characteristics of the music signal can be better presented. As a typical kind of long-time modeling strategy, the identification vector (i-vector) uses statistical modeling to model the audio signal in the segment level. It can better capture the important elements of the music signal, and these important elements may benefit to the classification of music signal. In this paper, the ivector based statistical feature for music genre classification is explored. In addition to learn enough important elements for music signal, a new multilingual i-vector feature is proposed based on the multilingual model. The experimental results show that the multilingual i-vector based models can achieve better classification performances than conventional short-time modeling based methods.

Original languageEnglish
Pages (from-to)459-463
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2017-August
DOIs
Publication statusPublished - 2017
Externally publishedYes
Event18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden
Duration: 20 Aug 201724 Aug 2017

Bibliographical note

Publisher Copyright:
Copyright © 2017 ISCA.

Keywords

  • I-vector
  • Multilingual
  • Music genre classification
  • Statistical feature

Fingerprint

Dive into the research topics of 'Multilingual I-vector based statistical modeling for music genre classification'. Together they form a unique fingerprint.

Cite this