RSHMM++ for extractive lecture speech summarization

Justin Jian Zhang, Shilei Huang, Pascale Fung

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

We propose an enhanced Rhetorical-State Hidden Markov Model (RSHMM++) for extracting hierarchical structural summaries from lecture speech. One of the most underutilized information in extractive summarization is rhetorical structure hidden in speech data. RSHMM++ automatically decodes this underlying information in order to provide better summaries. We show that RSHMM++ gives a 72.01% ROUGE-L F-measure, a 9.78% absolute increase in lecture speech summarization performance compared to the baseline system without using rhetorical information. We also propose Relaxed DTW for compiling reference summaries.

Original languageEnglish
Title of host publication2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings
PublisherIEEE Computer Society
Pages161-164
Number of pages4
ISBN (Print)9781424434725
DOIs
Publication statusPublished - 2008
Event2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Goa, India
Duration: 15 Dec 200819 Dec 2008

Publication series

Name2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings

Conference

Conference2008 IEEE Workshop on Spoken Language Technology, SLT 2008
Country/TerritoryIndia
CityGoa
Period15/12/0819/12/08

Keywords

  • Rhetorical information
  • Speech summarization

Fingerprint

Dive into the research topics of 'RSHMM++ for extractive lecture speech summarization'. Together they form a unique fingerprint.

Cite this