LEVERAGING MANIFOLD LEARNING FOR EXTRACTIVE BROADCAST NEWS SUMMARIZATION

被引：0

作者：

Liu, Shih-Hung ^{[2
]}

Chen, Kuan-Yu ^{[2
]}

Chen, Berlin ^{[1
]}

Wang, Hsin-Min ^{[2
]}

Hsu, Wen-Lian ^{[2
]}

机构：

[1] Natl Taiwan Normal Univ, Taipei, Taiwan

[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

关键词：

Manifold learning; nonlinear dimension reduction; local invariance; extractive summarization; SPEECH; TEXT;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Extractive speech summarization is intended to produce a condensed version of the original spoken document by selecting a few salient sentences from the document and concatenate them together to form a summary. In this paper, we study a novel use of manifold learning techniques for extractive speech summarization. Manifold learning has experienced a surge of research interest in various domains concerned with dimensionality reduction and data representation recently, but has so far been largely under-explored in extractive text or speech summarization. Our contributions in this paper are at least twofold. First, we explore the use of several manifold learning algorithms to capture the latent semantic information of sentences for enhanced extractive speech summarization, including isometric feature mapping (ISOMAP), locally linear embedding (LLE) and Laplacian eigenmap. Second, the merits of our proposed summarization methods and several widely-used methods are extensively analyzed and compared. The empirical results demonstrate the effectiveness of our unsupervised summarization methods, in relation to several state-of-the-art methods. In particular, a synergy of the manifold learning based methods and state-of-the-art methods, such as the integer linear programming (ILP) method, contributes to further gains in summarization performance.

引用

页码：5805 / 5809

页数：5

共 35 条

[1] [Anonymous], 2008, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, DOI DOI 10.1145/1390334.1390386
[2] [Anonymous], 2011, SPOKEN LANGUAGE UNDE
[3] [Anonymous], P 48 ANN M ASS COMP
[4] [Anonymous], 2003, ROUGE RECALL ORIENTE
[5] [Anonymous], 2011, Modern Information Retrieval: The Concepts and Technology behind Search
[6] BAXENDALE PB, 1958, IBM J
[7] Belkin M, 2002, ADV NEUR IN, V14, P585
[8] Carbonell J., 2001, P ANN INT ACM SIGIR, P19
[9] Soft indexing of speech content for search in spoken documents
Chelba, Ciprian
Silva, Jorge
Acero, Alex
[J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (03) : 458 - 478
[10] Cox T. F., 2000, MULTIDIMENSIONAL SCA, P2326

← 1 2 3 4 →