Topic aspect-oriented summarization via group selection

被引:27
作者
Fang, Hanyin [1 ]
Lu, Weiming [1 ]
Wu, Fei [1 ]
Zhang, Yin [1 ]
Shang, Xindi [1 ]
Shao, Jian [1 ]
Zhuang, Yueting [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
关键词
Large margin; Group norm; Submodular function; Text/image summarization;
D O I
10.1016/j.neucom.2014.08.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The summarization is desirable to efficiently apprehend the gist of the huge amount of data and becomes a significant challenge in many applications such as news article summarization and social media mining. Considering the summaries from multi-documents of one topic can describe various aspects of one given topic, this paper attempts to exploit appropriate priors to generate topic aspect-oriented summarization (abbreviated as TAOS). The underlying intuition of the proposed TAOS is that different topics can prefer different aspects and the different aspects can be represented by different preference of features(e.g., technical topic may prefer proper noun than sports topic). In order to materialize the intuition of TAOS, we first extract several groups of features according to topic factors, and then a group norm penalty (i.e., ewe, norm) and latent variables are utilized to select overlapping groups of features. We compare our proposed approach with some state-of-the-art methods on DUC2003, DUC2004 datasets for text summarization and NUS-Wide dataset for image summarization. The results show our method can generate meaningful summarization in terms of ROUGE and Jensen Shannon Divergence metrics. (C) 2014 Published by Elsevier B.V.
引用
收藏
页码:1613 / 1619
页数:7
相关论文
共 22 条
  • [1] [Anonymous], AAAI
  • [2] [Anonymous], 2009, P ACM INT C IM VID R
  • [3] [Anonymous], 2009, ICML, DOI DOI 10.1145/1553374.1553523
  • [4] Chen D., GROUP NORM LEARNING
  • [5] Clustering by passing messages between data points
    Frey, Brendan J.
    Dueck, Delbert
    [J]. SCIENCE, 2007, 315 (5814) : 972 - 976
  • [6] Hadi Y., 2006, Applied Computing 2006. 21st Annual ACM Symposium on Applied Computing, P1400, DOI 10.1145/1141277.1141601
  • [7] He Z, 2012, 26 AAAI C ART INT
  • [8] Learning the parts of objects by non-negative matrix factorization
    Lee, DD
    Seung, HS
    [J]. NATURE, 1999, 401 (6755) : 788 - 791
  • [9] Lin C.Y., 2004, P TEXT SUMM BRANCH O, P74
  • [10] Lin Chin-Yew, 2006, P HUM LANG TECHN C N, P463, DOI DOI 10.3115/1220835.1220894