Automatic summarization of online customer reviews

被引:0
作者
Zhan, Jiaming [1 ]
Loh, Han Tong [1 ]
Liu, Ying [2 ]
机构
[1] Natl Univ Singapore, Dept Mech Engn, Singapore 117548, Singapore
[2] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Hong Kong, Peoples R China
来源
WEBIST 2007: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL SEBEG/EL: SOCIETY, E-BUSINESS AND E-GOVERNMENT, E-LEARNING | 2007年
关键词
e-Commerce; customer reviews; multi-document summarization; web mining;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Online customer reviews offer valuable information for merchants and potential shoppers in e-Commerce and e-Business. However, even for a single product, the number of reviews often amounts to hundreds or thousands. Thus, summarization of multiple reviews is helpful to extract the important issues that merchants and customers are concerned about. Existing methods of multi-document summarization divide documents into non-overlapping clusters first and then summarize each cluster of documents individually with the assumption that each cluster discusses a single topic. When applied to summarize customer reviews, it is however difficult to determine the number of clusters without the prior domain knowledge, and moreover, topics often overlap with each other in a collection of customer reviews. In this paper, we propose a summarization approach based on the topical structure of multiple customer reviews. Instead of clustering and summarization, our approach extracts topics from a collection of reviews and further ranks the topics based on their frequency. The summary is then generated according to the ranked topics. The evaluation results showed that our approach outperformed the baseline summarization systems, i.e. Copernic summarizer and clustering-summarization, in terms of users' responsiveness.
引用
收藏
页码:5 / +
页数:3
相关论文
共 29 条
  • [1] AHONEN H, 1999, P ICML 99 WORKSH MAC
  • [2] Barzilay R., 1997, Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, P10
  • [3] BOROS E, 2001, P 24 ANN INT ACM SIG
  • [4] Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
  • [5] Choi FYY, 2000, 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, pA26
  • [6] NEW METHODS IN AUTOMATIC EXTRACTING
    EDMUNDSON, HP
    [J]. JOURNAL OF THE ACM, 1969, 16 (02) : 264 - +
  • [7] Gong Y., 2001, P 24 ANN INT ACM SIG, P19, DOI DOI 10.1145/383952.383955
  • [8] Hearst MA, 1997, COMPUT LINGUIST, V23, P33
  • [9] HOVY E, 1997, P ACL 97 EACL 97 WOR, P18
  • [10] Hu M., 2004, ACM SIGKDD INT C KNO, P168, DOI [10.1016/j.engappai.2017.08.006, DOI 10.1016/J.ENGAPPAI.2017.08.006, 10.1145/1014052.1014073]