A User Study on Snippet Generation: Text Reuse vs. Paraphrases

被引:9
作者
Chen, Wei-Fan [1 ]
Hagen, Matthias [2 ]
Stein, Benno [1 ]
Potthast, Martin [3 ]
机构
[1] Bauhaus Univ Weimar, Weimar, Germany
[2] Martin Luther Univ Halle Wittenberg, Halle, Germany
[3] Univ Leipzig, Leipzig, Germany
来源
ACM/SIGIR PROCEEDINGS 2018 | 2018年
关键词
D O I
10.1145/3209978.3210149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The snippets in the result list of a web search engine are built with sentences from the retrieved web pages that match the query. Reusing a web page's text for snippets has been considered fair use under the copyright laws of most jurisdictions. As of recent, notable exceptions from this arrangement include Germany and Spain, where news publishers are entitled to raise claims under a so-called ancillary copyright. A similar legislation is currently discussed at the European Commission. If this development gains momentum, the reuse of text for snippets will soon incur costs, which in turn will give rise to new solutions for generating truly original snippets. A key question in this regard is whether the users will accept any new approach for snippet generation, or whether they will prefer the current model of "reuse snippets." The paper in hand gives a first answer. A crowdsourcing experiment along with a statistical analysis reveals that our test users exert no significant preference for either kind of snippet. Notwithstanding the technological difficulty, this result opens the door to a new snippet synthesis paradigm.
引用
收藏
页码:1033 / 1036
页数:4
相关论文
共 26 条
  • [1] [Anonymous], 2017, P ACL
  • [2] [Anonymous], 2015, P EMNLP
  • [3] Bando Lorena Leal, 2010, INFORM INTERACTION C, P195, DOI DOI 10.1145/1840784.1840813
  • [4] MACHINE-MADE INDEX FOR TECHNICAL LITERATURE - AN EXPERIMENT
    BAXENDALE, PB
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1958, 2 (04) : 354 - 361
  • [5] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [6] Chopra Sumit, 2016, P NAACL HLT
  • [7] Cutrell E, 2007, CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1 AND 2, P407
  • [8] Recent automatic text summarization techniques: a survey
    Gambhir, Mahak
    Gupta, Vishal
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (01) : 1 - 66
  • [9] Granka L. A., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P478, DOI 10.1145/1008992.1009079
  • [10] Huang Y., 2008, SIGMOD, P315