Summary in context: Searching versus browsing

被引:20
作者
McDonald, DM [1 ]
Chen, HC [1 ]
机构
[1] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
关键词
algorithms; experimentation; summarization; search; browse; generic summaries; information seeking; indicative summaries; text processing; natural language processing;
D O I
10.1145/1125857.1125861
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of text summaries in information-seeking research has focused on query-based summaries. Extracting content that resembles the query alone, however, ignores the greater context of the document. Such context may be central to the purpose and meaning of the document. We developed a generic, a query-based, and a hybrid summarizer, each with differing amounts of document context. The generic summarizer used a blend of discourse information and information obtained through traditional surface-level analysis. The query-based summarizer used only query-term information, and the hybrid summarizer used some discourse information along with query-term information. The validity of the generic summarizer was shown through an intrinsic evaluation using a well-established corpus of human-generated summaries. All three summarizers were then compared in an information-seeking experiment involving 297 subjects. Results from the information-seeking experiment showed that the generic summaries outperformed all others in the browse tasks, while the query-based and hybrid summaries outperformed the generic summary in the search tasks. Thus, the document context of generic summaries helped users browse, while such context was not helpful in search tasks. Such results are interesting given that generic summaries have not been studied in search tasks and the that majority of Internet search engines rely solely on query-based summaries.
引用
收藏
页码:111 / 141
页数:31
相关论文
共 59 条
[1]  
[Anonymous], P 6 WORKSH VER LARG
[2]  
AONE C, 1998, P MESS UND C, V7
[3]  
AONE C, 1999, ADV AUTOMATIC TEXT S, V1, P71
[4]  
Barzilay R., 1999, ADV AUTOMATIC TEXT S
[5]  
BLACK E, 1992, P 5 DARPA SPEECH NAT
[6]  
Boguraev B., 1997, P WORKSH INT SCAL TE
[7]  
BORNER K, 2002, P 2 ACM IEEE CS JOIN
[8]   AUTOMATIC CONDENSATION OF ELECTRONIC PUBLICATIONS BY SENTENCE SELECTION [J].
BRANDOW, R ;
MITZE, K ;
RAU, LF .
INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (05) :675-685
[9]  
Brin S., 1998, 7 INT WORLD WIDE WEB
[10]  
CARBONELL J, 1998, P SIGIR MELB AUSTR