Natural Language Generation for Sponsored-Search Advertisements

被引:0
作者
Bartz, Kevin [1 ]
Barr, Cory [1 ]
Aijaz, Adil [1 ]
机构
[1] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
来源
EC'08: PROCEEDINGS OF THE 2008 ACM CONFERENCE ON ELECTRONIC COMMERCE | 2008年
关键词
Sponsored search; pay-per-click; natural language processing; natural language generation; automation; query log; machine learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In sponsored search, advertisers bid on phrases representative of offered products or services.. For large advertisers, these phrases often come from quasi-algorithmically generated lists of thousands of terms prone to poor linguistic construction. A bidded term by itself is usually unsuitable for direct insertion into an ad copy template; it must be rephrased and capitalized properly to fit the template, possibly with additional language to avoid semantic ambiguity. We develop a natural language generation system to automate these steps, preparing a list of terms for insertion into an ad template. For each input term, our system first finds a proper word ordering by mining a corpus of Web search query logs. Next it determines whether the term is ambiguous and-if semantics dictate-attaches a clarifying modifier culled from query logs. Finally, it applies proper capitalization by analyzing pages from Web search engine results. Each step yields a plausible set of displayable forms from which a machine-learned model selects the best. The models are trained and tested on a large set of human-labeled data. The overall system significantly outperforms baseline systems that use simple heuristics.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 16 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Alegria I., 2006, P COLING ACL 2006 MA, P1, DOI 10.3115/1273073.1273074
[3]  
[Anonymous], PANEL DATA ECONOMETR
[4]  
[Anonymous], FRONTIERS ECONOMETRI
[5]   Adaptation of maximum entropy capitalizer: Little data can help a lot [J].
Chelba, Ciprian ;
Acero, Alex .
COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04) :382-399
[6]  
Conway Damian, 1998, P 2 ANN PERL C
[7]  
FAIN DC, 2006, P ACM C EL COMM 2 WO
[8]  
HARDT D, 2001, P CORP LING 2001 C, P266
[9]   Real life, real users, and real needs: a study and analysis of user queries on the web [J].
Jansen, BJ ;
Spink, A ;
Saracevic, T .
INFORMATION PROCESSING & MANAGEMENT, 2000, 36 (02) :207-227
[10]   Stochastic text generation [J].
Oberlander, J ;
Brew, C .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2000, 358 (1769) :1373-1386