Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements

Authors
Li, Yang [1]
Li, Gang
He, Luheng
Zheng, Jingjie
Li, Hong
Guan, Zhiwei
Affiliations
[1] Google Res, Mountain View, CA 94043 USA
Source
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Natural language descriptions of user interface (UI) elements such as alternative text are crucial for accessibility and language-based interaction in general. Yet, these descriptions are constantly missing in mobile UIs. We propose widget captioning, a novel task for automatically generating language descriptions for UI elements from multimodal input including both the image and the structural representations of user interfaces. We collected a large-scale dataset for widget captioning with crowdsourcing. Our dataset contains 162,859 language phrases created by human workers for annotating 61,285 UI elements across 21,750 unique UI screens. We thoroughly analyze the dataset, and train and evaluate a set of deep model configurations to investigate how each feature modality as well as the choice of learning strategies impact the quality of predicted captions. The task formulation and the dataset as well as our benchmark models contribute a solid basis for this novel multimodal captioning task that connects language and user interfaces.
Pages: 5495-5510
Page count: 16