Using Crowdsourcing for Multi-label Biomedical Compound Figure Annotation

被引:1
|
作者
Seco de Herrera, Alba Garcia [1 ]
Schaer, Roger [2 ]
Antani, Sameer [1 ]
Mueller, Henning [2 ]
机构
[1] Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD USA
[2] Univ Appl Sci Western Switzerland HES SO, Sierre, Switzerland
来源
DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS | 2016年 / 10008卷
关键词
Multi-label annotation; Compound figures; Crowdsourcing;
D O I
10.1007/978-3-319-46976-8_24
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information analysis or retrieval for images in the biomedical literature needs to deal with a large amount of compound figures (figures containing several subfigures), as they constitute probably more than half of all images in repositories such as PubMed Central, which was the data set used for the task. The ImageCLEFmed benchmark proposed among other tasks in 2015 and 2016 a multi-label classification task, which aims at evaluating the automatic classification of figures into 30 image types. This task was based on compound figures and thus the figures were distributed to participants as compound figures but also in a separated form. Therefore, the generation of a gold standard was required, so that algorithms of participants can be evaluated and compared. This work presents the process carried out to generate the multi-labels of similar to 2650 compound figures using a crowdsourcing approach. Automatic algorithms to separate compound figures into subfigures were used and the results were then validated or corrected via crowdsourcing. The image types (MR, CT, X-ray,...) were also annotated by crowdsourcing including detailed quality control. Quality control is necessary to insure quality of the annotated data as much as possible. similar to 625 h were invested with a cost of similar to 870$.
引用
收藏
页码:228 / 237
页数:10
相关论文
共 50 条
  • [1] Crowdsourcing Multi-label Audio Annotation Tasks with Citizen Scientists
    Cartwright, Mark
    Dove, Graham
    Mendez, Ana Elisa Mendez
    Bello, Juan P.
    Nov, Oded
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [2] Multi-Label Inference for Crowdsourcing
    Zhang, Jing
    Wu, Xindong
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2738 - 2747
  • [3] Multi-label Crowdsourcing Learning
    Li S.-Y.
    Jiang Y.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): : 1497 - 1510
  • [4] Multi-Label Annotation of Music
    Ahsan, Hiba
    Kumar, Vijay
    Jawahar, C. V.
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 150 - 154
  • [5] Scalable Multi-label Annotation
    Deng, Jia
    Russakovsky, Olga
    Krause, Jonathan
    Bernstein, Michael S.
    Berg, Alex
    Li Fei-Fei
    32ND ANNUAL ACM CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2014), 2014, : 3099 - 3102
  • [6] Multi-Label Truth Inference for Crowdsourcing Using Mixture Models
    Zhang, Jing
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 2083 - 2095
  • [7] Multi-label Crowdsourcing Learning with Incomplete Annotations
    Li, Shao-Yuan
    Jiang, Yuan
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 232 - 245
  • [8] Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
    Ravenscroft, James
    Oellrich, Anika
    Saha, Shyamasree
    Liakata, Maria
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4115 - 4123
  • [9] challenges & approaches in multi-label image annotation
    Kalaivani, A.
    Chitrakal, S.
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [10] A Novel Model for Multi-label Image Annotation
    Wu, Xinjian
    Zhang, Li
    Li, Fanzhang
    Wang, Bangjun
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1953 - 1958