Using Crowdsourcing for Multi-label Biomedical Compound Figure Annotation

被引：1

作者：

Seco de Herrera, Alba Garcia ^{[1
]}

Schaer, Roger ^{[2
]}

Antani, Sameer ^{[1
]}

Mueller, Henning ^{[2
]}

机构：

[1] Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD USA

[2] Univ Appl Sci Western Switzerland HES SO, Sierre, Switzerland

来源：

DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS | 2016年 / 10008卷

关键词：

Multi-label annotation; Compound figures; Crowdsourcing;

D O I：

10.1007/978-3-319-46976-8_24

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Information analysis or retrieval for images in the biomedical literature needs to deal with a large amount of compound figures (figures containing several subfigures), as they constitute probably more than half of all images in repositories such as PubMed Central, which was the data set used for the task. The ImageCLEFmed benchmark proposed among other tasks in 2015 and 2016 a multi-label classification task, which aims at evaluating the automatic classification of figures into 30 image types. This task was based on compound figures and thus the figures were distributed to participants as compound figures but also in a separated form. Therefore, the generation of a gold standard was required, so that algorithms of participants can be evaluated and compared. This work presents the process carried out to generate the multi-labels of similar to 2650 compound figures using a crowdsourcing approach. Automatic algorithms to separate compound figures into subfigures were used and the results were then validated or corrected via crowdsourcing. The image types (MR, CT, X-ray,...) were also annotated by crowdsourcing including detailed quality control. Quality control is necessary to insure quality of the annotated data as much as possible. similar to 625 h were invested with a cost of similar to 870$.

引用

页码：228 / 237

页数：10

共 50 条

[1] Crowdsourcing Multi-label Audio Annotation Tasks with Citizen Scientists
Cartwright, Mark
Dove, Graham
Mendez, Ana Elisa Mendez
Bello, Juan P.
Nov, Oded
CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[2] Multi-Label Inference for Crowdsourcing
Zhang, Jing
Wu, Xindong
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2738 - 2747
[3] Multi-label Crowdsourcing Learning
Li S.-Y.
Jiang Y.
Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): : 1497 - 1510
[4] Multi-Label Annotation of Music
Ahsan, Hiba
Kumar, Vijay
Jawahar, C. V.
2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 150 - 154
[5] Scalable Multi-label Annotation
Deng, Jia
Russakovsky, Olga
Krause, Jonathan
Bernstein, Michael S.
Berg, Alex
Li Fei-Fei
32ND ANNUAL ACM CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2014), 2014, : 3099 - 3102
[6] Multi-Label Truth Inference for Crowdsourcing Using Mixture Models
Zhang, Jing
Wu, Xindong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 2083 - 2095
[7] Multi-label Crowdsourcing Learning with Incomplete Annotations
Li, Shao-Yuan
Jiang, Yuan
PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 232 - 245
[8] Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Ravenscroft, James
Oellrich, Anika
Saha, Shyamasree
Liakata, Maria
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4115 - 4123
[9] challenges & approaches in multi-label image annotation
Kalaivani, A.
Chitrakal, S.
2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
[10] A Novel Model for Multi-label Image Annotation
Wu, Xinjian
Zhang, Li
Li, Fanzhang
Wang, Bangjun
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1953 - 1958

← 1 2 3 4 5 →