Exploring scientific literature by textual and image content using DRIFT

被引:3
作者
Pocco, Ximena [1 ]
da Silva, Tiago [2 ]
Poco, Jorge [2 ]
Nonato, Luis Gustavo [3 ]
Gomez-Nieto, Erick [1 ]
机构
[1] Univ Catolica San Pablo, Dept Comp Sci, Quinta Vivanco S-N Urb, Arequipa, Peru
[2] Getulio Vargas Fdn, Sch Appl Math, Praia Botafogo 190, BR-22250900 Rio De Janeiro, RJ, Brazil
[3] Univ Sao Paulo, Inst Math & Comp Sci, Trabalhador Sao Carlense Ave 400, BR-13566590 Sao Carlos, SP, Brazil
来源
COMPUTERS & GRAPHICS-UK | 2022年 / 103卷
基金
巴西圣保罗研究基金会;
关键词
Scientific literature; Search interfaces; Multimodal processing; Visual analytics; VISUAL ANALYSIS; EXPLORATION; WEB; VISUALIZATION; COLLECTIONS; RETRIEVAL;
D O I
10.1016/j.cag.2022.02.005
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Digital libraries represent the most valuable resource for storing, querying, and retrieving scientific literature. Traditionally, the reader/analyst aims to compose a set of articles based on keywords, according to his/her preferences, and manually inspect the resulting list of documents. Except for the articles which share citations or common keywords, the results retrieved will be limited to those which fulfill a syntactic match. Besides, if instead of having an article as a reference, the user has an image, the process of finding and exploring articles with similar content becomes infeasible. This paper proposes a visual analytic methodology for exploring and analyzing scientific document collections that consider both textual and image content. The proposed technique relies on combining multiple Content-Based Image Retrieval (CBIR) components and multidimensional projection to map the documents to a visual space based on their similarity, thus enabling an interactive exploration. Moreover, we extend its analytical capabilities with visual resources to display complementary information on selected documents that uncover hidden patterns and semantic relations. We evidence the effectiveness of our methodology through three case studies and a user evaluation, which attest to its usefulness during the process of scientific collections exploration. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:140 / 152
页数:13
相关论文
共 45 条
[31]   Covid-on-the-Web: Exploring the COVID-19 scientific literature through visualization of linked data from entity and argument mining [J].
Menin, Aline ;
Michel, Franck ;
Gandon, Fabien ;
Gazzotti, Raphael ;
Cabrio, Elena ;
Corby, Olivier ;
Giboin, Alain ;
Marro, Santiago ;
Mayer, Tobias ;
Villata, Serena ;
Winckler, Marco .
QUANTITATIVE SCIENCE STUDIES, 2022, 2 (04) :1301-1323
[32]   Content-based image retrieval using Generic Fourier Descriptor and Gabor Filters [J].
He, Quan ;
Ji, ZhengQiao ;
Wu, Q. M. Jonathan .
VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, :525-528
[33]   USING SCIENTIFIC COMMUNICATION AND EDUCOMMUNICATION PRACTICES IN WEB 2.0 TO PRODUCE CULTURAL VIRAL CONTENT: AN EXPERIENCE PAPER [J].
Arellano, H. ;
Vasquez, D. ;
Vasquez, M. .
EDULEARN18: 10TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2018, :10791-10798
[34]   Automatic Semantic Labelling of Images by Their Content using Non-Parametric Bayesian Machine Learning and Image Search using Synthetically Generated Image Collages [J].
Niemeyer, Michael ;
Arandjelovic, Ognjen .
2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, :160-168
[35]   Modern Slavery Disclosure Practices: A Systematic Literature Review Using Bibliometric and Thematic Content Analyses [J].
Madhavika, Naduni ;
Mansi, Mansi ;
Pandey, Rakesh ;
Potdar, Balkrushna .
INTERNATIONAL JOURNAL OF DISCLOSURE AND GOVERNANCE, 2024,
[36]   Content-based image retrieval for medical diagnosis using fuzzy clustering and deep learning [J].
Sudhish, Dhanya K. ;
Nair, Latha R. ;
Shailesh, S. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
[37]   Scientific Literature Information Extraction Using Text Mining Techniques for Human Health Risk Assessment of Electromagnetic Fields [J].
Lee, Sang-Woo ;
Kwon, Jung-Hyok ;
Lee, Ben ;
Kim, Eui-Jik .
SENSORS AND MATERIALS, 2020, 32 (01) :149-157
[38]   Image indexing and content analysis in children's picture books using a large-scale database [J].
Huang, Chengwei ;
Jiang, Hao .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (15) :20679-20695
[39]   What's Hot and What's Not? - Exploring Trends in Bioinformatics Literature Using Topic Modeling and Keyword Analysis [J].
Hahn, Alexander ;
Mohanty, Somya D. ;
Manda, Prashanti .
BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2017), 2017, 10330 :279-290
[40]   Content-based image retrieval techniques for the analysis of dermatological lesions using particle swarm optimization technique [J].
Jiji, G. Wiselin ;
DuraiRaj, P. Johnson .
APPLIED SOFT COMPUTING, 2015, 30 :650-662