共 36 条
MeSHHeading2vec: a new method for representing MeSH headings as vectors based on graph embedding algorithm
被引:33
作者:
Guo, Zhen-Hao
[1
]
You, Zhu-Hong
[1
,2
]
Huang, De-Shuang
[3
,4
]
Yi, Hai-Cheng
[2
]
Zheng, Kai
[5
]
Chen, Zhan-Heng
[1
]
Wang, Yan-Bin
[6
]
机构:
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
[4] Tongji Univ, Inst Machines Learning & Syst Biol, Shanghai, Peoples R China
[5] China Univ Min & Technol, Xuzhou, Jiangsu, Peoples R China
[6] Zhejiang Univ, Hangzhou, Peoples R China
基金:
美国国家科学基金会;
国家重点研发计划;
关键词:
MeSHHeading2vec;
MeSH relationship network;
graph embedding;
computational prediction model;
SETS;
D O I:
10.1093/bib/bbaa037
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Effectively representing Medical Subject Headings (MeSH) headings (terms) such as disease and drug as discriminative vectors could greatly improve the performance of downstream computational prediction models. However, these terms are often abstract and difficult to quantify. In this paper, we converted the MeSH tree structure into a relationship network and applied several graph embedding algorithms on it to represent these terms. Specifically, the relationship network consisting of nodes (MeSH headings) and edges (relationships), which can be constructed by the tree num. Then, five graph embedding algorithms including DeepWalk, LINE, SDNE, LAP and HOPE were implemented on the relationship network to represent MeSH headings as vectors. In order to evaluate the performance of the proposed methods, we carried out the node classification and relationship prediction tasks. The results show that the MeSH headings characterized by graph embedding algorithms can not only be treated as an independent carrier for representation, but also can be utilized as additional information to enhance the representation ability of vectors. Thus, it can serve as an input and continue to play a significant role in any computational models related to disease, drug, microbe, etc. Besides, our method holds great hope to inspire relevant researchers to study the representation of terms in this network perspective.
引用
收藏
页码:2085 / 2095
页数:11
相关论文