Associating biological context with protein-protein interactions through text mining at PubMed scale
Cited by: 2
Authors:
Sosa, Daniel N. [1]
Hintzen, Rogier [2]
Xiong, Betty [1]
de Giorgio, Alex [2]
Fauqueur, Julien [2]
Davies, Mark [2]
Lever, Jake [3]
Altman, Russ B. [4, 5]
Affiliations:
[1] Stanford Univ, Dept Biomed Data Sci, Stanford, CA USA
[2] BenevolentAI, London, England
[3] Univ Glasgow, Glasgow, Scotland
[4] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Genet, Stanford, CA USA
Inferring knowledge from known relationships between drugs, proteins, genes, and diseases has great potential for clinical impact, such as predicting which existing drugs could be repurposed to treat rare diseases. Incorporating key biological context such as cell type or tissue of action into representations of extracted biomedical knowledge is essential for principled pharmacological discovery. Existing global, literature-derived knowledge graphs of interactions between drugs, proteins, genes, and diseases lack this essential information. In this study, we frame the task of associating biological context with protein-protein interactions extracted from text as a classification task using syntactic, semantic, and novel meta-discourse features. We introduce the Insider corpora, which are automatically generated PubMed-scale corpora for training classifiers for the context association task. These corpora are created by searching for precise syntactic cues of cell type and tissue relevancy to extracted regulatory relations. We report F1 scores of 0.955 and 0.862 for identifying relevant cell types and tissues, respectively, for our identified relations. By classifying with this framework, we demonstrate that the problem of context association can be addressed using intuitive, interpretable features. We demonstrate the potential of this approach to enrich text-derived knowledge bases with biological detail by incorporating cell type context into a protein-protein network for dengue fever.