Low-Quality Structural and Interaction Data Improves Binding Affinity Prediction via Random Forest
被引:75
作者:
Li, Hongjian
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R ChinaChinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R China
Li, Hongjian
[1
]
Leung, Kwong-Sak
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R ChinaChinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R China
Leung, Kwong-Sak
[1
]
Wong, Man-Hon
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R ChinaChinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R China
Wong, Man-Hon
[1
]
Ballester, Pedro J.
论文数: 0引用数: 0
h-index: 0
机构:
INSERM, Canc Res Ctr Marseille, U1068, F-13009 Marseille, France
Inst Paoli Calmettes, F-13009 Marseille, France
Aix Marseille Univ, F-13284 Marseille, France
CNRS, UMR7258, F-13009 Marseille, FranceChinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R China
Ballester, Pedro J.
[2
,3
,4
,5
]
机构:
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin 999077, Hong Kong, Peoples R China
[2] INSERM, Canc Res Ctr Marseille, U1068, F-13009 Marseille, France
[3] Inst Paoli Calmettes, F-13009 Marseille, France
Docking scoring functions can be used to predict the strength of protein-ligand binding. It is widely believed that training a scoring function with low-quality data is detrimental for its predictive performance. Nevertheless, there is a surprising lack of systematic validation experiments in support of this hypothesis. In this study, we investigated to which extent training a scoring function with data containing low-quality structural and binding data is detrimental for predictive performance. We actually found that low-quality data is not only non-detrimental, but beneficial for the predictive performance of machine-learning scoring functions, though the improvement is less important than that coming from high-quality data. Furthermore, we observed that classical scoring functions are not able to effectively exploit data beyond an early threshold, regardless of its quality. This demonstrates that exploiting a larger data volume is more important for the performance of machine-learning scoring functions than restricting to a smaller set of higher data quality.
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Cheng, Tiejun
;
Li, Xun
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Li, Xun
;
Li, Yan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Li, Yan
;
Liu, Zhihai
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Liu, Zhihai
;
Wang, Renxiao
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Ding, Bo
;
Wang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Wang, Jian
;
Li, Nan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Li, Nan
;
Wang, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
机构:
Univ Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
Durrant, Jacob D.
;
McCammon, J. Andrew
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
Univ Calif San Diego, Dept Pharmacol, La Jolla, CA 92093 USA
Univ Calif San Diego, Howard Hughes Med Inst, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Lahti, Jennifer L.
;
Tang, Grace W.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Tang, Grace W.
;
Capriotti, Emidio
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Univ Balearic Isl, Dept Math & Comp Sci, Palma De Mallorca, SpainStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Capriotti, Emidio
;
Liu, Tianyun
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Liu, Tianyun
;
Altman, Russ B.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Cheng, Tiejun
;
Li, Xun
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Li, Xun
;
Li, Yan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Li, Yan
;
Liu, Zhihai
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
Liu, Zhihai
;
Wang, Renxiao
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R ChinaChinese Acad Sci, Shanghai Inst Organ Chem, State Key Lab Bioorgan Chem, Shanghai 200032, Peoples R China
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Ding, Bo
;
Wang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Wang, Jian
;
Li, Nan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Li, Nan
;
Wang, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
机构:
Univ Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
Durrant, Jacob D.
;
McCammon, J. Andrew
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
Univ Calif San Diego, Dept Pharmacol, La Jolla, CA 92093 USA
Univ Calif San Diego, Howard Hughes Med Inst, La Jolla, CA 92093 USAUniv Calif San Diego, Dept Chem & Biochem, NSF Ctr Theoret Biol Phys, La Jolla, CA 92093 USA
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Lahti, Jennifer L.
;
Tang, Grace W.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Tang, Grace W.
;
Capriotti, Emidio
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Univ Balearic Isl, Dept Math & Comp Sci, Palma De Mallorca, SpainStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Capriotti, Emidio
;
Liu, Tianyun
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Liu, Tianyun
;
Altman, Russ B.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Dept Bioengn, Stanford, CA 94305 USA