Natural Language Processing (NLP) has revolutionized the way computers are used to study and interact with human languages and is increasingly influential in the study of protein and ligand binding, which is critical for drug discovery and development. This review examines how NLP techniques have been adapted to decode the "language" of proteins and small molecule ligands to predict protein-ligand interactions (PLIs). We discuss how methods such as long short-term memory (LSTM) networks, transformers, and attention mechanisms can leverage different protein and ligand data types to identify potential interaction patterns. Significant challenges are highlighted including the scarcity of high-quality negative data, difficulties in interpreting model decisions, and sampling biases in existing data sets. We argue that focusing on improving data quality, enhancing model robustness, and fostering both collaboration and competition could catalyze future advances in machine-learning-based predictions of PLIs.
机构:
Joint Ctr Struct Genom, San Diego, CA USA
Scripps Res Inst, Dept Integrat Struct & Computat Biol, La Jolla, CA 92037 USAJoint Ctr Struct Genom, San Diego, CA USA
Deller, Marc C.
;
Rupp, Bernhard
论文数: 0引用数: 0
h-index: 0
机构:
Med Univ Innsbruck, Dept Genet Epidemiol, A-6020 Innsbruck, AustriaJoint Ctr Struct Genom, San Diego, CA USA
机构:
Albert Einstein Coll Med, Res, New York, NY USA
Smt Nathiba Hargovandas Lakhmichand Municipal Med, Med, Ahmadabad, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Desai, Dev
;
Kantliwala, Shiv, V
论文数: 0引用数: 0
h-index: 0
机构:
Griffith Univ, Publ Hlth, Brisbane, AustraliaAlbert Einstein Coll Med, Res, New York, NY USA
Kantliwala, Shiv, V
;
Vybhavi, Jyothi
论文数: 0引用数: 0
h-index: 0
机构:
RajaRajeswari Med Coll & Hosp, Physiol, Bangalore, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Vybhavi, Jyothi
;
Ravi, Renju
论文数: 0引用数: 0
h-index: 0
机构:
Jazan Univ, Fac Med, Clin Pharmacol, Jizan, Saudi ArabiaAlbert Einstein Coll Med, Res, New York, NY USA
Ravi, Renju
;
Patel, Harshkumar
论文数: 0引用数: 0
h-index: 0
机构:
Gujarat Med Educ & Res Soc Med Coll, Internal Med, Vadnagar, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Patel, Harshkumar
;
Patel, Jitendra
论文数: 0引用数: 0
h-index: 0
机构:
Gujarat Med Educ & Res Soc Med Coll, Physiol, Vadnagar, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
机构:
Karolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Edfeldt, Kristina
;
Edwards, Aled M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Struct Genom Consortium, Toronto, ON, CanadaKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Edwards, Aled M.
;
Engkvist, Ola
论文数: 0引用数: 0
h-index: 0
机构:
AstraZeneca, Discovery Sci, R&D, Gothenburg, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Engkvist, Ola
;
Guenther, Judith
论文数: 0引用数: 0
h-index: 0
机构:
Bayer AG Res & Dev, Computat Mol Design, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Guenther, Judith
;
论文数: 引用数:
h-index:
机构:
Hartley, Matthew
;
Hulcoop, David G.
论文数: 0引用数: 0
h-index: 0
机构:
Open Targets, Wellcome Genome Campus, Hinxton, Cambs, England
European Bioinformat Inst EMBL EBI, Wellcome Genome Campus, Cambridge, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Hulcoop, David G.
;
Leach, Andrew R.
论文数: 0引用数: 0
h-index: 0
机构:
European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Leach, Andrew R.
;
Marsden, Brian D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Oxford, Ctr Med Discovery, NDM, Oxford, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Marsden, Brian D.
;
Menge, Amelie
论文数: 0引用数: 0
h-index: 0
机构:
Goethe Univ Frankfurt, Inst Pharmaceut Chem, D-60438 Frankfurt, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Menge, Amelie
;
Misquitta, Leonie
论文数: 0引用数: 0
h-index: 0
机构:
NIH, Natl Lib Med, Bethesda, MD USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Misquitta, Leonie
;
Mueller, Susanne
论文数: 0引用数: 0
h-index: 0
机构:
Goethe Univ Frankfurt, Inst Pharmaceut Chem, D-60438 Frankfurt, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Mueller, Susanne
;
Owen, Dafydd R.
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer Worldwide Res, Dev & Med, Cambridge, MA USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Owen, Dafydd R.
;
Schuett, Kristof T.
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Schuett, Kristof T.
;
Skelton, Nicholas
论文数: 0引用数: 0
h-index: 0
机构:
Genentech Inc, Dept Discovery Chem, South San Francisco, CA USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Skelton, Nicholas
;
Steffen, Andreas
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Steffen, Andreas
;
论文数: 引用数:
h-index:
机构:
Tropsha, Alexander
;
Vernet, Erik
论文数: 0引用数: 0
h-index: 0
机构:
Digital Sci & Innovat, Novo Nord A S, Malov, DenmarkKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Vernet, Erik
;
Wang, Yanli
论文数: 0引用数: 0
h-index: 0
机构:
NIH, Natl Lib Med, Bethesda, MD USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Wang, Yanli
;
Wellnitz, James
论文数: 0引用数: 0
h-index: 0
机构:
Univ North Carolina, UNC Eshelman Sch Pharm, Div Chem Biol & Med Chem, Lab Mol Modeling, Chapel Hill, NC USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Wellnitz, James
;
Willson, Timothy M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ North Carolina, UNC Eshelman Sch Pharm, Struct Genom Consortium, Chapel Hill, NC USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Willson, Timothy M.
;
Clevert, Djork-Arne
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Clevert, Djork-Arne
;
论文数: 引用数:
h-index:
机构:
Haibe-Kains, Benjamin
;
Schiavone, Lovisa Holmberg
论文数: 0引用数: 0
h-index: 0
机构:
AstraZeneca, Discovery Biol, Discovery Sci, R&D, Gothenburg, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
机构:
Joint Ctr Struct Genom, San Diego, CA USA
Scripps Res Inst, Dept Integrat Struct & Computat Biol, La Jolla, CA 92037 USAJoint Ctr Struct Genom, San Diego, CA USA
Deller, Marc C.
;
Rupp, Bernhard
论文数: 0引用数: 0
h-index: 0
机构:
Med Univ Innsbruck, Dept Genet Epidemiol, A-6020 Innsbruck, AustriaJoint Ctr Struct Genom, San Diego, CA USA
机构:
Albert Einstein Coll Med, Res, New York, NY USA
Smt Nathiba Hargovandas Lakhmichand Municipal Med, Med, Ahmadabad, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Desai, Dev
;
Kantliwala, Shiv, V
论文数: 0引用数: 0
h-index: 0
机构:
Griffith Univ, Publ Hlth, Brisbane, AustraliaAlbert Einstein Coll Med, Res, New York, NY USA
Kantliwala, Shiv, V
;
Vybhavi, Jyothi
论文数: 0引用数: 0
h-index: 0
机构:
RajaRajeswari Med Coll & Hosp, Physiol, Bangalore, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Vybhavi, Jyothi
;
Ravi, Renju
论文数: 0引用数: 0
h-index: 0
机构:
Jazan Univ, Fac Med, Clin Pharmacol, Jizan, Saudi ArabiaAlbert Einstein Coll Med, Res, New York, NY USA
Ravi, Renju
;
Patel, Harshkumar
论文数: 0引用数: 0
h-index: 0
机构:
Gujarat Med Educ & Res Soc Med Coll, Internal Med, Vadnagar, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
Patel, Harshkumar
;
Patel, Jitendra
论文数: 0引用数: 0
h-index: 0
机构:
Gujarat Med Educ & Res Soc Med Coll, Physiol, Vadnagar, IndiaAlbert Einstein Coll Med, Res, New York, NY USA
机构:
Karolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Edfeldt, Kristina
;
Edwards, Aled M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Struct Genom Consortium, Toronto, ON, CanadaKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Edwards, Aled M.
;
Engkvist, Ola
论文数: 0引用数: 0
h-index: 0
机构:
AstraZeneca, Discovery Sci, R&D, Gothenburg, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Engkvist, Ola
;
Guenther, Judith
论文数: 0引用数: 0
h-index: 0
机构:
Bayer AG Res & Dev, Computat Mol Design, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Guenther, Judith
;
论文数: 引用数:
h-index:
机构:
Hartley, Matthew
;
Hulcoop, David G.
论文数: 0引用数: 0
h-index: 0
机构:
Open Targets, Wellcome Genome Campus, Hinxton, Cambs, England
European Bioinformat Inst EMBL EBI, Wellcome Genome Campus, Cambridge, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Hulcoop, David G.
;
Leach, Andrew R.
论文数: 0引用数: 0
h-index: 0
机构:
European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Leach, Andrew R.
;
Marsden, Brian D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Oxford, Ctr Med Discovery, NDM, Oxford, EnglandKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Marsden, Brian D.
;
Menge, Amelie
论文数: 0引用数: 0
h-index: 0
机构:
Goethe Univ Frankfurt, Inst Pharmaceut Chem, D-60438 Frankfurt, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Menge, Amelie
;
Misquitta, Leonie
论文数: 0引用数: 0
h-index: 0
机构:
NIH, Natl Lib Med, Bethesda, MD USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Misquitta, Leonie
;
Mueller, Susanne
论文数: 0引用数: 0
h-index: 0
机构:
Goethe Univ Frankfurt, Inst Pharmaceut Chem, D-60438 Frankfurt, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Mueller, Susanne
;
Owen, Dafydd R.
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer Worldwide Res, Dev & Med, Cambridge, MA USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Owen, Dafydd R.
;
Schuett, Kristof T.
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Schuett, Kristof T.
;
Skelton, Nicholas
论文数: 0引用数: 0
h-index: 0
机构:
Genentech Inc, Dept Discovery Chem, South San Francisco, CA USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Skelton, Nicholas
;
Steffen, Andreas
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Steffen, Andreas
;
论文数: 引用数:
h-index:
机构:
Tropsha, Alexander
;
Vernet, Erik
论文数: 0引用数: 0
h-index: 0
机构:
Digital Sci & Innovat, Novo Nord A S, Malov, DenmarkKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Vernet, Erik
;
Wang, Yanli
论文数: 0引用数: 0
h-index: 0
机构:
NIH, Natl Lib Med, Bethesda, MD USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Wang, Yanli
;
Wellnitz, James
论文数: 0引用数: 0
h-index: 0
机构:
Univ North Carolina, UNC Eshelman Sch Pharm, Div Chem Biol & Med Chem, Lab Mol Modeling, Chapel Hill, NC USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Wellnitz, James
;
Willson, Timothy M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ North Carolina, UNC Eshelman Sch Pharm, Struct Genom Consortium, Chapel Hill, NC USAKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Willson, Timothy M.
;
Clevert, Djork-Arne
论文数: 0引用数: 0
h-index: 0
机构:
Pfizer, Worldwide Res Dev & Med, Machine Learning & Computat Sci, Berlin, GermanyKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden
Clevert, Djork-Arne
;
论文数: 引用数:
h-index:
机构:
Haibe-Kains, Benjamin
;
Schiavone, Lovisa Holmberg
论文数: 0引用数: 0
h-index: 0
机构:
AstraZeneca, Discovery Biol, Discovery Sci, R&D, Gothenburg, SwedenKarolinska Univ Hosp, Dept Med, Struct Genom Consortium, Stockholm, Sweden