All That Glitters Is Not Gold: Importance of Rigorous Evaluation of Proteochemometric Models
Polina Avdiunina, Shamieraah Jamal, Filipp Gusev, Olexandr Isayev
Abstract
Proteochemometric models (PCMs) are used in computational drug discovery to employ both protein and ligand representations jointly for bioactivity prediction. While machine learning (ML) and deep learning (DL) have come to dominate PCMs, often serving as a basis for scoring functions, rigorous evaluation standards have not always been consistently applied. In this study, using kinase-ligand bioactivity prediction as a model system, we highlight the critical roles of data set curation, permutation testing, class imbalances, and various data splitting strategies for mitigating plausible data leakage and embedding quality in determining model performance. Our findings indicate that data splitting and class imbalances are the most critical factors affecting PCM performance, emphasizing the challenges in the generalizing ability of ML/DL-PCMs. We evaluated various protein–ligand descriptors and embeddings, including those augmented with multiple sequence alignment information. However, permutation testing consistently demonstrated that protein embeddings contributed minimally to PCM efficacy. This study advocates for the adoption of stringent evaluation standards to enhance the generalizability of models to out-of-distribution data and improve benchmarking practices.
Keywords
Cite This Paper
@article{Avdiunina2025,
author = {Avdiunina, Polina and Jamal, Shamieraah and Gusev, Filipp and Isayev, Olexandr},
title = {All That Glitters Is Not Gold: Importance of Rigorous Evaluation of Proteochemometric Models},
year = {2025},
journal = {Journal of Chemical Information and Modeling},
volume = {65},
number = {19},
pages = {10239--10252},
doi = {10.1021/acs.jcim.5c00395},
url = {http://dx.doi.org/10.1021/acs.jcim.5c00395},
publisher = {American Chemical Society (ACS)},
keywords = {proteochemometric models, machine learning, deep learning, kinase-ligand bioactivity prediction, data curation, permutation testing, class imbalances, data splitting},
researchAreas = {drug-discovery, ai-for-science},
citations = {0}
} Copied to clipboard!
Related Research Areas
Related Publications
Transforming Computational Drug Discovery with Machine Learning and AI
ACS Medicinal Chemistry Letters , 9 , 1065–1069 (2018)
Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen
Nature Communications , 10 (2019)
MLatom 3: A Platform for Machine Learning-Enhanced Computational Chemistry Simulations and Workflows
Journal of Chemical Theory and Computation , 20 , 1193–1213 (2024)
Extending machine learning beyond interatomic potentials for predicting molecular properties
Nature Reviews Chemistry , 6 , 653–672 (2022)