<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Isayev Lab Publications</title><description>Research publications in machine learning, computational chemistry, and drug discovery from the Isayev Lab at Carnegie Mellon University.</description><link>https://olexandrisayev.com/</link><language>en-us</language><managingEditor>olexandr@cmu.edu (Olexandr Isayev)</managingEditor><webMaster>olexandr@cmu.edu (Olexandr Isayev)</webMaster><copyright>Copyright 2026 Olexandr Isayev</copyright><category>Science</category><category>Chemistry</category><category>Machine Learning</category><category>Publications</category><item><title>AIQM3: Targeting Coupled-Cluster Accuracy with Semi-Empirical Speed across Seven Main-Group Elements</title><link>https://doi.org/10.1021/acs.jctc.5c01794</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jctc.5c01794</guid><description>Chen, Yuxinxin and Hou, Yi-Fan and Zubatyuk, Roman and Isayev, Olexandr and Dral, Pavlo O.. Journal of Chemical Theory and Computation (2026)</description><pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate><category>AIQM3</category><category>semi-empirical methods</category><category>coupled cluster</category><category>main-group chemistry</category><category>machine learning potentials</category><category>hybrid quantum chemistry</category><author>Chen, Yuxinxin et al.</author></item><item><title>Applications of modular co-design for &lt;i&gt;de novo&lt;/i&gt; 3D molecule generation</title><link>https://doi.org/10.1039/d5dd00380f</link><guid isPermaLink="true">https://doi.org/10.1039/d5dd00380f</guid><description>Reidenbach, Danny and Nikitin, Filipp and Isayev, Olexandr and Paliwal, Saee Gopal. Digital Discovery (2026)</description><pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate><category>3D molecule generation</category><category>de novo design</category><category>modular co-design</category><category>generative models</category><category>structure-based design</category><author>Reidenbach, Danny et al.</author></item><item><title>AIMNet2: a neural network potential to meet your neutral, charged, organic, and elemental-organic needs</title><link>https://doi.org/10.1039/d4sc08572h</link><guid isPermaLink="true">https://doi.org/10.1039/d4sc08572h</guid><description>Machine learned interatomic potentials (MLIPs) are reshaping computational chemistry practices because of their ability to drastically exceed the accuracy-length/time scale tradeoff.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>neutral molecules</category><category>charged systems</category><category>organic compounds</category><category>elemental-organic hybrids</category><category>universal applicability</category><author>Anstine, Dylan M. et al.</author></item><item><title>All That Glitters Is Not Gold: Importance of Rigorous Evaluation of Proteochemometric Models</title><link>https://doi.org/10.1021/acs.jcim.5c00395</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.5c00395</guid><description>Proteochemometric models (PCMs) are used in computational drug discovery to employ both protein and ligand representations jointly for bioactivity prediction. While machine learning (ML) and deep learning (DL) have come to dominate PCMs, often serving as a basis for scoring functions, rigorous evaluation standards have not always been consistently applied. In this study, using kinase-ligand bioactivity prediction as a model system, we highlight the critical roles of data set curation, permutation testing, class imbalances, and various data splitting strategies for mitigating plausible data leakage and embedding quality in determining model performance. Our findings indicate that data splitting and class imbalances are the most critical factors affecting PCM performance, emphasizing the challenges in the generalizing ability of ML/DL-PCMs. We evaluated various protein–ligand descriptors and embeddings, including those augmented with multiple sequence alignment information. However, permutation testing consistently demonstrated that protein embeddings contributed minimally to PCM efficacy. This study advocates for the adoption of stringent evaluation standards to enhance the generalizability of models to out-of-distribution data and improve benchmarking practices.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>proteochemometric models</category><category>machine learning</category><category>deep learning</category><category>kinase-ligand bioactivity prediction</category><category>data curation</category><category>permutation testing</category><category>class imbalances</category><category>data splitting</category><author>Avdiunina, Polina et al.</author></item><item><title>Anticipating the Selectivity of Intramolecular Cyclization Reaction Pathways with Neural Network Potentials</title><link>https://doi.org/10.1021/acs.jctc.5c01161</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jctc.5c01161</guid><description>Reaction mechanism search tools have demonstrated the ability to provide insights into likely products and rate-limiting steps of reacting systems. However, reactions involving several concerted bond changes─as can be found in many key steps of natural product syntheses─can complicate the search process. To mitigate these complications, we present a mechanism search strategy particularly suited to help expedite exploration of an exemplary family of such complex reactions, cyclizations. We provide a cost-effective strategy for identifying relevant elementary reaction steps by combining graph-based enumeration schemes and machine learning techniques for intermediate filtering. Key to this approach is our use of a neural network potential (NNP), AIMNet2-rxn, for computational evaluation of each candidate reaction pathway. In this article, we evaluate the NNP&apos;s ability to estimate activation energies, demonstrate the correct anticipation of stereoselectivity, and recapitulate complex enabling steps in natural product synthesis.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>reaction pathway exploration</category><category>graph-based enumeration</category><category>intermediate filtering</category><category>cyclization selectivity</category><category>aimnet2-rxn application</category><author>Casetti, Nicholas et al.</author></item><item><title>Proto-Yield: An Uncertainty-Aware Prototype Network for Yield Prediction in Real-world Chemical Reactions</title><link>https://doi.org/10.1145/3746252.3761323</link><guid isPermaLink="true">https://doi.org/10.1145/3746252.3761323</guid><description>Reaction yield prediction underpins computer-aided synthesis prediction (CASP). Formulated as a regression problem that takes both reactants and products as input, this task has been extensively studied using machine learning methods, based on handcrafted fingerprint features, SMILES encoded by Transformers, and molecular graphs encoded by Graph Neural Networks. However, a major limitation of these methods is their inability to effectively capture and model the underlying uncertainties, arising both from the inherently stochastic nature of chemical reaction processes and from inconsistencies or noise in how yields are measured and reported. What makes this seemingly simple regression problem even more challenging is the lack of any principled way to account for the underlying uncertainties, due to missing or unrecorded experimental process (commonly happens in chemical labs). Given these challenges, we propose a new formulation for yield prediction. Rather than assuming a single deterministic yield value for a given reaction, we model the outcome as a probabilistic distribution over three discrete yield regimes: high, medium, and low, reflecting the inherent uncertainty in the reaction process, which is often only partially observed. Accordingly, we propose Proto-Yield, an encoder-agnostic prototype network that models reactions as occurring in one of three yield regimes: high, medium, or low. Without access to full reaction processes, Proto-Yield learns to infer latent regimes and their associated yield distributions from noisy, incomplete training data. During inference, Proto-Yield outputs both a calibrated probability distribution over the yield regimes and the predicted yield conditioned on each regime. Extensive experiments on a 41,000-reaction patent corpus and two high-throughput benchmarks show that Proto-Yield improves R2 by up to 15% and reduces RMSE/MAE by 13% compared to baseline methods.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>yield prediction</category><category>uncertainty modeling</category><category>prototype network</category><category>probabilistic distributions</category><category>chemical reactions</category><author>Guo, Kehan et al.</author></item><item><title>Machine learning anomaly detection of automated HPLC experiments in the cloud laboratory</title><link>https://doi.org/10.1039/d5dd00253b</link><guid isPermaLink="true">https://doi.org/10.1039/d5dd00253b</guid><description>Autonomous experiments are vulnerable to unforeseen adverse events. We developed a transferable ML framework that flags affected HPLC runs in real time and provides expert-level quality control without human oversight.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>automated quality assurance</category><category>real-time monitoring</category><category>machine learning algorithms</category><category>data-driven quality control</category><category>cloud-based experimentation</category><author>Gusev, Filipp et al.</author></item><item><title>Machine learning interatomic potentials at the centennial crossroads of quantum mechanics</title><link>https://doi.org/10.1038/s43588-025-00930-6</link><guid isPermaLink="true">https://doi.org/10.1038/s43588-025-00930-6</guid><description>As quantum mechanics marks its centennial in 2025, machine learning interatomic potentials have emerged as transformative tools in molecular modeling, bridging quantum mechanical accuracy with classical efficiency. Here we examine their development through four defining challenges-achieving chemical accuracy, maintaining computational efficiency, ensuring interpretability and reaching universal generalizability. We highlight architectural innovations, physics-informed approaches, and foundation models trained on extensive data. Together, these developments chart a path toward predictive, transferable and physically grounded machine learning frameworks for next-generation computational chemistry.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>transferable potentials</category><category>physics-augmented learning</category><category>scalable molecular modeling</category><category>adaptive force fields</category><category>multiscale modeling</category><author>Kalita, Bhupalee et al.</author></item><item><title>AIMNet2‐NSE: A Transferable Reactive Neural Network Potential for Open‐Shell Chemistry</title><link>https://doi.org/10.1002/anie.202516763</link><guid isPermaLink="true">https://doi.org/10.1002/anie.202516763</guid><description>Kalita, Bhupalee and Zubatyuk, Roman and Anstine, Dylan M. and Bergeler, Maike and Settels, Volker and Stork, Conrad and Spicher, Sebastian and Isayev, Olexandr. Angewandte Chemie International Edition (2025)</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>AIMNet2-NSE</category><category>neural network potential</category><category>open-shell chemistry</category><category>reactive chemistry</category><category>transferable potential</category><category>radical chemistry</category><author>Kalita, Bhupalee et al.</author></item><item><title>Fast and Accurate Ring Strain Energy Predictions with Machine Learning and Application in Strain-Promoted Reactions</title><link>https://doi.org/10.1021/jacsau.5c00667</link><guid isPermaLink="true">https://doi.org/10.1021/jacsau.5c00667</guid><description>Ring strain energy (RSE) is crucial for understanding molecular reactivity, with broad implications in polymerization, click chemistry, drug discovery and beyond. However, quantitatively determining RSE through experiments or quantum mechanics (QM) is resource-intensive, limiting its application on a large scale. We present a machine learning (ML)-based workflow that enables the reliable and efficient prediction of RSE, entirely bypassing traditional QM calculations. Our workflow employs AIMNet2 machine learning interatomic potentials and Auto3D for the identification of low-energy conformers and RSE computation. Remarkably, it achieves an R 2 of 0.997 and a mean absolute error (MAE) of 0.896 kcal/mol when benchmarked against the ωB97M-D4/Def2-TZVPP method, while running orders of magnitude faster than DFT calculations. To demonstrate the utility of our workflow, we successfully differentiated reactive from nonreactive molecules in copper-free click chemistry, [3 + 2] cycloaddition reaction and ring-opening metathesis polymerization, underscoring its transferability to diverse molecular systems. Additionally, we compiled the RSE Atlas, a computational database encompassing 16,905 single-ring molecules, offering a valuable resource for investigating factors influencing RSE. Our approach transforms RSE into a readily computable property, facilitating its integration into reaction designs.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>conformer identification</category><category>strain energy computation</category><category>molecular mechanics</category><category>high-throughput screening</category><category>chemical reactivity</category><author>Liu, Zhen et al.</author></item><item><title>Efficient Molecular Crystal Structure Prediction and Stability Assessment with AIMNet2 Neural Network Potentials</title><link>https://doi.org/10.1021/acs.cgd.5c01001</link><guid isPermaLink="true">https://doi.org/10.1021/acs.cgd.5c01001</guid><description>Identifying thermodynamically stable crystal structures remains a key challenge in materials chemistry. Computational crystal structure prediction (CSP) workflows typically rank candidate structures by lattice energy to assess relative stability. Approaches using self-consistent first-principles calculations become prohibitively expensive, especially when millions of energy evaluations are required for complex molecular systems with many atoms per unit cell. Here, we provide a detailed analysis of our methodology and results from the seventh blind test of crystal structure prediction organized by the Cambridge Crystallographic Data Centre (CCDC). We present an approach that significantly accelerates CSP by training target-specific machine-learned interatomic potentials (MLIPs). AIMNet2 MLIPs are trained on density functional theory (DFT) calculations of molecular clusters, herein referred to as n-mers. We demonstrate that potentials trained on gas phase dispersion-corrected DFT reference data of n-mers successfully extend to crystalline environments, accurately characterizing the CSP landscape and correctly ranking structures by relative stability. Our methodology effectively captures the underlying physics of thermodynamic crystal stability using only molecular cluster data, avoiding the need for expensive periodic calculations. The performance of target-specific AIMNet2 interatomic potentials is illustrated across diverse chemical systems relevant to pharmaceutical, optoelectronic, and agrochemical applications, demonstrating their promise as efficient alternatives to full DFT calculations for routine CSP tasks.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>aimnet2</category><category>molecular crystals</category><category>crystal structure prediction (csp)</category><category>machine-learned interatomic potentials (mlips)</category><category>density functional theory (dft)</category><author>Nayal, Kamal Singh et al.</author></item><item><title>GEOM-drugs revisited: toward more chemically accurate benchmarks for 3D molecule generation</title><link>https://doi.org/10.1039/d5dd00206k</link><guid isPermaLink="true">https://doi.org/10.1039/d5dd00206k</guid><description>Revisiting GEOM drugs: corrected metrics and novel energy-based structural benchmark enable rigorous evaluation of 3D molecule generative models.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>benchmarking framework</category><category>molecular conformation accuracy</category><category>energy landscapes</category><category>model validation techniques</category><category>chemical structure prediction</category><author>Nikitin, Filipp et al.</author></item><item><title>Design of Tough 3D Printable Elastomers with Human‐in‐the‐Loop Reinforcement Learning</title><link>https://doi.org/10.1002/ange.202513147</link><guid isPermaLink="true">https://doi.org/10.1002/ange.202513147</guid><description>Rapp, Johann L. and Anstine, Dylan M. and Gusev, Filipp and Nikitin, Filipp and Yun, Kelly H. and Borden, Meredith A. and Bhat, Vittal and Isayev, Olexandr and Leibfarth, Frank A.. Angewandte Chemie (2025)</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>3D printing</category><category>elastomers</category><category>reinforcement learning</category><category>human-in-the-loop</category><category>materials design</category><category>soft materials</category><author>Rapp, Johann L. et al.</author></item><item><title>ANI-1xBB: An ANI-Based Reactive Potential for Small Organic Molecules</title><link>https://doi.org/10.1021/acs.jctc.5c00347</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jctc.5c00347</guid><description>Reactive potentials serve as essential tools for investigating chemical reactions with moderate computational costs. However, traditional reactive potentials often depend on fixed, semiempirical parameters, which limits their accuracy and transferability. Overcoming these limitations can significantly expand the applicability of reactive potentials, enabling the simulation of a broader range of reactions under diverse conditions and the prediction of reaction properties, such as barrier heights. This work introduces ANI-1xBB, a novel ANI-based reactive ML potential trained on off-equilibrium molecular conformers generated through an automated bond-breaking workflow. ANI-1xBB significantly enhances the prediction of reaction energetics, barrier heights, and bond dissociation energies, surpassing those of conventional ANI models. Our results show that ANI-1xBB improves transition state modeling and reaction pathway prediction while generalizing effectively to pericyclic reactions and radical-driven processes. Furthermore, the automated data generation strategy supports the efficient construction of large-scale, high-quality reactive data sets, reducing reliance on expensive QM calculations. This work highlights ANI-1xBB as a practical model for accelerating the development of reactive machine learning potentials, offering new opportunities for modeling reaction phenomena.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>off-equilibrium conformers</category><category>bond-breaking workflow</category><category>transition state modeling</category><category>pericyclic reactions</category><category>radical-driven processes</category><author>Zhang, Shuhao et al.</author></item><item><title>Including Physics-Informed Atomization Constraints in Neural Networks for Reactive Chemistry</title><link>https://doi.org/10.1021/acs.jcim.5c00341</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.5c00341</guid><description>Machine learning interatomic potentials (MLIPs) have emerged as powerful tools for investigating atomistic systems with high accuracy and a relatively low computational cost. However, a common and unaddressed challenge with many current neural network (NN) MLIP models is their limited ability to accurately predict the relative energies of systems containing isolated or nearly isolated atoms, which appear in various reactive processes. To address this limitation, we present a mathematical technique for modifying any existing atom-centered NN architecture to account for the energies of isolated atoms. The result produces a consistent prediction of the atomization energy (AE) of a system using minimal constraints on the model. Using this technique, we build a model architecture that we call hierarchically interacting particle neural network (HIP-NN)-AE, an AE-constrained version of the HIP-NN, as well as ANI-AE, the AE-constrained version of the accurate NN engine for molecular energies (ANI). Our results demonstrate AE consistency of AE-constrained models, which drastically improves the AE predictions for the models. We compare the AE-constrained approach to unconstrained models as well as models from the literature in other scenarios, such as bond dissociation energies, bond dissociation pathways, and extensibility tests. These results show that the constraints improve the model performance in some of these tasks and do not negatively affect the performance on any tasks. The AE constraint approach thus offers a robust solution to the challenges posed by isolated atoms in energy prediction tasks.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>atomization energy constraints</category><category>neural networks</category><category>reactive chemistry</category><category>machine learning interatomic potentials</category><category>hip-nn-ae</category><category>ani-ae</category><category>bond dissociation energies</category><category>extensibility tests</category><author>Zhang, Shuhao et al.</author></item><item><title>High-throughput electronic property prediction of cyclic molecules with 3D-enhanced machine learning</title><link>https://doi.org/10.1039/d5sc04079e</link><guid isPermaLink="true">https://doi.org/10.1039/d5sc04079e</guid><description>Ring Vault contains 201 546 cyclic molecules across 11 elements. AIMNet2 with 3D information outperformed 2D models in predicting the electronic properties of cyclic molecules.</description><pubDate>Wed, 01 Jan 2025 00:00:00 GMT</pubDate><category>high-throughput prediction</category><category>molecular electronics</category><category>scalability</category><category>generalizability across elements</category><category>computational efficiency</category><author>Zheng, Peikun et al.</author></item><item><title>MLatom 3: A Platform for Machine Learning-Enhanced Computational Chemistry Simulations and Workflows</title><link>https://doi.org/10.1021/acs.jctc.3c01203</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jctc.3c01203</guid><description>Dral, Pavlo O. and Ge, Fuchun and Hou, Yi-Fan and Zheng, Peikun and Chen, Yuxinxin and Barbatti, Mario and Isayev, Olexandr and Wang, Cheng and Xue, Bao-Xin and Pinheiro Jr, Max and Su, Yuming and Dai, Yiheng and Chen, Yangtao and Zhang, Lina and Zhang, Shuang and Ullah, Arif and Zhang, Quanhao and Ou, Yanchi. Journal of Chemical Theory and Computation (2024)</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>MLatom</category><category>ML platform</category><category>computational chemistry workflows</category><category>atomistic simulations</category><category>machine learning interface</category><author>Dral, Pavlo O. et al.</author></item><item><title>&lt;i&gt;In silico&lt;/i&gt; screening of LRRK2 WDR domain inhibitors using deep docking and free energy simulations</title><link>https://doi.org/10.1039/d3sc06880c</link><guid isPermaLink="true">https://doi.org/10.1039/d3sc06880c</guid><description>In this work, we combined Deep Docking and free energy MD simulations for the in silico screening and experimental validation for potential inhibitors of leucine rich repeat kinase 2 (LRRK2) targeting the WD40 repeat (WDR) domain.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>molecular dynamics</category><category>virtual screening</category><category>binding affinity</category><category>docking algorithms</category><category>computational drug design</category><author>Gutkin, Evgeny et al.</author></item><item><title>ANI/EFP: Modeling Long-Range Interactions in ANI Neural Network with Effective Fragment Potentials</title><link>https://doi.org/10.1021/acs.jctc.4c01052</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jctc.4c01052</guid><description>Deep learning Neural Networks (NN) have been developed in the field of molecular modeling for the purpose of circumventing the high computational cost of quantum-mechanical calculations while rivaling their accuracies. Although these networks have found great success, they generally lack the ability to accurately describe long-range interactions, which makes them unusable for extended molecular systems. Herein, we provide a method for partially retraining the deep learning general-use neural network ANI, in which the long-range interactions are represented via atomic electrostatic potentials. The electrostatic potentials, generated with polarizable effective fragment potentials (EFP), are used as an additional input feature for the network. This new ANI/EFP network can predict solute-solvent interaction energies on a trained data set with a kcal/mol accuracy. It also shows promise in predicting the interaction energies of a solute in solvent environments that have not been included in a training data set. The proposed protocol can be taken as an example and further developed, leading to highly accurate and transferable neural network potentials capable of handling long-range interactions and extended molecular systems.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>electrostatic potential enhancement</category><category>ani model extension</category><category>long-range interaction correction</category><category>solute-solvent energy prediction</category><category>transferable neural network</category><author>Haghiri, Shahed et al.</author></item><item><title>Discovery of Crystallizable Organic Semiconductors with Machine Learning</title><link>https://doi.org/10.1021/jacs.4c05245</link><guid isPermaLink="true">https://doi.org/10.1021/jacs.4c05245</guid><description>Crystalline organic semiconductors are known to have improved charge carrier mobility and exciton diffusion length in comparison to their amorphous counterparts. Certain organic molecular thin films can be transitioned from initially prepared amorphous layers to large-scale crystalline films via abrupt thermal annealing. Ideally, these films crystallize as platelets with long-range-ordered domains on the scale of tens to hundreds of microns. However, other organic molecular thin films may instead crystallize as spherulites or resist crystallization entirely. Organic molecules that have the capability of transforming into a platelet morphology feature both high melting point (Tm) and crystallization driving force (ΔGc). In this work, we employed machine learning (ML) to identify candidate organic materials with the potential to crystallize into platelets by estimating the aforementioned thermal properties. Six organic molecules identified by the ML algorithm were experimentally evaluated; three crystallized as platelets, one crystallized as a spherulite, and two resisted thin film crystallization. These results demonstrate a successful application of ML in the scope of predicting thermal properties of organic molecules and reinforce the principles of Tm and ΔGc as metrics that aid in predicting the crystallization behavior of organic thin films.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>crystal structure prediction</category><category>molecular design</category><category>thermodynamics of crystallization</category><category>organic electronics</category><category>property prediction models</category><author>Johnson, Holly M. et al.</author></item><item><title>De novo molecule design towards biased properties via a deep generative framework and iterative transfer learning</title><link>https://doi.org/10.1039/d3dd00210a</link><guid isPermaLink="true">https://doi.org/10.1039/d3dd00210a</guid><description>The RRCGAN, validated through DFT, demonstrates success in generating chemically valid molecules targeting energy gap values with 75% of the generated molecules have RE of &amp;lt;20% of the targeted values.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>deep learning</category><category>biasing properties</category><category>transfer learning</category><category>molecular generation</category><category>property optimization</category><author>Sattari, Kianoosh et al.</author></item><item><title>Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR</title><link>https://doi.org/10.1038/s41573-023-00832-0</link><guid isPermaLink="true">https://doi.org/10.1038/s41573-023-00832-0</guid><description>Quantitative structure-activity relationship (QSAR) modelling, an approach that was introduced 60 years ago, is widely used in computer-aided drug design. In recent years, progress in artificial intelligence techniques, such as deep learning, the rapid growth of databases of molecules for virtual screening and dramatic improvements in computational power have supported the emergence of a new field of QSAR applications that we term &apos;deep QSAR&apos;. Marking a decade from the pioneering applications of deep QSAR to tasks involved in small-molecule drug discovery, we herein describe key advances in the field, including deep generative and reinforcement learning approaches in molecular design, deep learning models for synthetic planning and the application of deep QSAR models in structure-based virtual screening. We also reflect on the emergence of quantum computing, which promises to further accelerate deep QSAR applications and the need for open-source and democratized resources to support computer-aided drug design.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>neural networks</category><category>molecular design</category><category>deep reinforcement learning</category><category>qsar applications</category><category>computational power</category><author>Tropsha, Alexander et al.</author></item><item><title>Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential</title><link>https://doi.org/10.1038/s41557-023-01427-3</link><guid isPermaLink="true">https://doi.org/10.1038/s41557-023-01427-3</guid><description>Abstract Atomistic simulation has a broad range of applications from drug design to materials discovery. Machine learning interatomic potentials (MLIPs) have become an efficient alternative to computationally expensive ab initio simulations. For this reason, chemistry and materials science would greatly benefit from a general reactive MLIP, that is, an MLIP that is applicable to a broad range of reactive chemistry without the need for refitting. Here we develop a general reactive MLIP (ANI-1xnr) through automated sampling of condensed-phase reactions. ANI-1xnr is then applied to study five distinct systems: carbon solid-phase nucleation, graphene ring formation from acetylene, biofuel additives, combustion of methane and the spontaneous formation of glycine from early earth small molecules. In all studies, ANI-1xnr closely matches experiment (when available) and/or previous studies using traditional model chemistry methods. As such, ANI-1xnr proves to be a highly general reactive MLIP for C, H, N and O elements in the condensed phase, enabling high-throughput in silico reactive chemistry experimentation.</description><pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate><category>automated sampling</category><category>condensed-phase chemistry</category><category>high-throughput experimentation</category><category>ani-1xnr</category><category>machine learning force fields</category><author>Zhang, Shuhao et al.</author></item><item><title>Generative Models as an Emerging Paradigm in the Chemical Sciences</title><link>https://doi.org/10.1021/jacs.2c13467</link><guid isPermaLink="true">https://doi.org/10.1021/jacs.2c13467</guid><description>Traditional computational approaches to design chemical species are limited by the need to compute properties for a vast number of candidates, e.g., by discriminative modeling. Therefore, inverse design methods aim to start from the desired property and optimize a corresponding chemical structure. From a machine learning viewpoint, the inverse design problem can be addressed through so-called generative modeling. Mathematically, discriminative models are defined by learning the probability distribution function of properties given the molecular or material structure. In contrast, a generative model seeks to exploit the joint probability of a chemical species with target characteristics. The overarching idea of generative modeling is to implement a system that produces novel compounds that are expected to have a desired set of chemical features, effectively sidestepping issues found in the forward design process. In this contribution, we overview and critically analyze popular generative algorithms like generative adversarial networks, variational autoencoders, flow, and diffusion models. We highlight key differences between each of the models, provide insights into recent success stories, and discuss outstanding challenges for realizing generative modeling discovered solutions in chemical applications.</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>generative-modeling</category><category>inverse-design-methods</category><category>molecular-generation</category><category>virtual-screening</category><category>admet-properties</category><author>Anstine, Dylan M. et al.</author></item><item><title>Machine Learning Interatomic Potentials and Long-Range Physics</title><link>https://doi.org/10.1021/acs.jpca.2c06778</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jpca.2c06778</guid><description>Advances in machine learned interatomic potentials (MLIPs), such as those using neural networks, have resulted in short-range models that can infer interaction energies with near ab initio accuracy and orders of magnitude reduced computational cost. For many atom systems, including macromolecules, biomolecules, and condensed matter, model accuracy can become reliant on the description of short- and long-range physical interactions. The latter terms can be difficult to incorporate into an MLIP framework. Recent research has produced numerous models with considerations for nonlocal electrostatic and dispersion interactions, leading to a large range of applications that can be addressed using MLIPs. In light of this, we present a Perspective focused on key methodologies and models being used where the presence of nonlocal physics and chemistry are crucial for describing system properties. The strategies covered include MLIPs augmented with dispersion corrections, electrostatics calculated with charges predicted from atomic environment descriptors, the use of self-consistency and message passing iterations to propagated nonlocal system information, and charges obtained via equilibration schemes. We aim to provide a pointed discussion to support the development of machine learning-based interatomic potentials for systems where contributions from only nearsighted terms are deficient.</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>nonlocal physics</category><category>self-consistency</category><category>message passing</category><category>equilibration schemes</category><category>long-range interactions</category><author>Anstine, Dylan M. et al.</author></item><item><title>Themed collection on Insightful Machine Learning for Physical Chemistry</title><link>https://doi.org/10.1039/d3cp90129g</link><guid isPermaLink="true">https://doi.org/10.1039/d3cp90129g</guid><description>This themed collection includes a collection of articles on Insightful Machine Learning for Physical Chemistry.</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>neural networks</category><category>force fields</category><category>feature engineering</category><category>hyperparameter optimization</category><category>benchmarking</category><author>Clark, Aurora E. et al.</author></item><item><title>Synergy of semiempirical models and machine learning in computational chemistry</title><link>https://doi.org/10.1063/5.0151833</link><guid isPermaLink="true">https://doi.org/10.1063/5.0151833</guid><description>Fedik, Nikita and Nebgen, Benjamin and Lubbers, Nicholas and Barros, Kipton and Kulichenko, Maksim and Li, Ying Wai and Zubatyuk, Roman and Messerly, Richard and Isayev, Olexandr and Tretiak, Sergei. The Journal of Chemical Physics (2023)</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>semi-empirical methods</category><category>machine learning</category><category>hybrid quantum chemistry</category><category>computational chemistry</category><category>method synergy</category><author>Fedik, Nikita et al.</author></item><item><title>Active Learning Guided Drug Design Lead Optimization Based on Relative Binding Free Energy Modeling</title><link>https://doi.org/10.1021/acs.jcim.2c01052</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.2c01052</guid><description>In silico identification of potent protein inhibitors commonly requires prediction of a ligand binding free energy (BFE). Thermodynamics integration (TI) based on molecular dynamics (MD) simulations is a BFE calculation method capable of acquiring accurate BFE, but it is computationally expensive and time-consuming. In this work, we have developed an efficient automated workflow for identifying compounds with the lowest BFE among thousands of congeneric ligands, which requires only hundreds of TI calculations. Automated machine learning (AutoML) orchestrated by active learning (AL) in an AL-AutoML workflow allows unbiased and efficient search for a small set of best-performing molecules. We have applied this workflow to select inhibitors of the SARS-CoV-2 papain-like protease and were able to find 133 compounds with improved binding affinity, including 16 compounds with better than 100-fold binding affinity improvement. We obtained a hit rate that outperforms that expected of traditional expert medicinal chemist-guided campaigns. Thus, we demonstrate that the combination of AL and AutoML with free energy simulations provides at least 20× speedup relative to the naïve brute force approaches.</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>machine learning workflows</category><category>efficient optimization</category><category>relative binding affinity</category><category>lead optimization</category><category>algorithmic drug design</category><author>Gusev, Filipp et al.</author></item><item><title>Scalable hybrid deep neural networks/polarizable potentials biomolecular simulations including long-range effects</title><link>https://doi.org/10.1039/d2sc04815a</link><guid isPermaLink="true">https://doi.org/10.1039/d2sc04815a</guid><description>Jaffrelot Inizan, Théo and Plé, Thomas and Adjoua, Olivier and Ren, Pengyu and Gökcan, Hatice and Isayev, Olexandr and Lagardère, Louis and Piquemal, Jean-Philip. Chemical Science (2023)</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>deep neural networks</category><category>polarizable potentials</category><category>biomolecular simulations</category><category>long-range interactions</category><category>hybrid potentials</category><category>scalability</category><author>Jaffrelot Inizan, Théo et al.</author></item><item><title>The challenge of balancing model sensitivity and robustness in predicting yields: a benchmarking study of amide coupling reactions</title><link>https://doi.org/10.1039/d3sc03902a</link><guid isPermaLink="true">https://doi.org/10.1039/d3sc03902a</guid><description>Liu, Zhen and Moroz, Yurii S. and Isayev, Olexandr. Chemical Science (2023)</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>amide coupling</category><category>yield prediction</category><category>model robustness</category><category>reaction prediction</category><category>benchmarking</category><category>synthesis planning</category><author>Liu, Zhen et al.</author></item><item><title>Structure Prediction of Epitaxial Organic Interfaces with Ogre, Demonstrated for Tetracyanoquinodimethane (TCNQ) on Tetrathiafulvalene (TTF)</title><link>https://doi.org/10.1021/acs.jpcc.3c02384</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jpcc.3c02384</guid><description>[This retracts the article DOI: 10.1021/acsanm.4c06844.].</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>epitaxial interfaces</category><category>tcnq-ttf interface</category><category>crystal structure prediction</category><category>ogre software</category><category>machine learning potentials</category><author>Moayedpour, Saeed et al.</author></item><item><title>Δ &lt;sup&gt;2&lt;/sup&gt; machine learning for reaction property prediction</title><link>https://doi.org/10.1039/d3sc02408c</link><guid isPermaLink="true">https://doi.org/10.1039/d3sc02408c</guid><description>Zhao, Qiyuan and Anstine, Dylan M. and Isayev, Olexandr and Savoie, Brett M.. Chemical Science (2023)</description><pubDate>Sun, 01 Jan 2023 00:00:00 GMT</pubDate><category>delta machine learning</category><category>reaction property prediction</category><category>transfer learning</category><category>transition states</category><category>barrier heights</category><author>Zhao, Qiyuan et al.</author></item><item><title>Extending machine learning beyond interatomic potentials for predicting molecular properties</title><link>https://doi.org/10.1038/s41570-022-00416-3</link><guid isPermaLink="true">https://doi.org/10.1038/s41570-022-00416-3</guid><description>Fedik, Nikita and Zubatyuk, Roman and Kulichenko, Maksim and Lubbers, Nicholas and Smith, Justin S. and Nebgen, Benjamin and Messerly, Richard and Li, Ying Wai and Boldyrev, Alexander I. and Barros, Kipton and Isayev, Olexandr and Tretiak, Sergei. Nature Reviews Chemistry (2022)</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>machine learning potentials</category><category>molecular properties</category><category>review</category><category>atomistic methods</category><category>neural networks</category><category>property prediction</category><author>Fedik, Nikita et al.</author></item><item><title>Simulations of Pathogenic E1α Variants: Allostery and Impact on Pyruvate Dehydrogenase Complex-E1 Structure and Function</title><link>https://doi.org/10.1021/acs.jcim.2c00630</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.2c00630</guid><description>Pyruvate dehydrogenase complex (PDC) deficiency is a major cause of primary lactic acidemia resulting in high morbidity and mortality, with limited therapeutic options. The E1 component of the mitochondrial multienzyme PDC (PDC-E1) is a symmetric dimer of heterodimers (&amp;#x3b1;&amp;#x3b2;/&amp;#x3b1;&apos;&amp;#x3b2;&apos;) encoded by the &lt;i&gt;PDHA1&lt;/i&gt; and &lt;i&gt;PDHB&lt;/i&gt; genes, with two symmetric active sites each consisting of highly conserved phosphorylation loops A and B. &lt;i&gt;PDHA1&lt;/i&gt; mutations are responsible for 82-88% of cases. Greater than 85% of E1&amp;#x3b1; residues with disease-causing missense mutations (DMMs) are solvent-inaccessible, with &amp;#x223c;30% among those involved in subunit-subunit interface contact (SSIC). We performed molecular dynamics simulations of wild-type (WT) PDC-E1 and E1 variants with E1&amp;#x3b1; DMMs at R349 and W185 (residues involved in SSIC), to investigate their impact on human PDC-E1 structure. We evaluated the change in E1 structure and dynamics and examined their implications on E1 function with the specific DMMs. We found that the dynamics of phosphorylation Loop A, which is crucial for E1 biological activity, changes with DMMs that are at least about 15 &amp;#xc5; away. Because communication is essential for PDC-E1 activity (with alternating active sites), we also investigated the possible communication network within WT PDC-E1 via centrality analysis. We observed that DMMs altered/disrupted the communication network of PDC-E1. Collectively, these results indicate allosteric effect in PDC-E1, with implications for the development of novel small-molecule therapeutics for specific recurrent E1&amp;#x3b1; DMMs such as replacements of R349 responsible for &amp;#x223c;10% of PDC deficiency due to E1&amp;#x3b1; DMMs.</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>molecular dynamics simulations</category><category>e1α variants</category><category>pdc deficiency</category><category>allosteric effects</category><category>communication networks</category><category>small-molecule therapeutics</category><author>Gokcan, Hatice et al.</author></item><item><title>Prediction of protein p &lt;i&gt;K&lt;/i&gt; &lt;sub&gt;a&lt;/sub&gt; with representation learning</title><link>https://doi.org/10.1039/d1sc05610g</link><guid isPermaLink="true">https://doi.org/10.1039/d1sc05610g</guid><description>We developed new empirical ML model for protein pKaprediction with MAEs below 0.5 for all amino acid types.</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>representation learning</category><category>protein pka prediction</category><category>empirical models</category><category>machine learning in biochemistry</category><category>amino acid analysis</category><author>Gokcan, Hatice et al.</author></item><item><title>Generative and reinforcement learning approaches for the automated de novo design of bioactive compounds</title><link>https://doi.org/10.1038/s42004-022-00733-0</link><guid isPermaLink="true">https://doi.org/10.1038/s42004-022-00733-0</guid><description>Korshunova, Maria and Huang, Niles and Capuzzi, Stephen and Radchenko, Dmytro S. and Savych, Olena and Moroz, Yuriy S. and Wells, Carrow I. and Willson, Timothy M. and Tropsha, Alexander and Isayev, Olexandr. Communications Chemistry (2022)</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>generative models</category><category>reinforcement learning</category><category>de novo drug design</category><category>bioactive compounds</category><category>virtual screening</category><category>automated design</category><author>Korshunova, Maria et al.</author></item><item><title>Auto3D: Automatic Generation of the Low-Energy 3D Structures with ANI Neural Network Potentials</title><link>https://doi.org/10.1021/acs.jcim.2c00817</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.2c00817</guid><description>Computational programs accelerate the chemical discovery processes but often need proper three-dimensional molecular information as part of the input. Getting optimal molecular structures is challenging because it requires enumerating and optimizing a huge space of stereoisomers and conformers. We developed the Python-based Auto3D package for generating the low-energy 3D structures using SMILES as the input. Auto3D is based on state-of-the-art algorithms and can automatize the isomer enumeration and duplicate filtering process, 3D building process, geometry optimization, and ranking process. Tested on 50 molecules with multiple unspecified stereocenters, Auto3D is guaranteed to find the stereoconfiguration that yields the lowest-energy conformer. With Auto3D, we provide an extension of the ANI model. The new model, dubbed ANI-2xt, is trained on a tautomer-rich data set. ANI-2xt is benchmarked with DFT methods on geometry optimization and electronic and Gibbs free energy calculations. Compared with ANI-2x, ANI-2xt provides a 42% error reduction for tautomeric reaction energy calculations when using the gold-standard coupled-cluster calculation as the reference. ANI-2xt can accurately predict the energies and is several orders of magnitude faster than DFT methods.</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>geometry optimization</category><category>tautomer-rich dataset</category><category>stereoconfiguration prediction</category><category>machine learning acceleration</category><category>energy landscape exploration</category><author>Liu, Zhen et al.</author></item><item><title>The transformational role of GPU computing and deep learning in drug discovery</title><link>https://doi.org/10.1038/s42256-022-00463-x</link><guid isPermaLink="true">https://doi.org/10.1038/s42256-022-00463-x</guid><description>There has been a recent focus within Alzheimer&apos;s Disease (AD) research on potential CSF biomarkers as diagnostic tools. PET imaging and CSF biomarkers have been used to determine the amyloid status of individuals when characterising AD. While these measures have largely correlated with each other, it is also suggested that there could be patients who can have discordant results. However, it is still unclear what proportion of AD subjects demonstrate the discordance. Additionally, whether these changes are associated with different pathological and clinical characteristics of the patients. Here, we investigated the discordance between CSF A&amp;#x3b2;42:40 and amyloid status and evaluated whether there are any significant changes in their levels of tau aggregation. 313 participants&apos; data were selected from the ADNI database. The cutoff for A&amp;#x3b2;42:40 was then calculated using the Youden Index resulting from receiver operating characteristic (ROC) curve. The formula used was J&amp;#xa0;=&amp;#xa0;maxc Se (c) + Sp (c) - 1, resulting in a maximum cutoff of J&amp;#xa0;=&amp;#xa0;0.057 for A&amp;#x3b2;42:40. This cutoff produced a discordance rate of 10.5%, with n&amp;#xa0;=&amp;#xa0;33 of 313 patients being discordant. We then performed t-tests to investigate the potential clinical differences between concordant and discordant patients. Independent samples t-tests indicated that levels of pTau181 and total Tau significantly differed between the concordant and discordant groups. pTau181 was significantly lower in the discordant group (t(71.2)=-3.166, p&amp;#xa0;=&amp;#xa0;.002), as was Tau (t(53.9)=-2.046, p&amp;#xa0;=&amp;#xa0;.046). Further data is found in Figure 1. Demonstration of different levels of tau deposition in the discordant groups implies there may be other pathological processes influencing neurodegeneration in these subjects. A discordance of 10% between CSF amyloid and PET amyloid implies that we should evaluate these subjects in greater detail before enrolling them in intervention studies. References 1. Hansson, O. et al. (2019). https://doi.org/10.1186/s13195-019-0485-0 2. Pyun, JM. et al. (2024) https://doi.org/10.1038/s41398-024-02766-6.</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>gpu computing</category><category>deep learning</category><category>alzheimer&apos;s disease</category><category>csf biomarkers</category><category>diagnostic tools</category><author>Pandey, Mohit et al.</author></item><item><title>Toward Chemical Accuracy in Predicting Enthalpies of Formation with General-Purpose Data-Driven Methods</title><link>https://doi.org/10.1021/acs.jpclett.2c00734</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jpclett.2c00734</guid><description>Zheng, Peikun and Yang, Wudi and Wu, Wei and Isayev, Olexandr and Dral, Pavlo O.. The Journal of Physical Chemistry Letters (2022)</description><pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate><category>enthalpy of formation</category><category>chemical accuracy</category><category>thermochemistry</category><category>data-driven methods</category><category>general-purpose models</category><author>Zheng, Peikun et al.</author></item><item><title>Best practices in machine learning for chemistry</title><link>https://doi.org/10.1038/s41557-021-00716-z</link><guid isPermaLink="true">https://doi.org/10.1038/s41557-021-00716-z</guid><description>Artrith, Nongnuch and Butler, Keith T. and Coudert, François-Xavier and Han, Seungwu and Isayev, Olexandr and Jain, Anubhav and Walsh, Aron. Nature Chemistry (2021)</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>best practices</category><category>machine learning workflows</category><category>neural network applications</category><category>methodology guidelines</category><category>interdisciplinary approaches</category><author>Artrith, Nongnuch et al.</author></item><item><title>Crowdsourced mapping of unexplored target space of kinase inhibitors</title><link>https://doi.org/10.1038/s41467-021-23165-1</link><guid isPermaLink="true">https://doi.org/10.1038/s41467-021-23165-1</guid><description>Cichońska, Anna and Ravikumar, Balaguru and Allaway, Robert J. and Wan, Fangping and Park, Sungjoon and Isayev, Olexandr and Li, Shuya and Mason, Michael and Lamb, Andrew and Tanoli, Ziaurrehman and Jeon, Minji and Kim, Sunkyu and Popova, Mariya and Capuzzi, Stephen and Zeng, Jianyang and Dang, Kristen and Koytiger, Gregory and Kang, Jaewoo and Wells, Carrow I. and Willson, Timothy M. and Tan, Mehmet and Huang, Chih-Han and Shih, Edward S. C. and Chen, Tsai-Min and Wu, Chih-Hsun and Fang, Wei-Quan and Chen, Jhih-Yu and Hwang, Ming-Jing and Wang, Xiaokang and Ben Guebila, Marouen and Shamsaei, Behrouz and Singh, Sourav and Nguyen, Thin and Karimi, Mostafa and Wu, Di and Wang, Zhangyang and Shen, Yang and Öztürk, Hakime and Ozkirimli, Elif and Özgür, Arzucan and Lim, Hansaim and Xie, Lei and Kanev, Georgi K. and Kooistra, Albert J. and Westerman, Bart A. and Terzopoulos, Panagiotis and Ntagiantas, Konstantinos and Fotis, Christos and Alexopoulos, Leonidas and Boeckaerts, Dimitri and Stock, Michiel and De Baets, Bernard and Briers, Yves and Luo, Yunan and Hu, Hailin and Peng, Jian and Dogan, Tunca and Rifaioglu, Ahmet S. and Atas, Heval and Atalay, Rengul Cetin and Atalay, Volkan and Martin, Maria J. and Jeon, Minji and Lee, Junhyun and Yun, Seongjun and Kim, Bumsoo and Chang, Buru and Turu, Gábor and Misák, Ádám and Szalai, Bence and Hunyady, László and Lienhard, Matthias and Prasse, Paul and Bachmann, Ivo and Ganzlin, Julia and Barel, Gal and Herwig, Ralf and Oršolić, Davor and Lučić, Bono and Stepanić, Višnja and Šmuc, Tomislav and Oprea, Tudor I. and Schlessinger, Avner and Drewry, David H. and Stolovitzky, Gustavo and Wennerberg, Krister and Guinney, Justin and Aittokallio, Tero. Nature Communications (2021)</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>kinase inhibitors</category><category>virtual screening</category><category>target space</category><category>crowdsourcing</category><category>chemical biology</category><category>selectivity</category><author>Cichońska, Anna et al.</author></item><item><title>Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World</title><link>https://doi.org/10.1109/jiot.2021.3073904</link><guid isPermaLink="true">https://doi.org/10.1109/jiot.2021.3073904</guid><description>As COVID-19 hounds the world, the common cause of finding a swift solution to manage the pandemic has brought together researchers, institutions, governments, and society at large. The Internet of Things (IoT), artificial intelligence (AI)—including machine learning (ML) and Big Data analytics—as well as Robotics and Blockchain, are the four decisive areas of technological innovation that have been ingenuity harnessed to fight this pandemic and future ones. While these highly interrelated smart and connected health technologies cannot resolve the pandemic overnight and may not be the only answer to the crisis, they can provide greater insight into the disease and support frontline efforts to prevent and control the pandemic. This article provides a blend of discussions on the contribution of these digital technologies, propose several complementary and multidisciplinary techniques to combat COVID-19, offer opportunities for more holistic studies, and accelerate knowledge acquisition and scientific discoveries in pandemic research. First, four areas, where IoT can contribute are discussed, namely: 1) tracking and tracing; 2) remote patient monitoring (RPM) by wearable IoT (WIoT); 3) personal digital twins (PDTs); and 4) real-life use case: ICT/IoT solution in South Korea. Second, the role and novel applications of AI are explained, namely: 1) diagnosis and prognosis; 2) risk prediction; 3) vaccine and drug development; 4) research data set; 5) early warnings and alerts; 6) social control and fake news detection; and 7) communication and chatbot. Third, the main uses of robotics and drone technology are analyzed, including: 1) crowd surveillance; 2) public announcements; 3) screening and diagnosis; and 4) essential supply delivery. Finally, we discuss how distributed ledger technologies (DLTs), of which blockchain is a common example, can be combined with other technologies for tackling COVID-19.</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>covid-19</category><category>iot</category><category>ai</category><category>drug development</category><category>vaccine discovery</category><category>blockchain</category><author>Firouzi, Farshad et al.</author></item><item><title>Active Learning in Bayesian Neural Networks for Bandgap Predictions of Novel Van der Waals Heterostructures</title><link>https://doi.org/10.1002/aisy.202100080</link><guid isPermaLink="true">https://doi.org/10.1002/aisy.202100080</guid><description>Fronzi, Marco and Isayev, Olexandr and Winkler, David A. and Shapter, Joseph G. and Ellis, Amanda V. and Sherrell, Peter C. and Shepelin, Nick A. and Corletto, Alexander and Ford, Michael J.. Advanced Intelligent Systems (2021)</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>Bayesian neural networks</category><category>active learning</category><category>bandgap prediction</category><category>van der Waals heterostructures</category><category>2D materials</category><author>Fronzi, Marco et al.</author></item><item><title>Learning molecular potentials with neural networks</title><link>https://doi.org/10.1002/wcms.1564</link><guid isPermaLink="true">https://doi.org/10.1002/wcms.1564</guid><description>AbstractThe potential energy of molecular species and their conformers can be computed with a wide range of computational chemistry methods, from molecular mechanics to ab initio quantum chemistry. However, the proper choice of the computational approach based on computational cost and reliability of calculated energies is a dilemma, especially for large molecules. This dilemma is proved to be even more problematic for studies that require hundreds and thousands of calculations, such as drug discovery. On the other hand, driven by their pattern recognition capabilities, neural networks started to gain popularity in the computational chemistry community. During the last decade, many neural network potentials have been developed to predict a variety of chemical information of different systems. Neural network potentials are proved to predict chemical properties with accuracy comparable to quantum mechanical approaches but with the cost approaching molecular mechanics calculations. As a result, the development of more reliable, transferable, and extensible neural network potentials became an attractive field of study for researchers. In this review, we outlined an overview of the status of current neural network potentials and strategies to improve their accuracy. We provide recent examples of studies that prove the applicability of these potentials. We also discuss the capabilities and shortcomings of the current models and the challenges and future aspects of their development and applications. It is expected that this review would provide guidance for the development of neural network potentials and the exploitation of their applicability.This article is categorized under:Data Science &amp;gt; Artificial Intelligence/Machine LearningMolecular and Statistical Mechanics &amp;gt; Molecular InteractionsSoftware &amp;gt; Molecular Modeling</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>neural network potentials</category><category>machine learning</category><category>molecular mechanics</category><category>drug discovery</category><category>computational chemistry</category><author>Gokcan, Hatice et al.</author></item><item><title>OpenChem: A Deep Learning Toolkit for Computational Chemistry and Drug Design</title><link>https://doi.org/10.1021/acs.jcim.0c00971</link><guid isPermaLink="true">https://doi.org/10.1021/acs.jcim.0c00971</guid><description>Deep learning models have demonstrated outstanding results in many data-rich areas of research, such as computer vision and natural language processing. Currently, there is a rise of deep learning in computational chemistry and materials informatics, where deep learning could be effectively applied in modeling the relationship between chemical structures and their properties. With the immense growth of chemical and materials data, deep learning models can begin to outperform conventional machine learning techniques such as random forest, support vector machines, and nearest neighbor. Herein, we introduce OpenChem, a PyTorch-based deep learning toolkit for computational chemistry and drug design. OpenChem offers easy and fast model development, modular software design, and several data preprocessing modules. It is freely available via the GitHub repository.</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>neural network models</category><category>modular design</category><category>data augmentation</category><category>hyperparameter tuning</category><category>open-source toolkit</category><author>Korshunova, Maria et al.</author></item><item><title>A critical overview of computational approaches employed for COVID-19 drug discovery</title><link>https://doi.org/10.1039/d0cs01065k</link><guid isPermaLink="true">https://doi.org/10.1039/d0cs01065k</guid><description>We cover diverse methodologies, computational approaches, and case studies illustrating the ongoing efforts to develop viable drug candidates for treatment of COVID-19.</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>computational drug design</category><category>covid-19 therapeutics</category><category>virtual screening strategies</category><category>methodology benchmarking</category><category>drug development pipeline</category><author>Muratov, Eugene N. et al.</author></item><item><title>Machine-Learning-Guided Discovery of &lt;sup&gt;19&lt;/sup&gt; F MRI Agents Enabled by Automated Copolymer Synthesis</title><link>https://doi.org/10.1021/jacs.1c08181</link><guid isPermaLink="true">https://doi.org/10.1021/jacs.1c08181</guid><description>Modern polymer science suffers from the curse of multidimensionality. The large chemical space imposed by including combinations of monomers into a statistical copolymer overwhelms polymer synthesis and characterization technology and limits the ability to systematically study structure-property relationships. To tackle this challenge in the context of &lt;sup&gt;19&lt;/sup&gt;F magnetic resonance imaging (MRI) agents, we pursued a computer-guided materials discovery approach that combines synergistic innovations in automated flow synthesis and machine learning (ML) method development. A software-controlled, continuous polymer synthesis platform was developed to enable iterative experimental-computational cycles that resulted in the synthesis of 397 unique copolymer compositions within a six-variable compositional space. The nonintuitive design criteria identified by ML, which were accomplished by exploring &lt;0.9% of the overall compositional space, lead to the identification of &gt;10 copolymer compositions that outperformed state-of-the-art materials.</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>iterative experimental-computational cycles</category><category>high-throughput screening</category><category>nonintuitive design criteria</category><category>state-of-the-art materials comparison</category><category>polymer informatics</category><author>Reis, Marcus et al.</author></item><item><title>Artificial intelligence-enhanced quantum chemical method with broad applicability</title><link>https://doi.org/10.1038/s41467-021-27340-2</link><guid isPermaLink="true">https://doi.org/10.1038/s41467-021-27340-2</guid><description>Zheng, Peikun and Zubatyuk, Roman and Wu, Wei and Isayev, Olexandr and Dral, Pavlo O.. Nature Communications (2021)</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>artificial intelligence</category><category>quantum chemistry</category><category>semi-empirical methods</category><category>broad applicability</category><category>hybrid methods</category><category>AIQM</category><author>Zheng, Peikun et al.</author></item><item><title>Machine learned Hückel theory: Interfacing physics and deep neural networks</title><link>https://doi.org/10.1063/5.0052857</link><guid isPermaLink="true">https://doi.org/10.1063/5.0052857</guid><description>Zubatiuk, Tetiana and Nebgen, Benjamin and Lubbers, Nicholas and Smith, Justin S. and Zubatyuk, Roman and Zhou, Guoqing and Koh, Christopher and Barros, Kipton and Isayev, Olexandr and Tretiak, Sergei. The Journal of Chemical Physics (2021)</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>Huckel theory</category><category>deep learning</category><category>semi-empirical</category><category>physics-informed</category><category>electronic structure</category><author>Zubatiuk, Tetiana et al.</author></item><item><title>Development of Multimodal Machine Learning Potentials: Toward a Physics-Aware Artificial Intelligence</title><link>https://doi.org/10.1021/acs.accounts.0c00868</link><guid isPermaLink="true">https://doi.org/10.1021/acs.accounts.0c00868</guid><description>ConspectusMachine learning interatomic potentials (MLIPs) are widely used for describing molecular energy and continue bridging the speed and accuracy gap between quantum mechanical (QM) and classical approaches like force fields. In this Account, we focus on the out-of-the-box approaches to developing transferable MLIPs for diverse chemical tasks. First, we introduce the &quot;Accurate Neural Network engine for Molecular Energies,&quot; ANAKIN-ME, method (or ANI for short). The ANI model utilizes Justin Smith Symmetry Functions (JSSFs) and realizes training for vast data sets. The training data set of several orders of magnitude larger than before has become the key factor of the knowledge transferability and flexibility of MLIPs. As the quantity, quality, and types of interactions included in the training data set will dictate the accuracy of MLIPs, the task of proper data selection and model training could be assisted with advanced methods like active learning (AL), transfer learning (TL), and multitask learning (MTL).Next, we describe the AIMNet &quot;Atoms-in-Molecules Network&quot; that was inspired by the quantum theory of atoms in molecules. The AIMNet architecture lifts multiple limitations in MLIPs. It encodes long-range interactions and learnable representations of chemical elements. We also discuss the AIMNet-ME model that expands the applicability domain of AIMNet from neutral molecules toward open-shell systems. The AIMNet-ME encompasses a dependence of the potential on molecular charge and spin. It brings ML and physical models one step closer, ensuring the correct molecular energy behavior over the total molecular charge.We finally describe perhaps the simplest possible physics-aware model, which combines ML and the extended Hückel method. In ML-EHM, &quot;Hierarchically Interacting Particle Neural Network,&quot; HIP-NN generates the set of a molecule- and environment-dependent Hamiltonian elements αμμ and K‡. As a test example, we show how in contrast to traditional Hückel theory, ML-EHM correctly describes orbital crossing with bond rotations. Hence it learns the underlying physics, highlighting that the inclusion of proper physical constraints and symmetries could significantly improve ML model generalization.</description><pubDate>Fri, 01 Jan 2021 00:00:00 GMT</pubDate><category>machine learning potentials</category><category>ani</category><category>aimnet</category><category>transfer learning</category><category>active learning</category><category>physics-aware ai</category><category>extended hückel method</category><category>molecular energy</category><author>Zubatiuk, Tetiana et al.</author></item></channel></rss>