FragGT: Fragment-based evolutionary molecule generation with gene types
Computational Chemistry Highlights
by Jan Jensen
2w ago
Joshua Meyers and Nathan Brown (2024)Highlighted by Jan Jensen Figure 1 from the paper. (c) The authors. Reproduced under the CC-BY license Genetic algorithms (GAs) allow for changes at the atom level (as opposed to molecular fragments) allow for a very fine-grained search of chemical space. However, some of the resulting molecules are not chemically sensible and one usually has to include a synthetic accessibility constraint in the scoring function.  However, another approach is to use fragments and include synthetic accessibility in the fragmentation scheme, which is what this study d ..read more
Visit website
Invalid SMILES are beneficial rather than detrimental to chemical language models
Computational Chemistry Highlights
by Jan Jensen
1M ago
Michael A. Skinnider (2024) Highlighted by Jan Jensen Figure 3c from the paper. (c) The author. Reproduced under the CC-BY License Language models (LMs) don't always produce valid SMILES and while for modern methods the percentage of invalid SMILES tends to be relatively small, much effort has been expended on making it as small as possible. SELFIES was invented as a way to make this percentage 0, since SELFIES is design to always produce valid SMILES. However, several studies have shown that SMILES-based LMs tends to produce molecular distributions that is closer to the training set, compar ..read more
Visit website
An evolutionary algorithm for interpretable molecular representations
Computational Chemistry Highlights
by Jan Jensen
2M ago
Philipp M. Pflüger, Marius Kühnemund, Felix Katzenburg, Herbert Kuchen, and Frank Glorius (2024)Highlighted by Jan Jensen Parts of Figures 2 and 6 combined. (c) 2024 Elsevier, Inc This paper presents a very novel approach to XAI that allows for direct comparison with chemical intuition. Molecular fingerprints (either binary or count) are defined using randomly generated SMARTS patterns and then uses a genetic algorithm to find the optimum fingerprint of a certain length. Here the optimum is defined as the one giving the lowest error when used with CatBoost. The GA search requires many thous ..read more
Visit website
AiZynth Impact on Medicinal Chemistry Practice at AstraZeneca
Computational Chemistry Highlights
by Jan Jensen
3M ago
Jason D. Shields, Rachel Howells, Gillian Lamont, Yin Leilei, Andrew Madin, Christopher E. Reimann, Hadi Rezaei, Tristan Reuillon, Bryony Smith, Clare Thomson, Yuting Zhengc and Robert E. Ziegler (2024) Highlighted by Jan Jensen Figure 3 from this paper (c) the authors 2020. Reproduced under the CC-BY license This is one of the rare papers where experimental chemists talk candidly about their experiences using ML models developed by others. In this case it is AiZynthFinder, which is developed at AstraZeneca Gothenburg and predicts retrosynthetic paths, while the users are most synt ..read more
Visit website
TS-Tools: Rapid and Automated Localization of Transition States Based on a Textual Reaction SMILES Input
Computational Chemistry Highlights
by Jan Jensen
4M ago
Thijs Stuyver (2024) Highlighted by Jan Jensen Figure 2 from the paper. (c) the author 2024 reproduced under the CC-BY-NC-ND licence This paper caught my eye for several reasons. It's an open source implementation of Maeda's AFIR method, but modified for double-ended TS searches. The setup is completely automated and interfaced to  xTB so it is fast. It's applied to really challenging problems such as solvent assisted bimolecular reactions and uncovers some important shortcomings of the xTB method.  This work is licensed under a Creative Commons Attribution 4.0 International Lic ..read more
Visit website
Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model
Computational Chemistry Highlights
by Jan Jensen
5M ago
Chenru Duan, Yuanqi Du, Haojun Jia, and Heather J. Kulik (2023) Highlighted by Jan Jensen Part of Figure 1 from the paper.  As anyone who has tried it will know, finding TSs is one of the most difficult, fiddly, and frustrating tasks in computational chemistry. While there are several methods aimed at automating the process, they tend to have a mixed success rate or be computationally expensive and, often, both. This paper looks to be an important first step in the right direction. The method produces a guess at a TS structure based on the coordinates of the reactants and products. Nota ..read more
Visit website
Growing strings in a chemical reaction space for searching retrosynthesis pathways
Computational Chemistry Highlights
by Jan Jensen
7M ago
Federico Zipoli, Carlo Baldassari, Matteo Manica, Jannis Born, and Teodoro Laino (2023) Highlighted by Jan Jensen Part of Figure 10 from the paper. (c) The authors 2023. Reproduced under the CC-NC-ND Prediction of retrosynthetic reaction trees are typically done by stringing together individual retrosynthetic steps that have the highest predicted confidences. The confidence is typically related to the frequency of the reaction in the training set. This approach has two main problems that this paper addresses. One problem is that "rare" reactions are seldom selected even if they might actuall ..read more
Visit website
Few-Shot Learning for Low-Data Drug Discovery
Computational Chemistry Highlights
by Jan Jensen
8M ago
Daniel Vella and Jean-Paul Ebejer (2023) Highlighted by Jan Jensen TOC graphic from the article This paper is an update and expansion to this seminal paper by Pande and co-workers (you should definitely read both). It compares the ability to distinguish active and inactive compounds for few-shots methods to more conventional approaches for very small datasets. It concludes that the former outperform the latter for some data sets and not for others, which is surprising given that few-shot methods are designed with very small data sets in mind. Few shot methods learn a graph-based embedding tha ..read more
Visit website
Ranking Pareto optimal solutions based on projection free energy
Computational Chemistry Highlights
by Jan Jensen
9M ago
Ryo Tamura, Kei Terayama, Masato Sumita, and Koji Tsuda (2023) Highlighted by Jan Jensen Figure 1 from the paper. (c) APS 2023. Reproduced under the CC-BY license. One of the main challenges in multi-objective optimisation is how to weigh the different objectives to get the desired results. Pareto optimisation can in principle solve this problem, but of you get too many solutions you have to select a subset for testing, which basically involves (manually) weighing the importance of each objective. This paper proposes a new way to select the potentially most interesting candidates. The idea i ..read more
Visit website
Accelerated dinuclear palladium catalyst identification through unsupervised machine learning
Computational Chemistry Highlights
by Jan Jensen
10M ago
Julian A. Hueffel, Theresa Sperger, Ignacio Funes-Ardoiz, Jas S. Ward, Kari Rissanen, Franziska Schoenebeck (2021) Highlighted by Jan Jensen Figure 1 from the paper. (c) 2021 the authors. I've been meaning to highlight this paper for years but forgot. However, in the last week k-means clustering came up twice in two completely unrelated contexts, which reminded me of this beautiful paper where the authors managed to use ML to make successful predictions based only five data points!  Pd catalysts can exist in either in a dimer or monomer form depending on the ligands and there are no he ..read more
Visit website

Follow Computational Chemistry Highlights on FeedSpot

Continue with Google
Continue with Apple
OR