Edit atom indices of RDKit Mol object #memo #cheminformatics
Is Life Worth Living?
by iwatobipen
1w ago
Atom indecies are unique number of each atom. And RDKit adds the index when mol object is generated. RDKit makes mol object from SMILES, Inchi, molblock and lots of formats. To make canonical representation of molecules, the atom indices are asigned allways same roules in automatically. The indices are asigned regardless of scaffold. So if you have two molecules which have different substituents and same scaffold will not have same indices on the scaffold. To memorize the indices atom map numbers are used sometime but I would like to know is it possible to do same thing with atom indices, it m ..read more
Visit website
Enumerate molecules with CXSMILES #RDKit #cheminforamtics #memo
Is Life Worth Living?
by iwatobipen
2w ago
Recent version of RDKit supports not only SMILES but also CXSMILES (chemaxon extended smiles). As name shown that CXSMILES can add lots of informations in SMILES strings but it’s little bit difficult to understand. But I found that is useful to enumerate molecules ;) For example I would like to enumerate core and two R-groups from SMILES, there are lots of way to do that. MolZip is one of the useful way to do that however annotation is required to enumerate molecules. Today I tried to enumerate molecules from CXSMILES with molecule enumerator. More details of Enumerator is described in Greg’s ..read more
Visit website
Current status of DMTA cycle in AZ #memo #DDT #publication #AI/ML
Is Life Worth Living?
by iwatobipen
3w ago
March is end of fiscal year in most of Japanese company. I spent lots of time for paper work in these days… ;P As many readers know that, DMTA cycle is key of drug discovery/optimization process and lots of computational and high thoughput experimental apporaches are available in the process recenlty. I think AZ is the front runner of these area and I respect the activity because they do not only paper publication but also sharing the code. And by sharing the code, the code is improved by good open science community. Today I read nice open access publication, published by Gian Marco et. al. fr ..read more
Visit website
Predict protein-ligand complex dynamics with python #dynamicbind #cheminformatics #memo
Is Life Worth Living?
by iwatobipen
1M ago
Computer aided drug design is one of the powerful approach for drug discovery these days. Docking study of target protein and ligands is common proceduer to evaluate whether the compound fit target protein’s pocket or not. However there is a limitation in the method. Most of the docking apporach handle protein and ligand as rigid body, so it’s difficult to consider the dynamics. Molecular dynamics can solve the limitation but it requires lots of computational resouces. Recent progress of Protein Language Model(PLM), these model can predict protein’s 3D structure from sequences. But these predi ..read more
Visit website
Pocket awaer structure generation #DiffDec #cheminformatics
Is Life Worth Living?
by iwatobipen
1M ago
Diffusion model is the one of hot area of generative model. It’s not only computer vision but also cheminformatics. Diffusion model is interesting because it generates object from some noise. BTW, de novo compound design with target protein structure information is really attractive but difficult approach in drug design. There are some approaches to conduct it with diffusion model such as DiffSBDD. DiffSBDD generates molecule with pockect information but sometime generates strange molecules. One reason of it is DiffSBDD generates whole molecule from diffusion process without any information of ..read more
Visit website
Update rdkit/shape-it #RDKit #shape-it #cheminformatics
Is Life Worth Living?
by iwatobipen
1M ago
Today I tried to build rdkit/shape-it because I could not build shape-it with current version of rdkit. So I struggled error message to fix the issue.(I’m not so good at C++ ;P) There are two issues in the current code. 1. Version of c++ in the CMakeLists.txt is old, so I changed it from c++14 to c++17 2. Version of Catch2 is old too. Recent version of catch2 is 3.x. Old version of catch2 cause problem during the make process, so I update CMakeLists.txt and some cpp files. After the modification, build process worked well and I could call shape-it from python. Here is an install process. $ g ..read more
Visit website
Visualize feature importance with marimo #cheminformatics #RDKit #marimo
Is Life Worth Living?
by iwatobipen
2M ago
I posted new generation of notebook, marimo recently. It is cool and easy to make interactive analysis environment with python. I’m interested in the package and am thinking how to use in chemoinformatics tasks. In QSAR tasks, chemoinformaticians are often asked the reason of prediction of the model. So XAI (explainable AI) is an attractive area in the field. rkakamilan shared really useful posts about visualize feature importance of ML models in his blog site with code. Now I’m writing code for visualize ML weights with compound structure and most of code came from his post. There are lots of ..read more
Visit website
Run conformer generation in parallel #RDKit #Cheminformatcs #lig3dlens
Is Life Worth Living?
by iwatobipen
2M ago
There are useful open source packages not only SBDD but also LBDD in these days. VSFlow is the one of useful package for LBDD and I used it in my cheminformatcs tutorial before. And recently I found another interesting package for LBDD named lig3dlens which is developed by researchers at healx, AI drug discovery company. The features of lig3dlens are well described in their blog post. https://healx.ai/lig3dlens-an-open-source-ligand-based-3d-virtual-screening-toolbox/ The package can generate 3d conformer and aling molecules not noly shape base but also electrostatics base with espsim. It seem ..read more
Visit website
New ML package for cheminformatics #cheminformatics #QSAR #ML
Is Life Worth Living?
by iwatobipen
3M ago
I introduced scikit-mol in my blog post before. The package integrates scikit-learn and rdkit. It’s easy to use because user can build QSAR model from scikit-learn’s API. I like the package. And recently I found another useful package for cheminformatics named ‘molflux‘ witch is developed by researchers in Exsicentia, famous AI Drug Discovery pharma. molflux supports lots of useful methods for building cheminformatics related predictive models. Featurization with various fingerprints and molecular descriptors. Splitting data with various strategies not only mathmatical way but also cheminform ..read more
Visit website
New type of python notebook #marimo #cheminformatics #RDKit
Is Life Worth Living?
by iwatobipen
3M ago
Jupyter-lab, Jupyter-note book, streamlt and other packages are useful for data science beucase it can analyze and visualize data step by step. I like streamlit and dash for making simple web app. And some days ago I found new and cool package named marimo. From the documentaion, marimo is an open-source reactive notebook for Python — reproducible, git-friendly, executable as a script, and shareable as an app. You can read details and examples from original site. I have interest the package so I tried to use it on cheminformatics tasks. Following example is.. 1, Make smiles input for ..read more
Visit website

Follow Is Life Worth Living? on FeedSpot

Continue with Google
Continue with Apple
OR