
dc.contributor.author: Kneiding, Hannes
dc.contributor.author: Nova Flores, Ainara
dc.contributor.author: Balcells Badia, David
dc.date.accessioned: 2024-12-09T13:19:29Z
dc.date.available: 2024-12-09T13:19:29Z
dc.date.issued: 2023-09-25
dc.description.abstract: Transition metal complexes (TMCs) play a key role in several areas of high interest, including medicinal chemistry, renewable energies, and nanoporous materials. The development of TMCs enabling these technologies remains challenged by the need to optimize multiple properties within very large chemical spaces, in which the thirty transition metals can be combined with a virtually infinite number of ligands. In this work, we provide the open tmQMg-L dataset including 30K TMC ligands, which combines large chemical diversity with synthesizability. The charge and metal-coordination mode of the ligands were robustly defined with a novel algorithm based on graph and natural bond orbital theories. The tmQMg-L dataset was leveraged in the automated generation of 1.37M TMCs resulting from all possible combinations between a square planar palladium(II) scaffold and a pool of 50 different ligands. This TMC space was used to benchmark a multiobjective genetic algorithm (MOGA) that optimized two properties over a Pareto front, namely the polarizability (alpha) and the HOMO-LUMO gap (epsilon). The MOGA evolved 130 TMC hits with maximal (alpha, epsilon) values in a way that could be easily rationalized by analyzing the nature of the ligands selected. Instead of the traditional mutation and crossover of fragments within a single ligand, this MOGA implemented full-ligand genetic operations acting on all coordination sites, maximizing chemical diversity. Further, we extended this MOGA algorithm with the Pareto-Lighthouse functionality (PL-MOGA), which allows for controlling both the aim and scope of the multiobjective optimization over the Pareto front. In explicit spaces containing billions of TMCs, the PL-MOGA enabled the explainable generation of thousands of novel and highly diverse TMC hits.
We believe that the combined use of the tmQMg-L dataset and PL-MOGA algorithm will facilitate the discovery of TMCs with optimal properties within untapped chemical spaces. [en_US]
dc.identifier.citation: Kneiding, Nova Flores, Balcells Badia. Directional Multiobjective Optimization of Metal Complexes at the Billion-Scale with the tmQMg-L Dataset and PL-MOGA Algorithm. ChemRxiv. 2023 [en_US]
dc.identifier.cristinID: FRIDAID 2206905
dc.identifier.doi: 10.26434/chemrxiv-2023-k3tf2-v2
dc.identifier.issn: 2573-2293
dc.identifier.uri: https://hdl.handle.net/10037/35929
dc.language.iso: eng [en_US]
dc.publisher: Cambridge University Press [en_US]
dc.relation.journal: ChemRxiv
dc.relation.projectID: Sigma2: n4654k [en_US]
dc.relation.projectID: EU – Horisont Europa (EC/HEU): 945371 [en_US]
dc.relation.projectID: Norges forskningsråd: 262695 [en_US]
dc.relation.projectID: info:eu-repo/grantAgreement/EC/H2020/945371/Norway/TraCS - Training in Computational Science/CompSci/ [en_US]
dc.rights.accessRights: openAccess [en_US]
dc.rights.holder: Copyright 2023 The Author(s) [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0 [en_US]
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) [en_US]
dc.title: Directional Multiobjective Optimization of Metal Complexes at the Billion-Scale with the tmQMg-L Dataset and PL-MOGA Algorithm [en_US]
dc.type.version: submittedVersion [en_US]
dc.type: Journal article [en_US]
dc.type: Tidsskriftartikkel (Journal article) [en_US]



Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)