TY - JOUR
T1 - Aqueous pKa Prediction for Tautomerizable Compounds Using Equilibrium Bond Lengths
AU - Caine, Bethan
AU - Bronzato, Maddalena
AU - Fraser, Torquil
AU - Kidley, Nathan J.
AU - Dardonville, Christophe
AU - Popelier, Paul
PY - 2020/2/12
Y1 - 2020/2/12
N2 - The accurate prediction of aqueous pKa values for tautomerizable compounds is a formidable task, even for the most established in silico tools. Empirical approaches often fall short due to a lack of pre-existing knowledge of dominant tautomeric forms. In a rigorous first-principles approach, calculations for low-energy tautomers must be performed, in protonated and deprotonated forms, often both in gas and solvent phases, thus representing a significant computational task. Here we report an alternative approach, predicting pKa values for herbicide/therapeutic derivatives of 1,3-cyclohexanedione and 1,3-cyclopentanedione to within just 0.24 units. A model, using a single ab initio bond length from one protonation state, is as accurate as other, more complex regression approaches using more input features, and outperforms the program Marvin. Our approach can be used for other tautomerizable species, to predict trends across congeneric series and to correct experimental pKa values.
AB - The accurate prediction of aqueous pKa values for tautomerizable compounds is a formidable task, even for the most established in silico tools. Empirical approaches often fall short due to a lack of pre-existing knowledge of dominant tautomeric forms. In a rigorous first-principles approach, calculations for low-energy tautomers must be performed, in protonated and deprotonated forms, often both in gas and solvent phases, thus representing a significant computational task. Here we report an alternative approach, predicting pKa values for herbicide/therapeutic derivatives of 1,3-cyclohexanedione and 1,3-cyclopentanedione to within just 0.24 units. A model, using a single ab initio bond length from one protonation state, is as accurate as other, more complex regression approaches using more input features, and outperforms the program Marvin. Our approach can be used for other tautomerizable species, to predict trends across congeneric series and to correct experimental pKa values.
U2 - 10.1038/s42004-020-0264-7
DO - 10.1038/s42004-020-0264-7
M3 - Article
SN - 2399-3669
JO - Communications Chemistry
JF - Communications Chemistry
ER -