Volume 4 Supplement 1

7th German Conference on Chemoinformatics: 25 CIC-Workshop

Open Access

In silico pK a prediction

  • Robert Körner1, 2Email author,
  • Iurii Sushko1,
  • Sergii Novotarskyi1, 2 and
  • Igor V Tetko1, 2
Journal of Cheminformatics20124(Suppl 1):P55

DOI: 10.1186/1758-2946-4-S1-P55

Published: 1 May 2012

The biopharmaceutical profile of a compound depends directly on the dissociation constants of its acidic and basic groups, commonly expressed as the negative decadic logarithm pKa of the acid dissociation constant (Ka). The acid dissociation constant (also protonation or ionization constant) Ka is an equilibrium constant defined as the ratio of the protonated and the deprotonated form of a compound. The pKa value of a compound strongly influences its pharmacokinetic and biochemical properties. Its accurate estimation is therefore of great interest in areas such as biochemistry, medicinal chemistry, pharmaceutical chemistry, and drug development. Aside from the pharmaceutical industry, it also has relevance in environmental ecotoxicology, as well as the agrochemicals and specialty chemicals industries.

In literature, a vast number of different approaches for pKa prediction can be found [1]. These approaches can be divided into two different classes. On the one hand there are direct calculations, so called ab initio methods, trying to determine the pKa value by quantum chemical or mechanical computation. On the other hand, statistical models, trained on chemical or structural descriptors. These descriptors can be, for example, of quantum chemical, semi empirical, graph topological or simple statistical nature. This type of modeling is called QSPR (Quantitative Structure Property Relationship).

In our recent work, we develop such a QSPR model using localized molecular descriptors to train multiple linear regression and artificial neural networks to estimate dissociation constants (pKa). The performance of our approach is similar to that of a semi-empirical model based on frontier electron theory [2] as well as a prediction model based on Graph Kernels [3].

How such a prediction model can be built, is shown by an example performed with OCHEM, an online chemical database with an environment for modeling (http://ochem.eu/). It is a publicly accessible database for chemical compound data and predictive models. Further, users get the facility to develop, apply, and distribute predictive models, so it is unique in its combination of compound data and predictive models.

Authors’ Affiliations

Helmholtz Zentrum München


  1. Rupp M, Körner R, Tetko IV: Predicting the pKa of small molecule. Combinatorial chemistry & high throughput screening. 2011, 14 (5): 307-327.Google Scholar
  2. Tehan BG, et al: Estimation of pKa Using Semiempirical Molecular Orbital Methods. Part 1: Application to Phenols and Carboxylic Acids. Quant Struct-Act Relat. 2002, 21: 457-472. 10.1002/1521-3838(200211)21:5<457::AID-QSAR457>3.0.CO;2-5.View ArticleGoogle Scholar
  3. Rupp M, Körner R, Tetko IV: Estimation of Acid Dissociation Constants Using Graph Kernels. Molecular Informatics. 2010, 29: 731-740. 10.1002/minf.201000072.View ArticleGoogle Scholar


© Körner et al; licensee BioMed Central Ltd. 2012

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.