Protein pKa calculations

In computational biology, protein pK_a calculations are used to estimate the pK_a values of amino acids as they exist within proteins. These calculations complement the pK_a values reported for amino acids in their free state, and are used frequently within the fields of molecular modeling, structural bioinformatics, and computational biology.

Amino acid pK_a values

The pK_a values of amino acid side chains play an important role in defining the pH-dependent characteristics of a protein. The pH-dependence of enzyme activity and the pH-dependence of protein stability, for example, are properties that are determined by the pK_a values of amino acid side chains.
The pK_a value of an amino acid side chain in solution is typically inferred from the pK_a values of model compounds. See the article on amino acids for the pK_a values of all amino acid side chains inferred in this way. Numerous experimental studies have also yielded such values, for example by use of NMR spectroscopy.
The table below lists the model pK_a values that are often used in a protein pK_a calculation, and contains a third column based on protein studies.

Amino acid	Model pK_a	pK_a (protein studies)
Asp	3.9	4.0
Glu	4.3	4.4
Arg	12.0	13.5
Lys	10.5	10.4
His	6.08	6.8
Cys	8.28	8.3
Tyr	10.1	9.6
N-term		8.0
C-term		3.6
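
As a minimal sketch of how model pK_a values are used, the following Python snippet applies the Henderson–Hasselbalch equation to the values in the second column of the table to estimate the protonated fraction of an isolated side chain at a given pH. The names MODEL_PKA and fraction_protonated are illustrative assumptions, and all effects of the protein environment are ignored.

```python
# Model pK_a values copied from the second column of the table above.
MODEL_PKA = {
    "Asp": 3.9, "Glu": 4.3, "Arg": 12.0, "Lys": 10.5,
    "His": 6.08, "Cys": 8.28, "Tyr": 10.1,
}


def fraction_protonated(pka: float, ph: float) -> float:
    """Henderson-Hasselbalch protonated fraction of a single, isolated group."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


if __name__ == "__main__":
    # Print the protonated fraction of each isolated side chain at pH 7.
    for residue, pka in MODEL_PKA.items():
        print(f"{residue}: {fraction_protonated(pka, 7.0):.3f} protonated at pH 7")
```

Such isolated-group estimates are only a starting point; the sections below describe how the protein environment shifts these values.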

The effect of the protein environment

When a protein folds, the titratable amino acids in the protein are transferred from a solution-like environment to an environment determined by the three-dimensional structure of the protein. For example, in an unfolded protein an aspartic acid is typically in an environment that exposes the titratable side chain to water. When the protein folds, the aspartic acid could find itself buried deep in the protein interior with no exposure to solvent.
Furthermore, in the folded protein the aspartic acid will be closer to other titratable groups in the protein and will also interact with permanent charges and dipoles in the protein.
All of these effects alter the pK_a value of the amino acid side chain, and pK_a calculation methods generally calculate the effect of the protein environment on the model pK_a value of an amino acid side chain.
Typically the effects of the protein environment on the amino acid pK_a value are divided into pH-independent effects and pH-dependent effects. The pH-independent effects are added to the model pK_a value to give the intrinsic pK_a value. The pH-dependent effects cannot be added in the same straightforward way and have to be accounted for using Boltzmann summation, Tanford–Roxby iterations or other methods.
The interplay of the intrinsic pK_a values of a system with the electrostatic interaction energies between titratable groups can produce quite spectacular effects such as non-Henderson–Hasselbalch titration curves and even back-titration effects.
The image below shows a theoretical system consisting of three acidic residues. One group is displaying a back-titration event.
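
The Boltzmann summation mentioned above can be sketched for a small system by enumerating all protonation microstates. The sketch below assumes acidic sites only, expresses all energies in pK units (multiples of RT ln 10), defines the intrinsic pK_a of a site with all other sites protonated (neutral), and takes a hypothetical symmetric matrix w of repulsive interaction energies between deprotonated (charged) sites. The function name and the parameter values in the usage example are illustrative assumptions, not data from a real protein.

```python
import itertools
import math

LN10 = math.log(10.0)


def titration_curves(pk_int, w, ph_values):
    """Exact protonation probabilities by summing over all 2**N microstates.

    pk_int    -- intrinsic pK_a of each acidic site
    w         -- w[i][j]: interaction (pK units) between deprotonated sites i, j
    ph_values -- pH values at which to evaluate the curves
    Returns one list of per-site protonation probabilities for each pH.
    """
    n = len(pk_int)
    states = list(itertools.product((0, 1), repeat=n))  # 1 = protonated
    curves = []
    for ph in ph_values:
        weights = []
        for x in states:
            # Cost of protonating each site at this pH (pK units).
            g = sum(x[i] * (ph - pk_int[i]) for i in range(n))
            # Pairwise repulsion between charged (deprotonated) acids.
            g += sum(w[i][j] * (1 - x[i]) * (1 - x[j])
                     for i in range(n) for j in range(i + 1, n))
            weights.append(math.exp(-LN10 * g))
        z = sum(weights)  # partition function
        curves.append([
            sum(wt * x[i] for wt, x in zip(weights, states)) / z
            for i in range(n)
        ])
    return curves


if __name__ == "__main__":
    # Hypothetical three-site system with strong coupling between sites.
    pk = [4.0, 4.5, 5.0]
    w = [[0.0, 2.0, 2.0], [2.0, 0.0, 2.0], [2.0, 2.0, 0.0]]
    for ph, probs in zip(range(0, 11), titration_curves(pk, w, range(0, 11))):
        print(ph, [round(p, 3) for p in probs])
```

Exhaustive enumeration scales as 2^N and is therefore only practical for a handful of sites; larger systems require Monte Carlo sampling or approximate methods.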

pK_a calculation methods

Several software packages and web servers are available for the calculation of protein pK_a values. See the links below.

Using the Poisson–Boltzmann equation

Some methods are based on solutions to the Poisson–Boltzmann equation (PBE) and are often referred to as FDPB-based methods, where FDPB stands for "finite difference Poisson–Boltzmann". The PBE is a modification of Poisson's equation that incorporates a description of the effect of solvent ions on the electrostatic field around a molecule.
Several of the programs and web servers listed in the software section below use the FDPB method to compute pK_a values of amino acid side chains.
FDPB-based methods calculate the change in the pK_a value of an amino acid side chain when that side chain is moved from a hypothetical fully solvated state to its position in the protein. To perform such a calculation, one needs theoretical methods that can calculate the effect of the protein interior on a pK_a value, and knowledge of the pK_a values of amino acid side chains in their fully solvated states.
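
The final step of such a calculation, converting a computed free energy change into a pK_a shift, can be sketched as follows. The sketch assumes that ddg_deprot_kcal is the deprotonation free energy in the protein minus that in the fully solvated state, in kcal/mol, so that a positive value raises the pK_a. The function name and this sign convention are assumptions made for illustration; the electrostatic calculation that would supply the free energy is not shown.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def pka_in_protein(model_pka, ddg_deprot_kcal, temperature=298.15):
    """Shift a model pK_a by a transfer free energy.

    ddg_deprot_kcal is the deprotonation free energy in the protein minus
    the deprotonation free energy in solution (assumed convention).
    One pK unit corresponds to RT ln 10, about 1.36 kcal/mol at 298.15 K.
    """
    return model_pka + ddg_deprot_kcal / (R_KCAL * temperature * math.log(10.0))


if __name__ == "__main__":
    # Hypothetical example: an Asp whose deprotonation costs 2 kcal/mol more
    # in the protein than in water.
    print(round(pka_in_protein(3.9, 2.0), 2))
```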

Empirical methods

A set of empirical rules relating the protein structure to the pK_a values of ionizable residues has been developed. These rules form the basis of the program PROPKA, which makes rapid predictions of pK_a values.
A more recent empirical pK_a prediction program has also been released together with an online server.

Molecular dynamics (MD)-based methods

Molecular dynamics methods of calculating pK_a values make it possible to include full flexibility of the titrated molecule.
Molecular dynamics-based methods are typically much more computationally expensive, and not necessarily more accurate, than approaches based on the Poisson–Boltzmann equation. Limited conformational flexibility can also be realized within a continuum electrostatics approach, e.g., by considering multiple amino acid side chain rotamers. In addition, commonly used molecular force fields do not take electronic polarizability into account, which could be an important property in determining protonation energies.

Determining pK_a values from titration curves or free energy calculations

From the titration curve of a protonatable group, one can read the so-called pK_1/2, which is equal to the pH value at which the group is half-protonated. The pK_1/2 is equal to the Henderson–Hasselbalch pK_a if the titration curve follows the Henderson–Hasselbalch equation. Most pK_a calculation methods silently assume that all titration curves are Henderson–Hasselbalch shaped, and pK_a values in pK_a calculation programs are therefore often determined in this way. In the general case of multiple interacting protonatable sites, the pK_1/2 value is not thermodynamically meaningful. In contrast, the Henderson–Hasselbalch pK_a value can be computed from the protonation free energy via
$$\mathrm{p}K_a^{\mathrm{HH}}(\mathrm{pH}) = \mathrm{pH} - \frac{\Delta G^{\mathrm{prot}}(\mathrm{pH})}{RT \ln 10}$$
and is thus in turn related to the protonation free energy of the site via
$$\Delta G^{\mathrm{prot}}(\mathrm{pH}) = RT \ln 10 \, \left( \mathrm{pH} - \mathrm{p}K_a^{\mathrm{HH}}(\mathrm{pH}) \right)$$
The protonation free energy can in principle be computed from the protonation probability of the group, $\langle x \rangle(\mathrm{pH})$, which can be read from its titration curve:
$$\Delta G^{\mathrm{prot}}(\mathrm{pH}) = -RT \ln \frac{\langle x \rangle(\mathrm{pH})}{1 - \langle x \rangle(\mathrm{pH})}$$
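
The conversion from a point on a titration curve to a protonation free energy and a Henderson–Hasselbalch pK_a can be sketched in a few lines. The sketch uses the standard relations ΔG = −RT ln(x / (1 − x)) and pK_a = pH − ΔG / (RT ln 10), where x is the protonation probability. The function names and the choice of kcal/mol are assumptions made for illustration.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def protonation_free_energy(x, temperature=298.15):
    """Protonation free energy (kcal/mol) from protonation probability x.

    Requires 0 < x < 1; the expression diverges as x approaches 0 or 1.
    """
    if not 0.0 < x < 1.0:
        raise ValueError("protonation probability must lie strictly between 0 and 1")
    return -R_KCAL * temperature * math.log(x / (1.0 - x))


def hh_pka(ph, x, temperature=298.15):
    """Henderson-Hasselbalch pK_a at a given pH from protonation probability x."""
    dg = protonation_free_energy(x, temperature)
    return ph - dg / (R_KCAL * temperature * math.log(10.0))


if __name__ == "__main__":
    # A group that is 90.9% protonated at pH 5 has a pK_a close to 6.
    print(round(hh_pka(5.0, 10.0 / 11.0), 3))
```

Evaluating hh_pka at several pH values along a computed titration curve shows directly whether the value is constant, as the Henderson–Hasselbalch equation requires, or drifts with pH.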
Titration curves can be computed within a continuum electrostatics approach with formally exact but more elaborate analytical or Monte Carlo methods, or inexact but fast approximate methods. MC methods that have been used to compute titration curves are Metropolis MC or Wang–Landau MC. Approximate methods that use a mean-field approach for computing titration curves are the Tanford–Roxby method and hybrids of this method that combine an exact statistical mechanics treatment within clusters of strongly interacting sites with a mean-field treatment of intercluster interactions.
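
A mean-field treatment in the spirit of the Tanford–Roxby method can be sketched as a self-consistent iteration in which each site sees the average charge of all other sites. The sketch below assumes acidic sites only, energies in pK units, intrinsic pK_a values defined with all other sites protonated, and a hypothetical symmetric matrix w of repulsive interactions between deprotonated sites. The damping factor is an assumption added here to stabilise the iteration and is not part of the method as described above.

```python
def tanford_roxby(pk_int, w, ph, tol=1e-10, max_iter=10000, damping=0.5):
    """Mean-field protonation probabilities at one pH value.

    Each site i titrates with an effective pK_a equal to its intrinsic
    value plus the interaction with the mean charge of the other sites:
        pK_eff[i] = pk_int[i] + sum_j w[i][j] * (1 - x[j])
    The probabilities x are iterated until they stop changing.
    """
    n = len(pk_int)
    x = [0.5] * n  # initial guess: every site half-protonated
    for _ in range(max_iter):
        new_x = []
        for i in range(n):
            shift = sum(w[i][j] * (1.0 - x[j]) for j in range(n) if j != i)
            target = 1.0 / (1.0 + 10.0 ** (ph - pk_int[i] - shift))
            # Damped update (assumption) to avoid oscillations.
            new_x.append(damping * target + (1.0 - damping) * x[i])
        if max(abs(a - b) for a, b in zip(new_x, x)) < tol:
            return new_x
        x = new_x
    raise RuntimeError("mean-field iteration did not converge")


if __name__ == "__main__":
    # Hypothetical pair of identical, mutually repelling acidic sites.
    print(tanford_roxby([4.0, 4.0], [[0.0, 1.0], [1.0, 0.0]], ph=4.0))
```

Because correlations between sites are neglected, the result generally differs from the exact Boltzmann average, most noticeably for strongly coupled sites.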
In practice, it can be difficult to obtain statistically converged and accurate protonation free energies from titration curves if the protonation probability is close to 1 or 0. In this case, one can use various free energy calculation methods to obtain the protonation free energy, such as biased Metropolis MC, free-energy perturbation, thermodynamic integration, the non-equilibrium work method or the Bennett acceptance ratio method.
Note that the Henderson–Hasselbalch pK_a value does in general depend on the pH value.
This dependence is small for weakly interacting groups like well solvated amino acid sidechains on the protein surface, but can be large for strongly interacting groups like those buried in enzyme active sites or integral membrane proteins.

Software for protein pK_a calculations

Accelrys CHARMm based pK_a calculation
Poisson–Boltzmann based pK_a calculations
Multi-Conformation Continuum Electrostatics
pK_a computation with multiple pH adapted conformations
Proton and Electron TITration
Generalized Monte Carlo Titration
Empirical calculation of pK_a values using Residue Depth as a major feature
