Fast automated placement of polar hydrogen atoms in protein-ligand complexes
© Lippert and Rarey; licensee BioMed Central Ltd. 2009
Received: 28 May 2009
Accepted: 12 August 2009
Published: 12 August 2009
Hydrogen bonds play a major role in the stabilization of protein-ligand complexes. The ability of a functional group to form them depends on the position of its hydrogen atoms. An accurate knowledge of the positions of hydrogen atoms in proteins is therefore important to correctly identify hydrogen bonds and their properties. The high mobility of hydrogen atoms introduces several degrees of freedom: Tautomeric states, where a hydrogen atom alters its binding partner, torsional changes where the position of the hydrogen atom is rotated around the last heavy-atom bond in a residue, and protonation states, where the number of hydrogen atoms at a functional group may change. Also, side-chain flips in glutamine and asparagine and histidine residues, which are common crystallographic ambiguities must be identified before structure-based calculations can be conducted.
We have implemented a method to determine the most probable hydrogen atom positions in a given protein-ligand complex. Optimality of hydrogen bond geometries is determined by an empirical scoring function which is used in molecular docking. This allows to evaluate protein-ligand interactions with an established model. Also, our method allows to resolve common crystallographic ambiguities such as as flipped amide groups and histidine residues. To ensure high speed, we make use of a dynamic programming approach.
Our results were checked against selected high-resolution structures from an external dataset, for which the positions of the hydrogen atoms have been validated manually. The quality of our results is comparable to that of other programs, with the advantage of being fast enough to be applied on-the-fly for interactive usage or during score evaluation.
Pharmaceutical research focuses on finding novel ligands for proteins known to be disease-modifying. This research can be assisted by docking calculations which provide in-silico estimations of the binding mode and the binding affinity of putative ligand molecules and a protein . Polar interactions, to which hydrogen bonds belong, play a major role in non-covalent protein-ligand interactions . These hydrogen bonds have a direction which depends on the position of the involved hydrogen atoms. Hence their positions need to be known to correctly assess protein-ligand interactions with respect to their hydrogen bonds. However, they cannot be taken from the input structure but must be calculated for two reasons: First, as stated in the induced fit theory, the protein may respond to a bound ligand with changes in its conformation . The energetically most simple change in conformation is a change in the position of the hydrogen atoms, resulting in a change of the active site's spatio-chemical properties. Second, protein structures may contain ambiguities that result from the experimental method, especially if it is X-ray crystallography, which has been used to determine most of the publicly available structures in the Protein Data Bank (PDB) . Of its currently 50,000 structures, more than 40,000 have been determined using this method. Even though it is a very mature techology, certain ambiguities remain in the obtained structures. Particularly the resolution makes it difficult to detect hydrogen atoms , which must therefore be added in the following refinement of the data. We have modeled four degrees of freedom to predict positions of hydrogen atoms in protein structures:
Tautomeric states, especially in histidine
Two different tautomeric states can be observed for histidine residues. Also, the hydrogen atom at carboxyl groups may change its binding partner.
Torsional angle changes
In hydroxyl, thiol and amine groups, the orientation of the hydrogen atoms is not fixed. Due to the low energy barrier, they may rotate freely around the bond that connects the group to the rest of the molecule
Protonation states have been modeled for four functional groups: Groups that may carry a negative charge by losing one hydrogen include carboxyl and thiol residues, and groups that may carry a positive charge by taking an additional hydrogen atom charge are amine and imidazol groups.
The identity of atoms in amide groups as well as in imidazole rings is hard to determine at common resolutions for protein crystal structures. Hence they may be rotated by 180° with respect to the PDB file's coordinates. In the following we will refer to the rotation of a functional group by by 180° as "side-chain flips".
As for the ligand, we only take into account changes of the torsional angle at hydroxyl, thiol and amine groups.
Several approaches addressing hydrogen placements in crystal structures have been developed. A thorough review can be obtained from Forrest and Honig . Besides programs which calculate the position of hydrogen atoms by molecular dynamics minimization or place them solely by geometric criteria, the most frequently applied are WHAT IF , MolProbity , and the Hbuild  procedure implemented in the X-PLOR package. The programs differ mostly in their objective function and their optimization method. Hbuild uses the CHARMM force field to evaluate the quality of the formed hydrogen bonds, whereas WHAT IF features an empirical scoring function for hydrogen bonds. MolProbity on the other hand, uses "contact-dot" surfaces to model favorable interactions. The programs' optimization procedures can be grouped grossly in three categories. Stochastic, such as the simulated annealing used in WHAT IF, greedy as used in Hbuild and exhaustive search, as used in MolProbity.
Recently, two further methods have been published. The first is the Computational Titration algorithm  which uses a lightweight forcefield with the concept of "hydropathic interactions" as its objective function and an exhaustive enumeration for optimization. The second and most recent method is Protonate3D , which chooses the optimal states according to a chemical model derived from the MMFF94  force field. It applies an exhaustive search on all combinations of admissible states of chemical groups and limits the search space by a prioritization of favorable states.
In this paper we describe Protoss, a new and fast method to calculate hydrogen atom positions based on optimal hydrogen bond networks. In contrast to the previously mentioned programs, our approach differs in two aspects: On the one hand, we ensure on the speed of the calculation by using an efficient dynamic programming approach with "memoization" , i.e. storing partial solutions and combining them to globally optimal solutions. On the other hand, we wanted to model the protein-ligand interface with an established method. Our objective function is based on the hydrogen bonding term of the Boehm scoring function  which has been designed to correctly reflect protein-ligand interactions and is used in the FlexX  molecular molecular docking program.
In the results section, a validation based on a dataset from Forrest and Honig is given. We demonstrate that we were able to reproduce the results with a quality that is comparable to that of the programs in a fraction of time, making the method suitable for high-throughput modeling applications.
The algorithm starts by identifying hydrogen bond networks in the protein-ligand interface. In our context a hydrogen bond network is the maximal set of functional groups for which alternative modes exist and that are able to form hydrogen bonds among themselves. The networks are modeled as graphs: Every modeled degree of freedom is represented by a node, i.e. for each amino acid for which rules exist, included water molecules and all functional groups of the ligand that are treated. Edges stand for interactions between amino acids. Every node is assigned a set of admissible modes, which come from a set of pre-defined rules. The problem is now to find the modes in each network that yield the best hydrogen bond network with respect to our objective function. This can be done efficiently with a dynamic programming approach. The Protoss algorithm is split into two phases, initialization and optimization. The initialization is performed only once for a protein-ligand complex. The generated information can be used for alternative docking poses.
Amino acids (and water) with their modeled degrees of freedom.
Degrees of Freedom
Protonation, dihedral angle
Flip state, protonation
Protonation, dihedral angle
Both hydrogen atoms
Overview of penalties assigned to modes.
Protonated carboxyl group
Positively charged histidine
Optimization via dynamic programming
The first step of the optimization is to transform the graph into a tree. The most central node in the graph is chosen as the root of the tree (although theoretically any node could be chosen). Then, all biconnected components in the graph are replaced by single nodes. These new nodes inherit all edges of the nodes that it replaces and keeps references to all nodes that are part of the biconnected component.
The problem of finding the modes of amino acids that yield the best hydrogen bond network can be solved with a recursive procedure. An algorithm in pseudocode is given in Algorithm 1.
Algorithm 1 – Dynamic programming procedure used in Protoss
The global array "sub" is used for memoization, the array "penalty" contains penalties for uncommon modes. "static_atoms" contains all atoms that have only one mode. Every mode has a unique identifier "id". The function score returns the interaction score of two modes. The attribute "toParent" of cycles denotes the node which was connected to the parent in the unreduced graph. The best modes are found via backtracking, which is not elucidated here for clarity reasons.
1: function CALC_SUBTREE(TreeNode N)
2: for C in N.children do ▻ Skipped if N is leaf
4: if N.isCycle then
5: DECOMPOSE_AND_SOLVE_CYCLE(N.cycle) ▻ Fills sub for N.cycle.toParent
7: for n in N.modes do
8: sub [n.id] = SCORE(n, static_atoms) + penalty [n.id]
9: for C in N.children do ▻ Skipped if N is leaf
10: best = -∞
11: if C.isCycle then
12: for c in C.cycle.toParent.modes do
13: best = max(best, SCORE(n, c) + sub [c.id])
15: for c in C.modes do
16: best = max(best, SCORE(n, c) + sub [c.id])
17: sub [n.id] + = best
18: function FIND_BEST_SCORE(Tree T)
20: best = -∞
21: for r in T.root.modes do
22: best = max(best, sub [r.id])
23: return best
The tree is traversed in a post-order fashion. For every tree node that is visited, the optimal solution of the subtree that it forms is computed and recorded. This is done by calculating two values for every mode: The first value is the interaction score of this mode with the atoms that have only a single mode, i.e. the atoms that are not changed throughout the procedure plus an optional penalty. The second value is summed over all children. For every child of the node, the maximum of the interaction scores with the current mode plus the maximum score of their corresponding subtrees is computed. The two values are added and recorded as the best achievable score for the current mode (lines 9 to 17). Because of the post-order traversal, the scores for the subtrees of the children have always been computed when the corresponding parent node is visited.
If more than one mode is found to yield an optimal score for one node, the mode that is equivalent to that in the PDB-file is chosen in case of protonation states, tautomers and side-chain flips. In nodes that represent rotational degrees of freedom, the median of all optimal modes that represent consecutive angles is chosen.
Decomposition of cyclic dependencies
A circular dependency does not allow for an application of the efficient dynamic programming procedure. Theoretically, all combinations of modes of its members would have to be evaluated. This quickly becomes infeasible because of the combinatorial explosion. Hooft et al. coped with this problem by using a simulated annealing procedure to find a good (but not necessarily the best) combination of modes.
Canutescu et al. limit the complexity of the problem with a branch and bound method. In Protoss we introduce a new concept to find the best scoring set of modes for the cyclic subgraphs.
We decompose cyclic dependencies by removing selected nodes from a compound until there are no cycles left in the graph. While all combinations of modes have to be tested for the removed nodes, the now acyclic part of the subgraph can be handled with the dynamic programming algorithm.
The cycle decomposition is conducted with a depth-first search. If a backedge is encountered during the search, the node of the cycle that has the fewest modes is removed and the search is restarted. This is repeated as long as the graph contains cycles. After all cycles have been decomposed, the dynamic programming algorithm is applied to the remaining non-cyclic parts for each combination of modes for the removed nodes.
We tested the Protoss algorithm in two scenarios. First, we compared our predicted hydrogen positions with those in high-resolution protein structures which were able to determine hydrogen atom positions. Second, we conducted an analysis of so called NQ-flips in the Astex  dataset. NQ flips denote the wrong assignment of oxygen and nitrogen atoms in amides.
Hydrogen position prediction
In order to evaluate the quality of Protoss, we applied it to a test set published by Forrest and Honig  to assess the accuracy of programs which correct hydrogen atom positions in proteins. The test set consists of 34 hydrogen atoms from seven protein structures, which have either a resolution ≤ 0.9Å if they were obtained by X-ray crystallography or a resolution of ≤ 1.8Å if they were obtained by neutron diffraction. The hydrogen atoms have been selected automatically based on the program surfv , and are included, if their predicted solvent accessible surface is zero.
Furthermore, Forrest and Honig define a subset of eight hydrogen atoms for which they were able to confirm the positions by visual inspection of the electron density maps. They are all from a manually refined structure of the xylose isomerase (1muw; Fenn, Ringe, Petsko: unpublished work) . In the following, we adopt the nomenclature used in the publication of the test set and call the complete set the "Buried" set and the subset the "Density" set.
Amide flip prediction
Apart from the rotation of functional groups, Protoss also predicts so-called NQ-flips in protein structures as well as histidine flips. An NQ-flip denotes the wrong assignment of the oxygen and nitrogen atoms in the amide groups of asparagine (N) and glutamine (Q). Histidine residues may be flipped by 180° due to wrong assignments of carbon and nitrogen atoms in the imidazole ring. These flips are a frequently occurring ambiguity in PDB files, and result from the inability to distinguish certain heavy atoms in X-ray crystallography at common resolutions. A thorough statistical study on this matter by Weichenberger and Sippl  shows that about 21.0% of all amides in their test set would have more favorable interactions if they were flipped. Other figures lie in the same range: McDonald and Thornton(15%), Word (20.5%)  and Hooft (18%) .
We ran Protoss on the Astex dataset  which features a diverse set of protein structures with their associated ligands. Out of 4066 amino acids with amide groups in these proteins, 740 (18.2%) were predicted to be flipped, which is in the same range as the aforementioned analyses.
However, a following visual inspection revealed that the vast majority of the flipped groups lie on the surface of the proteins. Since amide groups are very polar, this is not very surprising. It does however limit the relevance of the amide flips to protein-ligand docking. Only 106 (2.6%) of the flips were found to be buried in the protein and less than 20 (< 0.5%) near an active site.
Another flipped side chain occurs in the active site of 1ig3. The PDB conformation in Asn239 would result in atom clashes and a non-optimal hydrogen bond network. If it is flipped, it could interact with Ser236 which in turn interacts directly with the ligand. This is especially important since flipping Asn239 and allowing it to take part in the hydrogen bond network is needed to correctly assess the interaction with the ligand.
After the protein-ligand complex is read in and prepared by the FlexX library routines, and the interaction surfaces are assigned, two phases can be distinguished. First, the alternative modes are generated and the hydrogen bond networks are identified. Second, the optimization procedure to find the best hydrogen positions is conducted.
In a scenario, where repeated optimizations of the hydrogen bond network are conducted, the relatively slow first phase has to be carried out only once. The speed of the second phase however would allow for a fast optimization of the hydrogen bond network in consecutive calls. This scenario might be an adjustment of hydrogen atoms prior to a detailed scoring function evaluation of a protein-ligand complex. This is frequently encountered in docking calculations, where this step has to be done multiple times to evaluate predicted poses.
Strongly interconnected compounds
The time could be reduced by ignoring side chain flips and applying constraints on when an edge is inserted into the graph, i.e. apply a minimum quality threshold on the considered hydrogen bonds.
Further degrees of freedom
In principle, further degrees of freedom influence the hydrogen bond networks. In ligands, tautomeric and/or protonation states may change in response to the protein. Also, interfacial water molecules can be part of the networks. In its current form, Protoss does not consider different protonation states of the ligand or tautomeric forms in the ligand. The method is however capable of including these additinal degrees of freedom. And although Protoss can orient selected water molecules, it can not yet predict the presence of water molecules in the protein-ligand interface. Protonation, tautomers as well as interfacial water molecules are of importance for the prediction of protein-ligand complexes and are therefore an interesting direction for further research.
Predicting stability and alternative hydrogen bonding networks
Our method gains its speed from considering only the best scoring solutions. However, quite different solutions with a similar score might exist. This would for example be the case in active sites of enzymes which feature a catalytic triade where a proton is transferred onto a substrate. Both states, the one before and the one after the transfer of the proton, have valid networks that can both be encountered. Instead of obtaining only the best scoring solution, it would be desirable to obtain a set of stable conformations that are valid for the active site. Since Protoss is intended to be applied on different ligands and and poses individually upon molecular docking and scoring, alternative hydrogen bond networks are of less importance. Nevertheless, an extension in this direciton is methodologically feasible.
The method has been implemented in ISO-C using parts of the FlexX  source code. It is not available as a stand-alone version but will be used as a basis for protein preparation in further projects such as in two-dimensional protein-ligand depiction (Poseview ), re-scoring (HYDE ) and in the preparation of active sites for protein-ligand docking with FlexX.
We have implemented a program that automatically places hydrogen atoms in protein structures with particular focus on protein-ligand interfaces. Having that information is important for any subsequent calculations, particularly in structure-based design approaches to finding new ligands.
The prediction of the positions of hydrogen atoms is consistent with those in the test sets of Forrest and Honig, and the reported rate for amide chain flips is in unison with the rates reported in the literature. In the Density set, all hydrogen atoms are placed within 0.4 Å. For a lower tolerance, i.e. 0.2 Å, this quota is slightly worse. A placement within 0.4 Å however suffices for the FlexX scoring function, as it is robust enough to compensate for inaccuracies of this order. In fact, the inaccuracies can be traced back to this robustness. The important fact is that the hydrogen atoms are placed facing into the right direction, thus making it possible to correctly identify any interactions that they are involved in.
The novelty in this method is that it always finds a maximum score solution with respect to our objective function for all hydrogen positions in a hydrogen bond network, whereas previous methods usually tackled the large search space by resorting to heuristic strategies such as greedy algorithms or stochastic search methods. Our method typically takes less than a tenth second to optimize a hydrogen bond network in an active site, even if it contains cyclic dependencies. However, if cyclic dependencies exist that cannot be decomposed well, this time might be exceeded.
One open question is how to model hydrogen bonds that are formed with bulk solvent molecules. Since water molecules may act both as an acceptor and as a donor and are constantly moving, it is a difficult task to correctly identify and assess hydrogen bonds that may be formed. In summary, we believe that the Protoss method is a useful component for all software tools that model protein-ligand complexes.
Protoss needs to evaluate the quality of the formed hydrogen bond networks. For this task it uses the interaction model that has been incorporated in the docking program FlexX  and which is based on an empirical scoring function genuinely developed by Boehm .
Here, Δ is the deviation from the ideal value, d1 and d2 describe the tolerance towards deviation from the ideal geometry and depend on the functional groups that interact. The three values are multiplied and taken as a measure for the quality of the formed interaction.
such that interactions(m) is the sum of hydrogen bond scores as described above and penalty(m) the penalty for flipping groups or choosing uncommon protonation states as listed in Table 2.
Decomposition of cyclic dependencies
When a cycle is encountered in the dependency graph, nodes are removed until the graph becomes acyclic. Determining if a subset V' ⊆ V exists with |V'| ≤ k for a positive integer k is known as the Vertex Feedback Set problem, which is NP-hard . In our context the graphs are relatively small. Therefore we can rely on a greedy heuristic to minimize the number of times the remaining graph has to be traversed. This is important since the traversal has to be conducted for each combination of modes in V'. The algorithm applied here iteratively searches for cycles. Once a cycle is found, the node with the minimal number of modes becomes part of the subset.
Sampling of water orientations
In order to model the many possibilities that exist for the orientation of water molecules, we sample 60 orientations. We use an icosahedron that is centered at the position of the oxygen atom to ensure an equal distribution of the individual samples. An initial placement is created by orientating the water molecule such that the angle bisector of its opening angle points to one of the corners of the icosahedron. Then an orientation is created for each of the five corners that lie opposed to the first corner: The coordinates of the water's hydrogen atoms are chosen such that they lie in a plane with each of the second corners. This results in five orientation for each of the twelve corners, totaling up to 60 orientations for a water molecule.
- Rarey M, Degen J, Reulecke I: Bioinformatics – from genomes to therapies. 2007, Weinheim: Wiley-VCH, 2: 541-View ArticleGoogle Scholar
- Gohlke H, Klebe G: Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Angewandte Chemie (International ed. in English). 2002, 41 (15): 2644-2676. 10.1002/1521-3773(20020802)41:15<2644::AID-ANIE2644>3.0.CO;2-O.View ArticleGoogle Scholar
- Koshland D: Application of a Theory of Enzyme Specificity to Protein Synthesis. Proceedings of the National Academy of Sciences of the United States of America. 1958, 44 (2): 98-104. 10.1073/pnas.44.2.98.View ArticleGoogle Scholar
- Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The Protein Data Bank. Nucleic Acids Research. 2000, 28: 235-242. 10.1093/nar/28.1.235.View ArticleGoogle Scholar
- Glusker J, Lewis M, Rossi M: Crystal Structure Analysis for Chemists and Biologists. 1994, New York: VCH PublishersGoogle Scholar
- Forrest L, Honig B: An assessment of the accuracy of Methods for predicting hydrogen positions in protein structures. Proteins. 2005, 61 (2): 296-309. 10.1002/prot.20601.View ArticleGoogle Scholar
- Hooft R, Sander C, Vriend G: Positioning hydrogen atoms by optimizing hydrogen-bond networks in protein structures. Proteins. 1996, 26 (4): 363-376. 10.1002/(SICI)1097-0134(199612)26:4<363::AID-PROT1>3.0.CO;2-D.View ArticleGoogle Scholar
- Word J, Lovell S, LaBean T, Taylor H, Zalis M, Presley B, Richardson J, Richardson D: Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. Journal of Molecular Biology. 1999, 285 (4): 1711-1733. 10.1006/jmbi.1998.2400.View ArticleGoogle Scholar
- Brunger A, Karplus M: Polar hydrogen positions in proteins: empirical energy placement and neutron diffraction comparison. Proteins. 1988, 4 (2): 148-156. 10.1002/prot.340040208.View ArticleGoogle Scholar
- Bayden A, Fornabaio M, Scarsdale J, Kellogg G: Web application for studying the free energy of binding and protonation states of protein-ligand complexes based on HINT. J Comput Aided Mol Des. 2009, 23 (9): 621-632.View ArticleGoogle Scholar
- Labute P: Protonate3D: Assignment of ionization states and hydrogen coordinates to macromolecular structures. Proteins: Structure, Function, and Bioinformatics. 2009, 75: 187-205. 10.1002/prot.22234.View ArticleGoogle Scholar
- Halgren T: Merck molecular force field .1. Basis, form, scope, parameterization, and performance of MMFF94. Journal Of Computational Chemistry. 1996, 17 (5–6): 490-519. 10.1002/(SICI)1096-987X(199604)17:5/6<490::AID-JCC1>3.0.CO;2-P.View ArticleGoogle Scholar
- Michie D: Memo Functions and Machine Learning. Nature. 1968, 218 (5136): 19-22. 10.1038/218019a0.View ArticleGoogle Scholar
- Boehm H: The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure. Journal of Computer-Aided Molecular Design. 1994, 8 (3): 243-256. 10.1007/BF00126743.View ArticleGoogle Scholar
- Rarey M, Kramer B, Lengauer T, Klebe G: A fast flexible docking method using an incremental construction algorithm. Journal of Molecular Biology. 1996, 261 (3): 470-489. 10.1006/jmbi.1996.0477.View ArticleGoogle Scholar
- Kobayashi T, Takimura T, Sekine R, Kelly V, Kamata K, Sakamoto K, Nishimura S, Yokoyama S: Structural snapshots of the KMSKS loop rearrangement for amino acid activation by bacterial tyrosyl-tRNA synthetase. Journal of Molecular Biology. 2005, 346: 105-117. 10.1016/j.jmb.2004.11.034.View ArticleGoogle Scholar
- Canutescu A, Shelenkov A, Dunbrack R: A graph-theory algorithm for rapid protein side-chain prediction. Protein Science: a Publication of the Protein Society. 2003, 12 (9): 2001-2014.View ArticleGoogle Scholar
- Hartshorn M, Verdonk M, Chessari G, Brewerton S, Mooij W, Mortenson P, Murray C: Diverse, high-quality test set for the validation of protein-ligand docking performance. Journal of Medicinal Chemistry. 2007, 50 (4): 726-741. 10.1021/jm061277y.View ArticleGoogle Scholar
- Sridharan S, Nicholls A, Honig B: A new Vertex Algorithm to calculate Solvent accessible Surface Area. Biophys J. 1992, 61: 174.Google Scholar
- Manually refined xylose isomerase structure 1muw. available upon request from the authors of this article. 10.1107/S0907444998018514.
- Bon C, Lehmann M, Wilkinson C: Quasi-Laue neutron-diffraction study of the water arrangement in crystals of triclinic hen egg-white lysozyme. Acta Crystallographica. Section D, Biological Crystallography. 1999, 55 (Pt 5): 978-987. 10.1073/pnas.161301298.View ArticleGoogle Scholar
- Whittington D, Waheed A, Ulmasov B, Shah G, Grubb J, Sly W, Christianson D: Crystal structure of the dimeric extracellular domain of human carbonic anhydrase XII, a bitopic membrane protein overexpressed in certain cancer tumor cells. Proceedings of the National Academy of Sciences of the United States of America. 2001, 98 (17): 9545-9550.View ArticleGoogle Scholar
- Golebiowski A, Townes J, Laufersweiler M, Brugel T, Clark M, Clark C, Djung J, Laughlin S, Sabat M, Bookland R, VanRens J, De B, Hsieh L, Janusz M, Walter R, Webster M, Mekel M: The development of monocyclic pyrazolone based cytokine synthesis inhibitors. Bioorganic & Medicinal Chemistry Letters. 2005, 15 (9): 2285-2289.View ArticleGoogle Scholar
- Weichenberger C, Sippl M: Self-consistent assignment of asparagine and glutamine amide rotamers in protein crystal structures. Structure. 2006, 14 (6): 967-972.View ArticleGoogle Scholar
- Word J, Lovell S, Richardson J, Richardson D: Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. Journal of Molecular Biology. 1999, 285 (4): 1735-1747.View ArticleGoogle Scholar
- Wang R, Fang X, Lu Y, Wang S: The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. Journal of Medicinal Chemistry. 2004, 47 (12): 2977-2980.View ArticleGoogle Scholar
- Shah N, Kuntz D, Rose D: Comparison of kifunensine and 1-deoxymannojirimycin binding to class I and II alpha-mannosidases demonstrates different saccharide distortions in inverting and retaining catalytic mechanisms. Biochemistry. 2003, 42 (47): 13812-13816.View ArticleGoogle Scholar
- Stierand K, Rarey M: From modeling to medicinal chemistry: automatic generation of two-dimensional complex diagrams. Chem Med Chem. 2007, 2 (6): 853-860.View ArticleGoogle Scholar
- Reulecke I, Lange G, Albrecht J, Klein R, Rarey M: Towards an integrated description of hydrogen bonding and dehydration: decreasing false positives in virtual screening with the HYDE scoring function. Chem Med Chem. 2008, 3 (6): 885-897.View ArticleGoogle Scholar
- Garey MR: Computers and Intractability. 1979, New York: W. H. Freeman and CompanyGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.