Volume 6 Supplement 1

9th German Conference on Chemoinformatics

Open Access

The application of statistical methods to cognate docking: A path forward?

  • Gunther Stahl (1),
  • Paul CD Hawkins (2),
  • Mark McGann (2),
  • Matthew T Geballe (2) and
  • Gregory L Warren (2)
Journal of Cheminformatics 2014, 6(Suppl 1):P59

DOI: 10.1186/1758-2946-6-S1-P59

Published: 11 March 2014

Cognate docking has been used for decades as a test of pose prediction quality in docking engines. Although cognate docking is not the problem to which docking engines are applied in normal use (that problem being cross docking), good performance in cognate docking is expected to be a necessary but not sufficient condition for good performance in cross docking. In this talk we report a statistically rigorous analysis of cognate docking using tools in the OEDocking suite [1, 2]. We address three critically important aspects of the cognate docking problem that are frequently handled poorly in publications in this area: dataset quality, methods of comparing the docked pose to the ligand model pose, and analysis of the results to determine whether, and by how much, a given method is actually better than another.

The first problem is handled through the use of our recently published Iridium-HT dataset [3]. To overcome the second problem we use a variety of measures to compare a docked pose to the ligand model pose. In addressing the third problem we utilize a variety of statistical methods to determine whether, and by how much, changes in the scoring functions actually improve cognate docking performance; a major challenge in this area is the paired nature of the deviation data. We caution against the mechanical application of statistical tests, however, and advocate for searching for substantive and meaningful significance, as well as statistical significance.
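The abstract does not say which statistical tests were applied, so the following is only an illustrative sketch of one way to respect the paired nature of the deviation data. It assumes each complex in the dataset yields one deviation value (for example, an RMSD in Å) per scoring function, so the two lists line up complex by complex. The function names, the choice of a bootstrap confidence interval plus a sign-flip permutation test, and the numbers in the usage note are all hypothetical and are not taken from the work described here.

```python
import random
import statistics


def paired_bootstrap_ci(dev_a, dev_b, n_resamples=10000, alpha=0.05, seed=0):
    """Mean paired difference (A - B) with a percentile bootstrap interval.

    dev_a and dev_b hold one deviation per complex, in the same order.
    Resampling is done over the per-complex differences, which keeps
    each pair intact. Returns (mean_difference, lower, upper).
    """
    if len(dev_a) != len(dev_b):
        raise ValueError("paired data must have equal length")
    diffs = [a - b for a, b in zip(dev_a, dev_b)]
    rng = random.Random(seed)
    n = len(diffs)
    means = sorted(
        statistics.fmean(rng.choices(diffs, k=n)) for _ in range(n_resamples)
    )
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return statistics.fmean(diffs), lower, upper


def sign_flip_p_value(dev_a, dev_b, n_permutations=10000, seed=0):
    """Two-sided sign-flip permutation test on the paired differences.

    Under the null hypothesis that the two methods are interchangeable,
    the sign of each per-complex difference is equally likely to be
    positive or negative, so signs are flipped at random.
    """
    if len(dev_a) != len(dev_b):
        raise ValueError("paired data must have equal length")
    diffs = [a - b for a, b in zip(dev_a, dev_b)]
    rng = random.Random(seed)
    observed = abs(statistics.fmean(diffs))
    hits = 0
    for _ in range(n_permutations):
        flipped = [d if rng.random() < 0.5 else -d for d in diffs]
        if abs(statistics.fmean(flipped)) >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly above zero.
    return (hits + 1) / (n_permutations + 1)
```

With made-up deviations such as `old = [1.8, 2.4, 0.9, 3.1, 1.2, 2.7, 1.5, 2.0]` and `new = [1.5, 2.1, 0.7, 2.6, 1.0, 2.5, 1.1, 1.8]`, calling `paired_bootstrap_ci(old, new)` reports the size of the change and `sign_flip_p_value(old, new)` reports how surprising it would be under the null. Looking at the interval alongside the p-value is in the spirit of the caution above: a difference can be statistically detectable and still too small to matter in practice.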

Authors’ Affiliations

(1) OpenEye Scientific Software
(2) OpenEye Scientific Software


  1. McGann M: FRED Pose prediction and virtual screening accuracy. J Chem Inf Model. 2011, 51: 578-596. 10.1021/ci100436p.
  2. McGann M: FRED and HYBRID docking performance on standardized datasets. J Comput Aided Mol Des. 2012, 26: 897-906. 10.1007/s10822-012-9584-8.
  3. Warren GL, Do T, Kelley BP, Nicholls A, Warren SD: Essential considerations for using protein–ligand structures in drug discovery. Drug Disc Today. 2012, 17: 1270-1282. 10.1016/j.drudis.2012.06.011.


© Stahl et al; licensee Chemistry Central Ltd. 2014

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.