Volume 2 Supplement 1

5th German Conference on Cheminformatics: 23. CIC-Workshop

Open Access

Adaptation of formal concept analysis for the systematic exploration of structure-activity and structure-selectivity relationships

  • Eugen Lounkine and
  • Jürgen Bajorath
Journal of Cheminformatics 2010, 2(Suppl 1):P21

DOI: 10.1186/1758-2946-2-S1-P21

Published: 04 May 2010

Formal Concept Analysis (FCA) is a data mining and visualization approach originating from information science. It operates on binary relationships between objects and attributes, which are reported in a formal context. Formal concepts are sets of objects that share a defined subset of attributes. FCA organizes these concepts in lattices that reflect their relationship in terms of shared objects and/or attributes and allows the identification of objects with defined sets of attributes [1].
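As a minimal sketch of these definitions (not the authors' implementation), the snippet below enumerates all formal concepts of a small, made-up formal context. The object and attribute names, and the helper functions `common_attributes`, `objects_having` and `formal_concepts`, are assumptions introduced for illustration. Every subset of objects is closed by brute force, which is only practical for toy contexts.

```python
from itertools import combinations

# Toy formal context (hypothetical data): object -> set of attributes it has.
context = {
    "o1": {"a", "b"},
    "o2": {"b", "c"},
    "o3": {"a", "b", "c"},
}


def common_attributes(objects, context, all_attributes):
    """Attributes shared by every object in `objects` (all attributes if empty)."""
    shared = set(all_attributes)
    for obj in objects:
        shared &= context[obj]
    return frozenset(shared)


def objects_having(attributes, context):
    """Objects that possess every attribute in `attributes`."""
    return frozenset(o for o, attrs in context.items() if attributes <= attrs)


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every object subset.

    Exponential in the number of objects; suitable for toy contexts only.
    """
    all_attributes = set().union(*context.values())
    objects = sorted(context)
    concepts = set()
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            intent = common_attributes(subset, context, all_attributes)
            extent = objects_having(intent, context)
            concepts.add((extent, intent))
    return concepts


concepts = formal_concepts(context)

# Sorting by extent size lists the concepts from the bottom of the lattice upwards.
for extent, intent in sorted(concepts, key=lambda c: len(c[0])):
    print(sorted(extent), sorted(intent))
```

Each printed pair is one concept: a set of objects (the extent) together with exactly the attributes they all share (the intent). Ordering the concepts by inclusion of their extents yields the lattice.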

Two adaptations of FCA that allow the systematic analysis of structure-activity and structure-selectivity relationships are presented. Fragment Formal Concept Analysis (FragFCA) assesses the distribution of molecular fragment combinations among ligands with closely related biological targets. This allows the identification of fragment signatures that exclusively occur in compounds with a defined activity profile. FragFCA also identifies fragment combinations that are characteristic of highly potent compounds against defined targets. Fragment signatures usually represent combinations of two or three fragments and can be used to differentiate active compounds of closely related targets for different target families [2, 3].
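The idea of a fragment signature that occurs exclusively in compounds with a defined activity profile can be illustrated with a small sketch. This is a simplified stand-in under stated assumptions, not the published FragFCA procedure: the compounds, fragment labels (`F1` to `F3`), target labels (`T1`, `T2`) and the function `exclusive_signatures` are all hypothetical, and an activity profile is taken to be the exact set of targets a compound is active against.

```python
from itertools import combinations

# Hypothetical toy data: compound -> (fragments present, targets it is active against).
compounds = {
    "c1": ({"F1", "F2", "F3"}, {"T1"}),
    "c2": ({"F1", "F2"}, {"T1"}),
    "c3": ({"F1", "F3"}, {"T2"}),
    "c4": ({"F2", "F3"}, {"T1", "T2"}),
}


def exclusive_signatures(compounds, profile, max_size=3):
    """Fragment combinations found only in compounds with exactly `profile`.

    Candidate combinations (up to `max_size` fragments) are drawn from the
    compounds that match the profile; a candidate is kept if no compound
    outside the profile contains all of its fragments.
    """
    profile = frozenset(profile)
    inside = [frags for frags, targets in compounds.values()
              if frozenset(targets) == profile]
    outside = [frags for frags, targets in compounds.values()
               if frozenset(targets) != profile]

    candidates = set()
    for frags in inside:
        for size in range(1, max_size + 1):
            for combo in combinations(sorted(frags), size):
                candidates.add(frozenset(combo))

    return {c for c in candidates if not any(c <= frags for frags in outside)}


signatures = exclusive_signatures(compounds, {"T1"})
for signature in sorted(signatures, key=lambda s: (len(s), sorted(s))):
    print(sorted(signature))
```

In this toy set no single fragment is exclusive to the `T1`-only compounds, but the pair `F1` + `F2` is, which mirrors the observation that signatures usually consist of two or three fragments.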

Molecular Formal Concept Analysis (MolFCA) is introduced for the systematic comparison of the selectivity of a compound against multiple targets and the extraction of compounds with complex selectivity profiles from biologically annotated databases. Selectivity is assessed based on pair-wise compound potency ratios. This allows the definition of multiple selectivity queries involving the comparison of an arbitrary number of targets and compound potency values or ratios. The individual queries are applied in a sequential manner to retrieve compounds with desired selectivity against targets of interest. MolFCA operates on activity space representations of compounds and thus allows the identification of structurally diverse compounds matching a given selectivity profile [4].
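The sequential application of selectivity queries based on pair-wise potency ratios might look roughly like the sketch below. It is an illustration only, not the MolFCA implementation: the potency values, the target labels `A`, `B` and `C`, the 50-fold threshold, and the helpers `selective_for` and `apply_queries` are assumptions. Potencies are taken to be IC50-like values in nM (lower means more potent), and every compound is assumed to have a measured value for every target.

```python
# Hypothetical potencies in nM (lower = more potent): compound -> target -> value.
potencies_nM = {
    "m1": {"A": 10.0, "B": 1000.0, "C": 20.0},
    "m2": {"A": 5.0, "B": 800.0, "C": 900.0},
    "m3": {"A": 50.0, "B": 60.0, "C": 5000.0},
}


def selective_for(target, over, min_ratio):
    """Build a query: compound is at least `min_ratio`-fold more potent
    against `target` than against `over`."""
    def query(values):
        return values[over] / values[target] >= min_ratio
    return query


def apply_queries(potencies, queries):
    """Apply the queries one after another, keeping only compounds that pass each."""
    hits = dict(potencies)
    for query in queries:
        hits = {name: values for name, values in hits.items() if query(values)}
    return hits


# Profile of interest: selective for A over both B and C.
queries = [
    selective_for("A", over="B", min_ratio=50),
    selective_for("A", over="C", min_ratio=50),
]
hits = apply_queries(potencies_nM, queries)
print(sorted(hits))
```

Because the queries only look at potency values, compounds are retrieved regardless of their chemical structure, which is why structurally diverse compounds can match the same selectivity profile.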

Authors’ Affiliations

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn


  1. Priss U: Ann Rev Inf Sci Technol. 2006, 40: 521. doi:10.1002/aris.1440400120
  2. Lounkine E, Auer J, Bajorath J: J Med Chem. 2008, 51: 5342. doi:10.1021/jm800515r
  3. Krüger F, Lounkine E, Bajorath J: ChemMedChem. 2009, 4: 1174.
  4. Lounkine E, Stumpfe D, Bajorath J: J Chem Inf Model. 2009, 49: 1359. doi:10.1021/ci900095v


© Lounkine and Bajorath; licensee BioMed Central Ltd. 2010

This article is published under license to BioMed Central Ltd.