A tandem regression-outlier analysis of a ligand cellular system for key structural modifications around ligand binding
- Ying-Ting Lin^{1}Email author
DOI: 10.1186/1758-2946-5-21
© Lin; licensee Chemistry Central Ltd. 2013
Received: 29 August 2012
Accepted: 24 April 2013
Published: 30 April 2013
Abstract
Background
A tandem technique of hard equipment is often used for the chemical analysis of a single cell to first isolate and then detect the wanted identities. The first part is the separation of wanted chemicals from the bulk of a cell; the second part is the actual detection of the important identities. To identify the key structural modifications around ligand binding, the present study aims to develop a counterpart of tandem technique for cheminformatics. A statistical regression and its outliers act as a computational technique for separation.
Results
A PPARγ (peroxisome proliferator-activated receptor gamma) agonist cellular system was subjected to such an investigation. Results show that this tandem regression-outlier analysis, or the prioritization of the context equations tagged with features of the outliers, is an effective regression technique of cheminformatics to detect key structural modifications, as well as their tendency of impact to ligand binding.
Conclusions
The key structural modifications around ligand binding are effectively extracted or characterized out of cellular reactions. This is because molecular binding is the paramount factor in such ligand cellular system and key structural modifications around ligand binding are expected to create outliers. Therefore, such outliers can be captured by this tandem regression-outlier analysis.
Background
In any chemical analysis of a single cell, the first step is the separation of wanted chemicals from the bulk of a cell. This is due to the fact that a cellular system has a complex, heterogeneous composition. Various methods [1] using hard equipment have been developed for such uses. After a single cell is separated from the other cells, the wanted component can be further isolated and then detected through what is called a tandem technique [1]. The first part of a tandem technique, as mentioned above, is the separation of wanted chemicals from the bulk of a cell; the second part is the detection of the components. By mimicking such a tandem technique, a computational counterpart was developed herein; a statistical regression and its outliers (influential observations [2]) act as a computational technique for separation, which can cause the important identities (i.e. the factors causing outliers) to be isolated from the bulk of a cellular system. As a pioneer investigation, one molecular descriptor and one class of descriptors will be prepared: the descriptor resembles the filter in the tandem equipment; the class resembles the detector.
In a ligand-dependent receptor-mediated cellular system (or ligand cellular system), key structural modifications surrounding ligand binding are expected to cause outliers. For example, hydrogen bond formation, or deformation, can cause drastic alterations in cellular reaction. These singular situations are, at times, the reasons for statistical breakdown points in many analyses that are otherwise correct (i.e. resulting in the outliers of a statistical regression [3, 4]). At the same time, such outliers can have the most prominent and often most informative features of the target-specific activity landscapes [5]. Therefore, the concept that after this tandem regression-outlier analysis, the features of these resulting outliers can correspond to important structural modifications around molecular binding in such a ligand cellular system, if correct, would be very useful.
For this first tandem part, we sought the most representative descriptor for the bulk system of a cell. We found Jurs_RNCG [6], after observing more than 521×17 data sets of PPARγ (peroxisome proliferator-activated receptor gamma) agonists [7–21]. The methods and results are depicted in the first body of materials, methods, and results. To connect this to the second tandem part, the descriptor that is sought in the first part has functionality, which yields outlier residues for the second part.
Acting as an assay for detection in the second tandem part, for which the electrotopological state (ES) class of descriptors [22–26] is used. All possible structural modifications in a given collected analog set are pre-assigned by ES descriptors. The ES descriptors involve atom types in various electro-topological states. For example, in ES terminology, an ES_Count_ssO of a molecular structure is the count of “ssO” linkages, and here the “ssO” represents a bonding oxygen atom (O) linked via two single bonds (ss). A structural modification is considered a fundamental element (an action) for the reaction of such a ligand cellular system. In actuality, any structural modification of such an analog set can be expressed by the change of an associated ES descriptor. The details and results of the second part are also depicted in the second body of materials, methods and results.
This tandem regression-outlier technique is, therefore, in mathematical terms, carried out so as to prioritize the context equations tagged with features of these outliers. We want to know if the top-ranked structural modifications correspond to the key interactions around molecular binding as we expected them to. This expectation was based on the fact that: I: this singular situation causes outliers in a regression. II: molecular binding is the paramount factor in such a ligand cellular system, and III, key structural modifications around ligand binding are expected to create singular situations, i.e. cause outliers in a statistical regression.
In the end, after this tandem regression-outlier analysis for the PPARγ agonist cellular system, a top ranked ES symbol can faithfully correspond to key interactions around molecular binding with the correct order of potency. The outcome of such an analysis confirmed the two main underlying and mutually-dependent speculations; one being that, in the second tandem part, the top ranked ES symbols reflect the key interactions around ligand binding, and the other that, in the first tandem part, the designation of Jurs_RNCG (relative negative charge) can effectively remove the general effects of such a ligand cellular system.
Methods
The first tandem filter: in order to seek the most representative descriptor for a ligand cellular system
where Y is the dependent variable that stands for the ligand-dependent receptor-mediated cellular reaction, X_{ch} is the descriptor to be chosen, and β_{0} and β_{ch} are the regression coefficients after the least squares fit. Once the ligand-dependent receptor-mediated cellular data of a given set of analogs are available, the r^{2} correlation fit can be obtained for each descriptor. Here, 521 descriptors of eminent classes are used. All descriptors in the working equation with correlation fits are prioritized by correlation coefficient. All calculations of descriptors were performed using the Discovery Studio 2.1 QSAR module [27]. The regression fits were conducted for each descriptor in the context equation and Pearson’s coefficients were performed using R 2.11.0 [28].
The second tandem detector: to prioritize the context equations tagged with all possible features of the outliers
where Y is the dependent variable standing for the ligand-dependent receptor-mediated cellular reaction, Jurs_RNCG is the calculated Jurs descriptor, ES are all the possible ES descriptors; and all βs are the estimated regression coefficients after the least squares fit. The context equations tagged with all possible ES descriptors are prioritized by correlation coefficient. 12 top-ranked ES descriptors monitored in the table indicate 12 important structural modifications in a given analog set.
Materials
Three data sets of analogs with two cores
To demonstrate the ability of this tandem regression-outlier analysis to remove all interference from any general effects in a ligand cellular system, three data sets of the ligand-dependent receptor-mediated data are used here. The first data set is a collection of 46 PPARγ agonists with the thiazolidinedione (TZD) core. The second data set is composed of 178 PPARγ agonists with a carboxylic acid core. The third data set is a merger of the first and second data set (i.e., 224 PPARγ agonists mixed with both TZD and carboxylic acid cores). The two main cores of PPARγ agonists and their merger are adopted, so as to observe the variations of top-ranked structural modifications. All EC_{50} (50% efficacy concentration) data were extracted from the literature [7–21]. The cellular reaction is the measurement of the activation of PPARγ within the construct of the cellular transactivation assays. Indeterminate and uncertain EC_{50} values were excluded. A negative logarithm of the EC_{50} values of PPARγ agonists was then taken. The original publication of all agonists and the activity quantities are listed in Additional file 1: Tables S1 and S2. All images of molecular structures were created by using Pybel [29, 30]. All molecular structures were energetically geometry-optimized using molecular mechanics and MMF97 calculations, which were implemented using the ChemBio3D software of the ChemBioOffice package [31].
Results
Jurs_RNCG as most representative descriptor
Dominant descriptors for each data size and the frequency of Jurs_RNCG (Jurs type descriptors) throughout 521 data sets are summarized here
Data size | Dominant descriptor (Near dominant) | Frequency of Jurs descriptors (Jurs_RNCG/Jurs type /521 data sets) |
---|---|---|
10 | Molecular_PolarSASA | 4/41/521 |
20 | Molecular_FractionalPolarSASA | 5/25/521 |
30 | ES_Count_dO^{b} | 4/14/521 |
40 | Num_RingBonds | 10/14/521 |
50 | SC_3_P^{a} | 9/16/521 |
60 | SC_3_P^{a} | 9/12/521 |
70 | IC^{a} | 18/31/521 |
80 | IC^{a} | 18/33/521 |
90 | Jurs_RNCG | 21/31/521 |
100 | IC^{a} (Jurs_RNCG) | 19/31/521 |
110 | Jurs_RNCG | 16/21/521 |
120 | Num_AtomClasses (Jurs_RNCG) | 21/23/521 |
130 | IC^{a} (Jurs_RNCG) | 28/29/521 |
140 | IC^{a} (Jurs_RNCG) | 39/306/521 |
150 | Jurs_RNCG | 39/39/521 |
160 | IC^{a} (Jurs_RNCG) | 53/53/521 |
170 | Jurs_RNCG | 369/369/521 |
In a realistic physical-chemical representation, one would prefer the Jurs_RNCG to the IC (Information Content) [32] as the most representative descriptor for all general effects. This is because Jurs_RNCG was originally designed based on the charge-related nature. The descriptor IC, as an index of graph theory, deals with the topological aspect in nature. Therefore, the Jurs_RNCG descriptor here is thought to be the most representative single descriptor for all general effects in the PPARγ agonist cellular system. After this designation, to our surprise, the Jurs_RNCG is further shown to be a linear combination of three important descriptors: LogD (partition coefficient), PSA (polar surface area), and shape-like descriptors in a subsequent work [33]. These three descriptors happen to be the three most important factors of investigation in medicinal chemistry over the past 50 years [34].
Top-ranked ES descriptors as important structural modifications around ligand binding
The top-ranked ES descriptors of 46 TZD PPARγ agonists
ES descriptors | Rank | Sign of β_{ES} |
---|---|---|
Count_ssO | 1 | - |
Sum_ssO | 2 | - |
Sum_sssN | 3 | + |
Count_sssN | 4 | + |
Count_ssCH2 | 5 | + |
Sum_aaO | 6 | + |
Count_aaO | 7 | + |
Count_sCH3 | 8 | - |
Sum_aaN | 9 | + |
Count_aaN | 10 | + |
Count_dsCH | 11 | - |
Sum_dsCH | 12 | - |
The top-ranked ES descriptors of 178 carboxylic acid PPARγ agonists
ES descriptors | Rank | Sign of β_{ ES } |
---|---|---|
Sum_ssO | 1 | - |
Count_ssO | 2 | - |
Sum_aaO | 3 | + |
Count_aaO | 4 | + |
Count_aaaC | 5 | + |
Sum_ssCH2 | 6 | - |
Count_aaaC | 7 | + |
Count_ssCH2 | 8 | - |
Count_aaNH | 9 | + |
Sum_aaNH | 10 | + |
Count_ssssC | 11 | - |
Sum_sssN | 12 | + |
The top-ranked ES descriptors of 224 PPARγ agonists with both TZD and carboxylic acid cores
ES descriptors | Rank | Sign of β_{ ES } |
---|---|---|
Count_ssO | 1 | - |
Sum_ssO | 2 | - |
Sum_aaO | 3 | + |
Count_aaO | 4 | + |
Sum_aaaC | 5 | + |
Count_aaaC | 6 | + |
Count_sssN | 7 | + |
Sum_sssN | 8 | + |
Count_aaNH | 9 | + |
Sum_aaNH | 10 | + |
Sum_aaN | 11 | + |
Count_aaN | 12 | + |
Next, the ES symbol sssN, indicates the introduction of a tertiary amine (sssN) moiety, which is a positive structural modification. ‘sss’ indicates three single bonds linked in structure. Nine agonists have 1 sssN moiety of the 46 collected agonists. The corresponding feature of the ES symbol, sssN, in the tertiary amine (sssN) moiety, is shown in Figure 1(b).
The next ES symbol in Table 2 is ssCH2. The positive regression coefficient indicates that the elongation of a ligand structure through the addition of carbon moiety (ssCH2) is a positive structural modification. This topological elongation of the agonist makes a large impact to the molecular binding. However, we notice that the ssCH2 symbol is negative in Table 3. And, when combining two data sets in Table 4, the topological structural modification ssCH2 falls out of the monitor table. Apparently, the suitable length of the TZD agonist is optimal for cellular binding.
The ES symbol following this in Table 2 is aaO. ‘aa’ indicates oxygen atom in an aromatic ring. Throughout the whole 46 TZD agonists there are only two agonists: AD-7075 and BRL48482, in Figure 1(c), that have the oxazole moiety. We notice that these two agonists serve as very good examples of “analog outliers”, which bear specific feature of outliers The other ES symbols, aaO, aaN and aaCH, also indicate this oxazole moiety. The 5-methyl-oxazole of AD-7075 has the additional symbols aasC and sCH3 whereas the benzo-oxazole of BRL48482 has the additional symbol aaaC. Taken together, the significance of oxazole moieties are faithfully pointed out by these symbols. The other monitored symbols, sCH3 and dsCH, indicate other, less important, structural modifications.
Lastly, in Table 3 and Table 4, we find a similar picture regarding the potency order of structural modifications. The top ES symbol, ssO, represents the most important structural modification, tyrosine moiety, performed on the middle part of PPARγ agonist. The rest of the top-ranked ES symbols represent important structural modifications done to the tail part. For example, a fmoc-like moiety (fluorenylmethyloxycarbonyl-like), other than oxazole moiety and tertiary amine, is another case which has an ES symbol of significance: aaaC. Again, serving as analog outliers, these two potent agonists [18] are shown in Figure 1(d).
There is one more observation: we can clearly see that no ES symbol regarding TZD moieties, dssC, dO, sssCH, ssNH, or ssS, appears in Table 2; and we can also see that no carboxylic acid symbols, such as dssC, dO, or sOH, appear in Table 3. ‘d’ indicates double bond. When combining the two data sets, no ES symbols regarding TZD or carboxylic acid appear in Table 4. There are two simple interpretations of this: 1, the lack of modification of the core part of analogs in a given set will naturally lead to no ES symbols monitored in the table, and 2, a core shift in the combined set without causing a large difference of reaction will not produce related ES symbols of significance. Thus, in conclusion, the TZD and carboxylic acid are known as the necessary parts of full PPARγ agonists without synthetic modifications. The necessity of two essential cores for full cellular activity can be immediately inferred when comparing the inactive compounds at initial synthesis.
The top-ranked ES symbols point out key ligand binding interactions
Taken together, these correspondences clearly point out that the top-ranked ES symbols are the key structural modifications surrounding molecular binding.
Discussion
Jurs_RNCG as a filter
The descriptor Jurs_RNCG acts as a filter. One might expect to see some outcomes if the single Jurs_RNCG descriptor is not included, i.e. there is no first filter of this tandem technique. Apparently, all general effects will contribute to the top-ranked ES descriptor. In Additional file 1: Table S3, for example, the top-ranked ES symbol, ssO, tyrosine moiety, of PPARγ agonists, falls outside the monitor table. In other words, we need a descriptor in the first regression that can effectively remove the general effect of a ligand cellular system. As mentioned above, one purpose for using Jurs_RNCG is to leave outliers for the second part.
Moreover, the potency orders and tendencies (signs of regression coefficients) of structural modifications coincide with our knowledge about the structural modifications of PPARγ agonists. So the top-ranked structural modifications can detect their corresponding key interactions surrounding molecular binding, as shown in the X-ray image of a potent agonist-PPARγ complex. The outcomes of such a regression-outlier analysis also tell us that the Jurs_RNCG is truly an adequate filter.
In addition, and exceeding our expectations, the Jurs_RNCG can be expressed in a linear combination of partition coefficients, polar surface area, and shape-like descriptors [33], which further reveals three essential factors for drug-cell interfaces in such a ligand cellular system [34].
Three types of dependency in the top-ranked ES descriptors
The ES symbols monitored in the Table 2 are intentionally combined in a single fitting equation
# | Equation |
---|---|
1 | Y = 4.97 − 20.8Jurs _ RNCG − 2.81 Count _ ssO + 0.38 Sum _ ssO * |
2 | Y = 3.41 − 4.50 Jurs_RNCG − 3.09 Count_ssO + 0.47 Sum_ssO * + 0.95 Sum_sssN − 0.86 Count_ssN * |
3 | Y = 3.34 − 11.1 Jurs_RNCG − 0.33 Count_ssO + 0.31 Count_sssN + 0.47 Count_aaO − 0.33 Count_sCH3 − 0.079 Count_aaN * |
Second, by removing the modification description dependency, the Count and Sum values of the same ES symbol are not in the same equation. In Equation 3 of Table 5, the symbol Count_aaN has a negative regression coefficient compared to the sign listed in Table 2. It therefore contradicts the observation that the captured symbols, aaO, aaaC, sssN, aaNH, and aaN, represent positive structural modifications to the tail part of PPARγ agonists. Obviously, in this equation form 3, the symbols aaO and sssN represent the identical key interaction in the tail part of PPARγ agonists, and the simultaneous appearance of them for the same moiety turned the regression coefficient of the additional aaN into the opposite sign. Thus, one can say here that the ES symbols aaN, aaO and sssN have dependency of description on the identical moiety.
Third, throughout all of the 46 TZD PPARγ agonists, when examining the values of the symbol aaO and related structural moieties, we found that no structural moiety contains this aaO feature aside for oxazole. The moiety oxazole exists only in the two potent agonists AD-7057 and BRL48482 [10]. The value of ES_Count_aaO is 1 for these 2 agonists whereas the value is 0 for the rest of collected PPARγ agonists. The ES symbols of oxazole have aaO, aaN, and aaCH. Four agonists have the aaN structural moiety of these 46 collected agonists and, among those four, two compounds are AD-7057 and BRL48482. Moreover, all agonists have the aaCH structural moiety, but the symbol aaCH does not appear in Table 2. Clearly, the symbols aaO, aaN and aaCH have unequal dependencies of description in these data samples.
Especially, these dependencies of descriptor will actually cause serious consequence to all QSARs of four categories (classical, 3-dimensional, decisional and orthogonal) [37–39], their existence would make a model lose its interpretability. Put together, the three types of dependencies in the top-ranked ES symbols actually play a major role in the design of the context equation. That is, two ES symbols don’t appear simultaneously in a context equation. Obviously, if one forces two dependent ES descriptors to be combined in a single equation, the signs in the regression coefficients of key structural modifications may change, and thus fail to point out the real tendency of impact to ligand binding in an analog set. If one mixes two ES descriptors in a single equation acting as a detector, the one in this regression-outlier analysis will lose its ability to correctly detect the real tendencies of key structural modifications in the given analog sets.
Conclusions
The innovative point of the present study is the fact that we used a statistical regression and its outlier as a computational technique for separation. This technique was used specifically in the ligand cellular system. As a counterpart to the hard equipment in the tandem technique, the prior molecular descriptor resembles a filter that removes the influence from the bulk of a cell and the latter class of descriptors is an array of detectors that can identify any important identities. In the case of the PPARγ agonist cellular system, the key structural modifications surrounding ligand binding were successfully detected and the tendencies of impact were examined. In the end, after the tandem regression-outlier analysis of this ligand cellular system, the results show that this prioritization of the context equations (filter) tagged with features of outliers (detector) is an effective computational tool for cheminformatics to detect possible features of outliers (key structural modifications), as well as their impact tendencies to ligand binding.
Declarations
Acknowledgements
The author, Y.-T. Lin, thanks the National Science Council of Taiwan for the financial supports (Grant No. NSC101-2113-M-037-010, NSC100-2113-M-037-010, NSC97-2113-M-037-004 and NSC95-2113-M-037-016-MY2).
Authors’ Affiliations
References
- Lin Y, Trouillon R, Safina G, Ewing AG: Chemical analysis of single cells. Anal Chem. 2011, 83: 4369-4392. 10.1021/ac2009838.View ArticleGoogle Scholar
- Chatterjee S, Hadi A: Influential Observations, High Leverage Points, and Outliers in Linear Regression. Stat Sci. 1986, 1: 379-416. 10.1214/ss/1177013622.View ArticleGoogle Scholar
- Maggiora GM: On outliers and activity cliffs–why QSAR often disappoints. J Chem Inf Model. 2006, 46: 1535-10.1021/ci060117s.View ArticleGoogle Scholar
- Johnson SR: The trouble with QSAR (or how I learned to stop worrying and embrace fallacy). J Chem Inf Model. 2008, 48: 25-26. 10.1021/ci700332k.View ArticleGoogle Scholar
- Peltason L, Iyer P, Bajorath J: Rationalizing three-dimensional activity landscapes and the influence of molecular representations on landscape topology and the formation of activity cliffs. J Chem Inf Model. 2010, 50: 1021-1033. 10.1021/ci100091e.View ArticleGoogle Scholar
- Stanton DT, Jurs PC: Development and use of charged partial surface area structure descriptors in computer-assisted quantitative structure–property relationship. Anal Chem. 1990, 62: 2323-2329. 10.1021/ac00220a013.View ArticleGoogle Scholar
- Henke BR, Blanchard SG, Brackeen MF, Brown KK, Cobb JE, Collins JL, Harrington WW, Hashim MA, Hull-Ryde EA, Kaldor I: N-(2-Benzoylphenyl)-L-tyrosine PPARgamma agonists. 1. Discovery of a novel series of potent antihyperglycemic and antihyperlipidemic agents. J Med Chem. 1998, 41: 5020-5036. 10.1021/jm9804127.View ArticleGoogle Scholar
- Neogi P, Lakner FJ, Medicherla S, Cheng J, Dey D, Gowri M, Nag B, Sharma SD, Pickford LB, Gross C: Synthesis and structure-activity relationship studies of cinnamic acid-based novel thiazolidinedione antihyperglycemic agents. Bioorg Med Chem. 2003, 11: 4059-4067. 10.1016/S0968-0896(03)00393-6.View ArticleGoogle Scholar
- Desai RC, Han W, Metzger EJ, Bergman JP, Gratale DF, MacNaul KL, Berger JP, Doebber TW, Leung K, Moller DE: 5-aryl thiazolidine-2,4-diones: discovery of PPAR dual alpha/gamma agonists as antidiabetic agents. Bioorg Med Chem Lett. 2003, 13: 2795-2798. 10.1016/S0960-894X(03)00505-5.View ArticleGoogle Scholar
- Willson TM, Brown PJ, Sternbach DD, Henke BR: The PPARs: from orphan receptors to drug discovery. J Med Chem. 2000, 43: 527-550. 10.1021/jm990554g.View ArticleGoogle Scholar
- Chittiboyina AG, Venkatraman MS, Mizuno CS, Desai PV, Patny A, Benson SC, Ho CI, Kurtz TW, Pershadsingh HA, Avery MA: Design and synthesis of the first generation of dithiolane thiazolidinedione- and phenylacetic acid-based PPARgamma agonists. J Med Chem. 2006, 49: 4072-4084. 10.1021/jm0510880.View ArticleGoogle Scholar
- Sauerberg P, Mogensen JP, Jeppesen L, Svensson LA, Fleckner J, Nehlin J, Wulff EM, Pettersson I: Structure-activity relationships of dimeric PPAR agonists. Bioorg Med Chem Lett. 2005, 15: 1497-1500. 10.1016/j.bmcl.2004.12.084.View ArticleGoogle Scholar
- Liu K, Black RM, Acton JJ, Mosley R, Debenham S, Abola R, Yang M, Tschirret-Guth R, Colwell L, Liu C: Selective PPARgamma modulators with improved pharmacological profiles. Bioorg Med Chem Lett. 2005, 15: 2437-2440. 10.1016/j.bmcl.2005.03.092.View ArticleGoogle Scholar
- Martin JA, Brooks DA, Prieto L, Gonzalez R, Torrado A, Rojo I, Lopez de Uralde B, Lamas C, Ferritto R, Dolores Martin-Ortega M: 2-Alkoxydihydrocinnamates as PPAR agonists. Activity modulation by the incorporation of phenoxy substituents. Bioorg Med Chem Lett. 2005, 15: 51-55. 10.1016/j.bmcl.2004.10.042.View ArticleGoogle Scholar
- Cai Z, Feng J, Guo Y, Li P, Shen Z, Chu F, Guo Z: Synthesis and evaluation of azaindole-alpha-alkyloxyphenylpropionic acid analogues as PPARalpha/gamma agonists. Bioorg Med Chem. 2006, 14: 866-874. 10.1016/j.bmc.2005.09.040.View ArticleGoogle Scholar
- Lu Y, Guo Z, Guo Y, Feng J, Chu F: Design, synthesis, and evaluation of 2-alkoxydihydrocinnamates as PPAR agonists. Bioorg Med Chem Lett. 2006, 16: 915-919. 10.1016/j.bmcl.2005.10.104.View ArticleGoogle Scholar
- Henke BR, Adkison KK, Blanchard SG, Leesnitzer LM, Mook RA, Plunket KD, Ray JA, Roberson C, Unwalla R, Willson TM: Synthesis and biological activity of a novel series of indole-derived PPARgamma agonists. Bioorg Med Chem Lett. 1999, 9: 3329-3334. 10.1016/S0960-894X(99)00603-4.View ArticleGoogle Scholar
- Sauerberg P, Pettersson I, Jeppesen L, Bury PS, Mogensen JP, Wassermann K, Brand CL, Sturis J, Woldike HF, Fleckner J: Novel tricyclic-alpha-alkyloxyphenylpropionic acids: dual PPARalpha/gamma agonists with hypolipidemic and antidiabetic activity. J Med Chem. 2002, 45: 789-804. 10.1021/jm010964g.View ArticleGoogle Scholar
- Rybczynski PJ, Zeck RE, Dudash J, Combs DW, Burris TP, Yang M, Osborne MC, Chen X, Demarest KT: Benzoxazinones as PPARgamma agonists. 2. SAR of the amide substituent and in vivo results in a type 2 diabetes model. J Med Chem. 2004, 47: 196-209. 10.1021/jm0301888.View ArticleGoogle Scholar
- Koyama H, Miller DJ, Boueres JK, Desai RC, Jones AB, Berger JP, MacNaul KL, Kelly LJ, Doebber TW, Wu MS: (2R)-2-ethylchromane-2-carboxylic acids: discovery of novel PPARalpha/gamma dual agonists as antihyperglycemic and hypolipidemic agents. J Med Chem. 2004, 47: 3255-3263. 10.1021/jm030621d.View ArticleGoogle Scholar
- Pinelli A, Godio C, Laghezza A, Mitro N, Fracchiolla G, Tortorella V, Lavecchia A, Novellino E, Fruchart JC, Staels B: Synthesis, biological evaluation, and molecular modeling investigation of new chiral fibrates with PPARalpha and PPARgamma agonist activity. J Med Chem. 2005, 48: 5509-5519. 10.1021/jm0502844.View ArticleGoogle Scholar
- Kier LB, Hall LH: Derivation and significance of valence molecular connectivity. J Pharm Sci. 1981, 70: 583-589. 10.1002/jps.2600700602.View ArticleGoogle Scholar
- Kier LB, Hall LH: General definition of valence delta-values for molecular connectivity. J Pharm Sci. 1983, 72: 1170-1173. 10.1002/jps.2600721016.View ArticleGoogle Scholar
- Kier LB, Hall LH: An electrotopological-state index for atom in molecules. Pharm Res. 1990, 7: 801-807. 10.1023/A:1015952613760.View ArticleGoogle Scholar
- Hall LH, Kier LB: The E-state as the basis for molecular structure space definition and structure similarity. J Chem Inf Comput Sci. 2000, 40: 784-791. 10.1021/ci990140w.View ArticleGoogle Scholar
- Hall LH, Mohney B, Kier LB: The electrotopological state: structure information at the atomic level for molecular graphs. J Chem Inf Comput Sci. 1991, 31: 76-82. 10.1021/ci00001a012.View ArticleGoogle Scholar
- The QSAR module in Discovery Studio. 2008, San Diego, CA, USA: Accelrys Software Inc, 21
- R development core team: R. 2005, Vienna, Austria: R Foundation for Statistical Computing, 2110Google Scholar
- O'Boyle NM, Morley C, Hutchison GR: Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit. Chem Cent J. 2008, 2: 5-10.1186/1752-153X-2-5.View ArticleGoogle Scholar
- O'Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR: Open Babel: An open chemical toolbox. J Cheminform. 2011, 3: 33-10.1186/1758-2946-3-33.View ArticleGoogle Scholar
- Kerwin SM: ChemBioOffice Ultra 2010 suite. J Am Chem Soc. 2010, 132: 2466-2467. 10.1021/ja1005306.View ArticleGoogle Scholar
- Basak SC, Magnuson VR: Molecular topology and narcosis. A quantitative structure-activity relationship (QSAR) study of alcohols using complementary information content (CIC). Arzneimittelforschung. 1983, 33: 501-503.Google Scholar
- Lin YT, Chen GY: A scaffold-independent subcellular event-based analysis: characterization of significant structural modifications. J Chem Inf Model. 2012, 52: 506-514. 10.1021/ci200540y.View ArticleGoogle Scholar
- Walters WP, Green J, Weiss JR, Murcko MA: What do medicinal chemists actually make? A 50-year retrospective. J Med Chem. 2011, 54: 6405-6416. 10.1021/jm200504p.View ArticleGoogle Scholar
- Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R, Rosenfeld MG, Willson TM, Glass CK, Milburn MV: Ligand binding and co-activator assembly of the peroxisome proliferator-activated receptor-gamma. Nature. 1998, 395: 137-143. 10.1038/25931.View ArticleGoogle Scholar
- Xu HE, Lambert MH, Montana VG, Plunket KD, Moore LB, Collins JL, Oplinger JA, Kliewer SA, Gampe RT, McKee DD: Structural determinants of ligand binding selectivity between the peroxisome proliferator-activated receptors. Proc Natl Acad Sci U S A. 2001, 98: 13919-13924. 10.1073/pnas.241410198.View ArticleGoogle Scholar
- Putz MV, Lacrama AM: Introducing spectral structure activity relationship (S-SAR) analysis. Application to ecotoxicology. Int J Mol Sci. 2007, 8: 363-391. 10.3390/i8050363.View ArticleGoogle Scholar
- Lacrama AM, Putz MV, Ostafe V: A spectral-SAR model for the anionic-cationic interaction in ionic liquids: Application to Vibrio fischeri ecotoxicity. Int J Mol Sci. 2007, 8: 842-863. 10.3390/i8080842.View ArticleGoogle Scholar
- Putz MV: Residual-QSAR. Implications for genotoxic carcinogenesis. Chem Cent J. 2011, 5: 29-10.1186/1752-153X-5-29.View ArticleGoogle Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.