Nicola J.
Knight
a,
Elsa
Hernando
b,
Cally J. E.
Haynes§
a,
Nathalie
Busschaert¶
a,
Harriet J.
Clarke
a,
Koji
Takimoto
c,
María
García-Valverde
b,
Jeremy G.
Frey
*a,
Roberto
Quesada
*b and
Philip A.
Gale
*a
aChemistry, University of Southampton, Southampton, SO17 1BJ, UK. E-mail: philip.gale@soton.ac.uk; J.G.Frey@soton.ac.uk; Tel: +44 (0)23 8059 3332
bDepartamento de Química, Facultad de Ciencias, Universidad de Burgos, 09001 Burgos, Spain. E-mail: rquesada@ubu.es
cOrganic and Polymeric Materials, Tokyo Institute of Technology, 2-12-1 O-okayama, Tokyo 152-8552, Japan
First published on 8th December 2015
The transmembrane anion transport activity of 43 synthetic molecules based on the structure of marine alkaloid tambjamine were assessed in model phospholipid (POPC) liposomes. The anionophoric activity of these molecules showed a parabolic dependence with lipophilicity, with an optimum range for transport efficiency. Using a quantitative structure–transport activity (QSAR) approach it was possible to rationalize these results and to quantify the contribution of lipophilicity to the transport activity of these derivatives. While the optimal value of logP and the curvature of the parabolic dependence is a property of the membrane (and so similar for the different series of substituents) we found that for relatively simple substituents in certain locations on the tambjamine core, hydrophobic interactions clearly dominate, but for others, more specific interactions are present that change the position of the membrane hydrophobicity parabolic envelope.
Among the identified naturally occurring anionophores, the structurally related prodiginines and tambjamine alkaloids are the most studied examples.11 These compounds show interesting pharmacological properties including antitumor activity.12,13 The synthetic prodiginine analogue obatoclax has been shown to display promising anticancer activity in the clinic.14 We have demonstrated that the ionophoric activity of these compounds is related to their cytotoxicity.15 Active ionophores are able to disrupt intracellular pH gradients and to trigger apoptosis in cancer cells.16–19
An increasing number of synthetic molecules capable of facilitating anion transport by forming lipophilic supramolecular complexes or membrane spanning channels have been reported in the literature.20–23 Despite this progress, the knowledge of the requirements for designing effective anion transporters remains poor, and identification of active derivatives is mostly based on trial/error methods. Qualitative structure–transport activity studies underscored lipophilicity as one of the most important factors influencing the ionophoric transport activity of these compounds.24 Moreover, Gale, Davis and co-workers have also introduced the concept of lipophilic balance in the design of these compounds.25 Quantitative structure–activity relationship (QSAR) approaches are widely employed in medicinal chemistry. QSAR constitutes a powerful tool to assist rational molecular design and to predict different physicochemical properties.26 Recently, we have reported a quantitative structure–transport activity (QSAR) study of the anion binding and transport of a series of 1-hexyl-3-phenylthioureas bearing various substituents at the para- positions of the aromatic ring.27 This study allowed us to determine a statistically relevant model correlating anion transport activity with parameters such as lipophilicity, the Hammett coefficient of the varied substituent and SPAN, a descriptor for molecular size. Prompted by this success we decided to perform a more ambitious study introducing several structural changes on the studied molecules. We aimed to investigate a series of effective anion transporters having a range of lipophilicity values as well as transport activities. In this regard, the tambjamine alkaloids represent ideal candidates because of their synthetic accessibility and tolerance to different substituents while remaining as potent transmembrane anion transporters. In this work we present a QSAR study of the transmembrane anion transport activity of 43 tambjamine inspired transporters, aimed to shed light on the structural design requirements to successful anion carriers and the quantification of the relationships between lipophilicity and transmembrane anion transport activity of small molecules.
Compound | EC50b | Hill parameter n | k ini | logPd | Retention time (min) | |
---|---|---|---|---|---|---|
a n.d. not determined. b molar percentage with respect to POPC, mol%. c Values calculated by fitting the plot of relative chloride release (y) versus time (x) for 0.05 mol% compound to lipid to an asymptotic function y = a − b × cx. The initial rate of chloride release (kini in % s−1) is given by −bln(c). d LogP values calculated using ALOGPs 2.1 software. e Determined via correlation between kini and EC50 (see ESI). | ||||||
1 | 0.00719 | 1.19 | 0.952 | 3.08 | 10.4 | |
2 | 0.00613 | 1.23 | 1.41 | 3.74 | 11 | |
3 | 0.00699 | 1.25 | 1.24 | 4.17 | 11.6 | |
4 | 0.00779 | 1.32 | 1.17 | 4.63 | 12.2 | |
5 | 0.0104 | 1.29 | 1.02 | 4.72 | 11.9 | |
6 | 0.00951 | 1.25 | 1.13 | 5.02 | 12.7 | |
7 | 0.288 | 0.965 | 0.0231 | 7.11 | 14.5 | |
8 | 0.0688 | 1.42 | 0.229 | 2.58 | 8.8 | |
9 | 0.0197 | 1.29 | 0.638 | 2.86 | 9.8 | |
10 | 0.0134 | 1.28 | 0.786 | 3.37 | 10.5 | |
11 | 0.0231 | 1.31 | 0.470 | 3.76 | 11 | |
12 | 0.0260 | 1.29 | 0.494 | 3.2 | 10.4 | |
13 | 0.0208 | 1.18 | 0.474 | 2.92 | 9.6 | |
14 | 0.0155 | 1.27 | 0.661 | 3.49 | 10.3 | |
15 | 0.0236 | 1.37 | 0.444 | 3.62 | 10.5 | |
16 | 0.0221 | 1.29 | 0.510 | 3.76 | 10.8 | |
17 | 0.0167 | 1.48 | 0.830 | 2.68 | 9 | |
18 | 0.0494 | 1.59 | 0.314 | 2.11 | 8.2 | |
19 | 0.197 | 0.853 | 0.0919 | 1.88 | n.d. | |
20 | 0.346 | 1.30 | 0.0368 | 1.03 | 7 | |
21 | 0.0921 | 1.08 | 0.215 | 1.55 | 7.7 | |
22 | 0.0274 | 1.03 | 0.517 | 2.03 | 8.5 | |
23 | 0.0116 | 0.860 | 0.743 | 2.46 | 9.3 | |
24 | 0.00648 | 1.18 | 1.46 | 2.99 | 10.2 | |
25 | 0.005 | 1.19 | 1.50 | 3.52 | 10.9 | |
26 | 0.00451 | 1.51 | 1.52 | 4.02 | 11.5 | |
27 | 0.00312 | 1.07 | 2.63 | 4.79 | 12.1 | |
28 | 0.0038 | 1.10 | 1.63 | 5.1 | 12.6 | |
29 | 0.0053 | 1.33 | 1.54 | 5.36 | 13.1 | |
30 | 0.00731 | 1.15 | 1.09 | 6.14 | 13.8 | |
31 | 0.0113 | 1.20 | 0.941 | 2.24 | 9.2 | |
32 | 0.00668 | 1.05 | 1.01 | 2.84 | 10 | |
33 | 0.0977 | 0.963 | 0.224 | 1.5 | n.d. | |
34 | 0.0157 | 1.32 | 0.744 | 4.4 | 11.5 | |
35 | 0.0116 | 1.20 | 0.708 | 5.94 | 13.1 | |
36 | 0.0123 | 0.857 | 0.321 | 6.46 | n.d. | |
37 | 0.0133 | 1.45 | 0.600 | 4.38 | 11.6 | |
38 | 0.00878 | 1.43 | 1.69 | 3.3 | 11.3 | |
39 | 0.0196 | 1.14 | 0.605 | 3.62 | 10.7 | |
40 | 0.00968 | 1.74 | 1.16 | 4.49 | 11.9 | |
41 | 0.00517 | 1.15 | 1.04 | 6.07 | n.d. | |
42 | 0.0204 | 0.929 | 0.420 | 6.42 | 13.9 | |
43 | 0.0616e | —e | 0.186 | 7.14 | 10.4 |
A simple plot of the transport activity, expressed as log(1/EC50), vs. ALOGPs or retention time (RT) suggested a parabolic dependence of these variables (Fig. 2a and b). The rationale behind this observation is that there is an optimum compromise in the hydrophilicity/hydrophobicity balance, which maximizes the transmembrane transport activity of a given compound.24 A too hydrophilic transporter would not partition into the phospholipid membrane whereas a too hydrophobic derivative would not be able to move away from the membrane core and thus act as a carrier. At the beginning of the modelling part of this study, a set of 38 compounds had been synthesized. However, the majority of these compounds were present in the middle of the explored ALOGPs range (values from 2–6) with only a few compounds above or below this range. Therefore, the need of including further compounds, having low and high logP values, to confirm this parabolic dependence and to avoid an excessive leverage of data corresponding to compounds displaying low activity and extreme logP values was evident. Compounds of a similar structure to the existing tambjamines were hypothesised and their ALOGPs values calculated. Those that fell in the ranges of 1–2.5 and 5–7.5 were considered suitable and suggested for synthesis. Thus, 5 additional tambjamine derivatives (numbers 19, 33, 36, 41, 43) were synthesised and measured (the additional molecules are highlighted by * in Fig. 2c). Attempts to find simple correlations between the anion transport activity and the lipophilicity of tambjamine derivatives were not satisfactory. Therefore, it was evident that a more sophisticated analysis should be made.
To cope with the leverage of the high and low logP molecules, a stratified test set selection method was employed, ensuring that compounds were selected for the low, mid and high logP ranges. However, the size of the dataset and the relatively few molecules in the strata does not allow for much flexibility in the selection. To minimise test set selection bias and maximize the information from all the molecules in the dataset, a bootstrap method was selected as a suitable method for validation of the model fits. Using the bootstrap package, boot, in R,42,43 the data were sampled from the full dataset and the statistics calculated, using a resampling of the dataset 999 times. Comparing the confidence intervals for the bootstrap fit and the linear least squares prediction highlights the reasonable robustness of the fits.
The simple parabolic two parameter model (ALOGPs, ALOGP-sq) generates the following eqn (1) with an R2 value of 0.629:
Log(1/EC50) = −0.579 + 1.203ALOGPs − 0.133ALOGPs-sq | (1) |
Increasing the number of parameters to three increased the R2 value to approximately 0.79 for the top models. All the top 20 models have an R2 value above 0.74. Summary information about the 10 best three-parameter models to the whole dataset is shown in Table 2, ranked by R2 values (additional models can be seen in ESI†).
No. des. | Descriptors | R 2 | |||
---|---|---|---|---|---|
3 | ALOGPs | ALOGPs-sq | Mv | — | 0.7901 |
3 | ALOGPs | ALOGPs-sq | J3D | — | 0.7892 |
3 | ALOGPs | ALOGPs-sq | Mp | — | 0.7836 |
3 | ALOGPs | ALOGPs-sq | nH | — | 0.7822 |
3 | ALOGPs | ALOGPs-sq | AMW | — | 0.7768 |
3 | ALOGPs | ALOGPs-sq | J | — | 0.7680 |
3 | ALOGPs | ALOGPs-sq | E3u | — | 0.7672 |
3 | ALOGPs | ALOGPs-sq | ARR | — | 0.7654 |
3 | ALOGPs | ALOGPs-sq | Density (g cm−3) | — | 0.7615 |
3 | ALOGPs | ALOGPs-sq | Surface tension (dyne cm−1) | — | 0.7571 |
4 | ALOGPs | ALOGPs-sq | nCIC | J3D | 0.8160 |
4 | ALOGPs | ALOGPs-sq | nH | J | 0.8152 |
4 | ALOGPs | ALOGPs-sq | AMW | J | 0.8151 |
4 | ALOGPs | ALOGPs-sq | AMW | J3D | 0.8141 |
4 | ALOGPs | ALOGPs-sq | J3D | Ui | 0.8140 |
4 | ALOGPs | ALOGPs-sq | Density (g cm−3) | J3D | 0.8138 |
4 | ALOGPs | ALOGPs-sq | Density (g cm−3) | J | 0.8121 |
4 | ALOGPs | ALOGPs-sq | Parachor (cm3) | nH | 0.8099 |
4 | ALOGPs | ALOGPs-sq | Molar refractivity (cm3) | nH | 0.8085 |
4 | ALOGPs | ALOGPs-sq | Polarizability (cm3) | nH | 0.8084 |
Following the ‘fit all models’ fit, confidence intervals were obtained for a selected number of models from the least-squares analysis. These models were then also run through a bootstrap method in R to obtain confidence intervals using a sampling method. Due to the distribution of the data still being heavily biased towards the middle of the ALOGPs range, we utilised a stratified selection within the bootstrap function to ensure that a selection of points from the lower and upper regions were always included.
Confidence intervals obtained from the bootstrap function were well aligned with the confidence intervals obtained from the linear fit (Table 3) (see ESI† for additional details). This suggests that the fits are quite robust. The most variation comes in the coefficient for the intercept with a much narrower range in the ALOGPs and ALOGPs-sq coefficients. However, plotting actual vs. predicted for the models gives a fairly similar appearance for all of the selection of ten models (see ESI† for details).
Coefficients | Model parameters | ALOGPs ALOGPs-sq | ALOGPs ALOGPs-sq Mv | ALOGPs ALOGPs-sq nCIC J3D |
---|---|---|---|---|
R 2 | 0.6292 | 0.7901 | 0.816 | |
Intercept | −0.579 | 3.362 | −5.105 | |
Linear fit | 2.5% C.I. | −1.165 | 1.838 | −7.579 |
97.5% C.I. | 0.008 | 4.887 | −2.632 | |
Bootstrap | 2.5% C.I. | −1.108 | 2.159 | −7.681 |
97.5% C.I. | −0.086 | 4.419 | −2.694 | |
ALOGPs | 1.203 | 1.372 | 1.284 | |
Linear fit | 2.5% C.I. | 0.903 | 1.135 | 1.056 |
97.5% C.I. | 1.504 | 1.610 | 1.511 | |
Bootstrap | 2.5% C.I. | 0.904 | 1.126 | 1.087 |
97.5% C.I. | 1.470 | 1.579 | 1.493 | |
ALOGPs-sq | −0.133 | −0.158 | −0.146 | |
Linear fit | 2.5% C.I. | −0.168 | −0.186 | −0.172 |
97.5% C.I. | −0.098 | −0.129 | −0.120 | |
Bootstrap | 2.5% C.I. | −0.166 | −0.190 | −0.173 |
97.5% C.I. | −0.093 | −0.123 | −0.116 | |
3rd parameter | −6.616 | 0.411 | ||
Linear fit | 2.5% C.I. | −9.063 | 0.057 | |
97.5% C.I. | −4.168 | 0.764 | ||
Bootstrap | 2.5% C.I. | −8.432 | 0.064 | |
97.5% C.I. | −4.473 | 0.796 | ||
4th parameter | 1.587 | |||
Linear fit | 2.5% C.I. | 0.808 | ||
97.5% C.I. | 2.367 | |||
Bootstrap | 2.5% C.I. | 0.796 | ||
97.5% C.I. | 2.330 |
As shown by the models described in Table 2, there were a large number of calculated descriptors that seemed to offer potentially useful additional descriptive power to the fits, but without any clear advantage of one over the others (apart from the clear importance of logP). This suggested that principle component analysis and partial least squares analysis might be useful. However, this led to insignificant improvements in the models, and made the contributions of the terms in the models less clear. Therefore, we sought an alternative classification approach along the lines of partial decisions trees by modelling subsets of the compounds based on the structural features of the molecules.
Splitting by ring-substituent R4 gives two groups: thirty-three compounds with a methoxy group and ten compounds with a –OBn substituent (Fig. 4a). Splitting considering R5 group gives two main groups and two points that do not fit into either the NH or NH–Ph classification. The NH group has nineteen compounds and the NH–Ph group has twenty-two compounds (Fig. 4b). Splitting by the R6 group is fairly difficult as there are a variety of different substituents. The most populated group is that in which R6 is an alkyl group, with twenty-eight compounds. The remaining fifteen compounds fit into six other groups (Fig. 4c).
Fig. 4 Plot of log(1/EC50) vs. ALOGPs splitting the dataset by different substituents: (a) ring-substituent (R4), (b) enamine-substituent (R5), (c) R-type (R6), see Fig. 3. |
The subset with the most interesting grouping involves the split by enamine-substituent R5 (Fig. 4b). From plotting log(1/EC50) against ALOGPs (assuming a parabolic relationship) we have two sets of data where the peak log(1/EC50) values appears to change between the two sets. However the optimum logP value appears to be similar for the two sets. The R-type plot shows a nice parabolic relationship for the R6 alkyl R-type, however the other groups are not populated well enough to show a proper correlation. The reason for this is that in the NH group set the main substituent that is possible is an alkyl chain. On the other hand, with the phenyl ring in NH–Ph there is the opportunity to substitute a wider variety of R-types. Since there is only a substitution at the para position it limits the number of compounds that will have the same R-type substituent. Due to this we choose to take only those compounds with an alkyl substituent and carry out modelling of the subset using the lme4 package,45,46 in R. This package allows us to use an entire dataset to fit the curve of the parabola, whilst allowing the subset of data to adjust the positioning of the curve by changing the intercept. A linear mixed effect model (lmer) was run for the subset of the compounds containing an alkyl R-type, modelling the dataset to the form log(1/EC50) = a + b × ALOGPs + c × ALOGP2, and further splitting by the substituent R4. See Fig. 5.
Taking only the OMe ring substituted compounds (20 of the 28 alkyl compounds) results in the following lmer model and plot (Fig. 6).
These models show that extending a hydrocarbon tail certainly has the classic parabolic behaviour on logP with the optimum value of logP (and the curvature) being a property of the membrane (so similar for many of the subsets). The effect of the other substituent (OMe) and (OBn) in changing the maximum value of log(1/EC50) is demonstrated but we are less clear what is driving this effect and this will be a subject for further investigation.
Fig. 7 shows that by defining several sub-groups of substituents in terms of substituent location and chemical type we are able to demonstrate the parabolic dependence on logP and begin to highlight the aspects that are a property of the membrane and those that depend on more specific interactions between the membrane and the tambjamine molecules. The parabolic dependence observed is a property of the membrane. However, each substituent series is shifted in optimal logP for transport. This evidence leads us to suggest that whilst for relatively simple substituents in certain locations on the tambjamine core, hydrophobic interactions dominate, for others more specific interactions are present that change the position of the membrane hydrophobicity parabolic envelope. The functions illustrated in Fig. 7 are presented in Table 4.
Sub group | Equation fit | R 2 |
---|---|---|
OBn.NH.alkyl | Y = −4.783 + 2.737 × ALOGPs − 0.2656 × ALOGPs2 | 0.84 |
OBn.NH–Ph.alkyl | Y = −0.2663 + 0.7575 × ALOGPs − 0.06513 × ALOGPs2 | 0.999 |
OMe.NH.alkyl | Y = −0.8097 + 1.509 × ALOGPs − 0.1707 × ALOGPs2 | 0.97 |
OMe.NH–Ph.alkyl | Y = 0.1699 + 1.088 × ALOGPs − 0.1456 × ALOGPs2 | 0.999 |
OMe.NH–Ph.halogen | Y = −6.332 + 4.936 × ALOGPs − 0.7501 × ALOGPs2 | 0.48 |
OMe.NH–Ph.O–R′ | Y = −13.52 + 9.364 × ALOGPs − 1.42 × ALOGPs2 | 0.98 |
Footnotes |
† Electronic supplementary information (ESI) available: Synthesis of new compounds, anion transport studies and details of the QSAR analysis. See DOI: 10.1039/c5sc03932k |
‡ The underlying research data for this paper are available in accordance with EPSRC open data policy from http://dx.doi.org/10.5258/SOTON/384138 |
§ Present address: Department of Chemistry, University of Cambridge, Lensfield Rd, Cambridge, CB2 1EW. |
¶ Present address: Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Mansfield Road, Oxford, OX1 3TA, UK. |
This journal is © The Royal Society of Chemistry 2016 |