Open Access Article
Umatur Rehman
a,
Jan-Michael Mewes
b,
Stephen J. Butler
a and
Felix Plasser
*a
aDepartment of Chemistry, Loughborough University, Loughborough, LE11 3TU, UK. E-mail: f.plasser@lboro.ac.uk
bbeeOLED GmbH, Gostritzer Str. 67c, 01217, Dresden, Germany. E-mail: janmewes@janmewes.de
First published on 27th May 2026
Anion sensing is a crucial area of research with applications in medical diagnostics, environmental monitoring and industrial analysis. Lanthanide complexes are actively studied for these purposes, particularly in biological media, as their positively charged central cations can provide strong electrostatic interactions with the respective analytes and because their unique photophysical properties allow for efficient detection. Within this work, we investigate the suitability of various computational methods to determine the geometries of europium(III) complexes and their binding energies to anions of biological relevance, putting special emphasis on the influence of conformational flexibility. We find that the composite r2SCAN-3c method produces accurate geometries at a low computational cost and that numerical stability as well as computational efficiency are increased when using a Y atom as proxy for Eu. We highlight the importance of conformational flexibility, necessitating an appropriate conformer search algorithm. Finally, when studying binding energies in host–guest complexes, we find that, despite investigating a range of methodological influences, we are unable to find a computational protocol that achieves even semi-quantitative accuracy. In summary, this work provides a solid starting point for future investigations on luminescent lanthanide(III) complexes and their anion sensing properties while also illustrating the open challenges.
The electronic structure of lanthanides is largely responsible for their unique photophysical properties,11 such as sharp emission bands25 and large Stokes shift of about 200 nm,26 making them useful to serve as sensors. Sensors based on these complexes have distinctive selectivities, and their detection limits are similar to those of commonly employed organic fluorophores like coumarin, fluorescein and rhodamine.27 However, they are considered superior probes to conventional organic fluorophores due to their very narrow emission bands having a full-width half maximum of less than 10 nm which reduces spectral interference and enhances the precision with which emission changes can be monitored during sensing.28 In addition, these complexes have long-lived luminescence, which allows sensitive time-resolved detection of emission and prevents interference from background excitation and scattered radiation.29–32
The structural design of ligands to make efficient lanthanide complexes has been an important goal of several researchers,3,33 as the properties of these complexes can be improved for more sensitive and selective detection of specific ions and molecules by carefully designing the structure of ligands around the central ion. For this purpose, quantum mechanical modelling can help by providing molecular insights and energetics of the relevant processes.
Researchers have used quantum chemical methods to identify important structural features of these complexes, for instance, the binding modes of guests to the complexes.34 The quantum chemical studies have also been proven to be useful in verifying the presence of important chemical interactions such as π–π stacking and hydrogen bonding in them.35,36 They have assisted in understanding the increased selectivity of one guest molecule over others by providing a deeper understanding of the binding geometries.37 In addition to structural aspects, theoretical methods have provided an understanding of the photophysical behaviour of the complexes, such as sensitisation of the lanthanide ion by ligands.3,38,39
Having a computational method which can accurately describe the structure of discrete lanthanide complexes is crucial for advancing their applications. Earlier work involved the development of various variants of the Sparkle model as extensions to different semi-empirical methods.40–42 Subsequently, Grimme and co-workers have shown that their general purpose GFN-xTB method outperforms Sparkle/PM6 for geometries of lanthanide complexes.43 The next step by this group amounts to the development of a force-field (GFN-FF) for lanthanides that can describe the lanthanide-to-ligand bond lengths with a mean absolute deviation of 0.14 Å.44 As an alternative option, Okayasu and Yuasa revealed that DFT based predictions, in combination with experimental techniques, can be used to determine the configuration of europium(III) complexes in solution.45
The modelling of lanthanide complexes is challenging due to their open-shell electronic structure, large number of possible conformers,20 flexible coordination geometries spanning coordination numbers usually from 7 to 10 as well as the presence of multiply charged species and the influence of solvation. Considering the importance of these complexes and the difficulties in their design process, our aim is to present a workflow that closely reflects the objectives and challenges identified in our previous studies of related complexes,34–36,46 thereby establishing an approach optimised for anion-recognition applications.
Our work begins by comparing geometries obtained using various computational methods and evaluating them against experimental crystal structures. For this purpose we choose the four experimentally synthesised europium(III) complexes [Eu.1],47 [Eu.L2],48 [Eu.L3],49 and [Eu.L5],49 based on either cyclen (1,4,7,10-tetraazacyclododecane) or TACN (1,4,7-triazacyclononane) macrocyclic scaffolds, as shown in Fig. 1. [Eu.1] has been developed as a probe for peroxynitrite and features a coordinatively saturated nonadentate ligand.47 [Eu.L2] was synthesised by Parker and coworkers from the tris-(4-bromopyridyl) europium(III) complex in a two-step, one-pot sequence and has three-fold symmetry in solution; [Eu.L3] and [Eu.L5] were also synthesised from the same precursor complex using Sonogashira coupling reactions. [Eu.1] has an approximate square antiprismatic geometry, while [Eu.L2], [Eu.L3] and [Eu.L5] have tricapped trigonal prismatic coordination geometry differing in the nature of the donor atoms (phosphinate or carboxylate donors) and peripheral substituents.48,49 After testing how well geometries are reproduced when starting from the crystal structure, we move to the more realistic setting of evaluating how well they can be determined in a completely blind approach. In this context, we emphasise the importance of appropriate conformer search algorithms. As a final step, we move to the even more challenging question of predicting binding energies and geometries in host–guest complexes. For this purpose, we choose two derivatives of [Eu.1], the complexes [Eu.pB(OH)2]+ and [Eu.mB(OH)2]+, also shown in Fig. 1.36,46 These complexes are experimentally known to bind to phosphate, adenosine monophosphate (AMP), as well as adenosine diphosphate (ADP) and we wanted to investigate their binding properties using computation.46
The basis sets 6-31G* and 6-311G** are paired with the large-core ECP52MWB,67 which also represents the f-electrons and replaces in total 52 electrons of Eu(III) and computations are performed in a pseudo-singlet configuration. The notations Y/28, Eu/28 and Eu/52 are used in the discussion to refer to the use of small and large-core potentials, respectively. We, thus, have three different levels of representing the europium atom and it is not a priori clear which one is the best. While the small core computation with Eu(III) seems to be the highest level, we note a severe limitation. The true 7F0 ground state of Eu(III) is roughly spherically symmetric due to static correlation within the f-orbitals. As a consequence, the seven f-orbitals should have an overage occupation of 6/7 electrons each. But within the present computations, six of the f-orbitals are singly occupied, and one is left empty, thus breaking the spherical symmetry. Even more, the system now contains seven different quasi-degenerate 6f-configurations with the potential of causing severe problems in terms of numerics and reproducibility. By contrast, the ECP52MWB and Y(III) computations preserve the spherical symmetry of the core and semi-core electrons per construction and are, thus, expected to be numerically more stable (as also seen in our preliminary computations).
To perform DFT calculation with ωB97M-V functional (6-31G* and 6-311G** basis sets and ECP52MWB) we used Q-Chem 6.168 while, for the other calculations, Orca 5.069–71 interfaced to libxc72 was used. In these calculations, water was included as solvent via the conductor-like polarizable continuum model (CPCM).73 Semi-empirical calculations were carried out using xTB 6.5.1.74 Water solvation in the xTB calculations is included via analytical linearized Poisson–Boltzmann (ALPB) implicit solvation model.75
To explore the conformational space of the complexes, the conformer-rotamer ensemble sampling tool (CREST version 2.12), based on a direct sampling scheme for the generation of conformational ensembles at the xTB level using a Y(III) central atom was employed.76 Settings used in CREST are: non-covalent interaction (NCI) mode, scaling factor of 0.6 for outer wall potential, metadynamics length of 50 picoseconds and energy threshold of 10 kcal mol−1 (=42 kJ mol−1). It is worth mentioning that it is important to optimise the conformations generated by CREST, at DFT level because their relative energies at semiempirical and DFT levels can be very different.77,78 If more than 50 structures were generated, only the 50 lowest-energy ones were considered for further optimisation with r2SCAN-3c to keep computational effort at a reasonable level.
Subsequently, for the anion binding energies, geometries were optimised with r2SCAN-3c and single-point energy calculations were performed with the previously mentioned methods on these optimised geometries. In addition to these methods, ωB97M-V/def2-TZVPP(D) and ωB97M-V/6-311(+)G** computations were also performed; here we used diffuse basis functions on all oxygen and phosphorus atoms as studies have indicated that diffuse functions are useful to describe interactions with anions.79,80 Computations of binding energies used the SMD81 model to include the solvent effects of water. Note that SMD is formally a more advanced model than CPCM as it does not only include a PCM contribution but also models the effect of hydrogen bonds,81 but we did not notice a notable difference in energies. Corrections for free energy contributions were included in electronic energies, from single-point Hessian with the application of biasing potential in Cartesian space to retain the initial geometry of the complex,82 with the GFN2-xTB method.
Analysis of geometries was performed with the MATILDA package as described below.83
In practice, there is not a unique definition of the twisting angle as it depends on the relative positions of 9 individual atoms. For SAP/TSAP complexes, we use the construction as outlined in Fig. 2 (left). We first place an auxiliary atom at the center of the bottom square (which is a cyclen macrocycle in the present cases). Subsequently, we compute four individual dihedral angles connecting bottom square - auxiliary - Eu - top square, iterating over the four atoms of bottom and top square. The twisting angle is now defined as the average of those for dihedrals. If the absolute angle is greater than 35° the complex is assigned SAP and if it is lower than 25°, TSAP is assigned. A borderline status is assigned to any value between 25° and 35°.
Whereas the SAP/TSAP discussions, based on square coordination geometries, are more common in the literature, we can also define a similar parameter for complexes with a trigonal structure, such as [Eu.L2], [Eu.L3], and [Eu.L5] as shown in Fig. 1. In this case, we perform the analogous operations using the bottom/middle and bottom/top triangles (Fig. 2, right). Here, we always follow along individual coordination arms of the ligand to match the corners of the different triangles.
To characterise the structures, we also use the rotational constant, following ref. 76. For the present purpose, rotational constants are used as these provide a unique and easily computable descriptor that provides some information about how similar two structures are. The rotational constants are computed in MATILDA using
![]() | (1) |
Root-mean-square deviations (RMSD) between two molecular structures are computed in MATILDA in connection with a quaternion algorithm for superimposing the structures.90 RMSDs are computed as mass-weighted averages of the individual atomic displacements.
| Eu·L–H2O + G− ⇄ Eu·L–G− + H2O | (2) |
We first consider a “raw” binding constant Kb that is derived directly from eqn (2).
![]() | (3) |
![]() | (4) |
Directly measuring these quantities from experiment is not convenient and one usually determines an apparent binding constant. Here, we use the relation
![]() | (5) |
![]() | (6) |
with the experimentally determined apparent binding constant
. Here, the concentration of water is inserted as 55.6 mol L−1, the [H+] value is chosen according to experimental pH, and the Ka of the guest is taken from literature. Whereas the right-hand-side of eqn (6) is determined by experiment, we compute the binding free energy on the left-hand-side using quantum chemistry computations in a standard rigid-rotor/harmonic-oscillator model (with rotor cut-off value of 50 cm−1) for the original reaction described in eqn (2).
We noted that the binding energies tended to be strongly overestimated and assigned this fact to the problem that the electrostatic interaction between the positively charged Eu.L complex and the guest G− may be overestimated in our implicit solvent models as solvation or desolvation of the interacting species can have great influence on the binding free energies.91,92 We attempted to correct for this effect in a way that did not significantly enhance the computational effort by including an ad-hoc correction using an Na+ counterion in the equation
| Eu·L–H2O + G–Na ⇄ Eu·L–G− + H2O–Na+ | (7) |
We investigate the four complexes [Eu.1],47 [Eu.L2],48 [Eu.L3],49 and [Eu.L5],49 as presented in Fig. 1, using various computational methods, as discussed in Section 2.1. In all cases, we start the geometry optimisation from the experimental crystal structure and investigate how much the final converged geometry deviates from this. An analysis of the resulting bond lengths is presented in Fig. 3. For this comparison, one should keep in mind that finite-temperature effects can affect the measured equilibrium bond-lengths re compared to the 0 K value r0 predicted by DFT. Typical deviations between r0 and re are on the order of 0.01 Å.96,97 Therefore, experimental re values should be considered as an upper bound to r0, meaning that the values predicted by DFT should be slightly smaller.
In the following, we investigate DFT in conjunction with the B3LYP and ωB97M-V functionals, considering that these were used for previous lanthanide studies.34,37,39,46 In addition, as computationally cheaper options, we apply the composite r2SCAN-3c method, the semi-empirical GFN2-xTB method, and the force field GFN-FF.
Fig. 3a shows the individual errors in bond lengths versus the crystal structure grouped by the type of bond (O–Eu or N–Eu); Fig. 3b shows the mean absolute error (MAE) averaged over all values in panel a. Starting with GFN-FF, the computationally cheapest method investigated here, we find an MAE of 0.101 Å, and as Fig. 3a shows there is a clear trend for overbinding with most of the bond lengths underestimated.
Proceeding to the semi-empirical GFN2-xTB method, we attempted geometry optimisations using either an Y(III) or an Eu(III) atom. The results obtained between both methods are similar but, interestingly, we obtain a slightly better MAE (0.060 Å vs. 0.068 Å) when using Y in the computations. Continuing with r2SCAN-3c, we find very accurate bond lengths with an error of about 0.028 Å (Y) and 0.024 Å (Eu). While the formal replacement of europium(III) with yttrium(III) in the computation does not have a notable effect on the MAE (always with respect to Eu(III) experiments), there is a clear difference in bond lengths as shown in Fig. 3a. Using Y the bond lengths are predominantly underestimated, using Eu they are predominantly overestimated. In line with the comments made above, a slight underestimation might actually correctly reflect finite temperature effects, thus, favouring the Y results.
Proceeding to the widely used B3LYP functional, in conjunction with the D4 dispersion correction, we find that it provides an MAE of 0.040 Å (Y) and 0.037 Å (Eu), which is somewhat increased compared to r2SCAN-3c. D4-B3LYP underestimates O–Eu distances while overestimating N–Eu distances with similar performance for using Y and Eu atoms.
Finally, we investigated the ωB97M-V functional using five different basis/ECP/central atom combinations. The use of ωB97M-V/def2-SVP and def2-ECP (28 electrons in core) with yttrium, again, resulted in an MAE of about 0.038 Å. A slight improvement is obtained when using Eu as the central atom. Increasing the basis set to def2-TZVPP has only a negligible effect. Lastly, ωB97M-V with ECP52MWB (52 electrons in core) gives values of MAE of 0.030 and 0.029 with 6-31G* and 6-311G**, respectively. Interestingly, as opposed to before, the bond lengths are now mostly overestimated.
In addition to bond lengths, we have also determined the twist angles for the same four complexes. The coordination sphere of [Eu.1] is determined by two squares and we investigate the twisting between the top square formed by the three acetate oxygens and one ether oxygen and the bottom square formed by the four macrocyclic nitrogen atoms. In the tricapped trigonal prismatic [Eu.L2], [Eu.L3] and [Eu.L5] complexes, we investigate the three triangles formed by the three oxygens of the acetate groups (top), the three nitrogens of the quinoline (middle) and the three nitrogens of the macrocycle (bottom). The values of the twisting angles are presented in Fig. 4 with the experimental reference values as solid lines and the computed results as solid shapes (circles, triangles, diamonds and squares) connected by dashed lines. The experimental twist for the squares of [Eu.1] is 23° while for [Eu.L2], [Eu.L3] and [Eu.L5] the twist between bottom and middle (ϕtribm) is slightly above or below 41° and between bottom and top (ϕtribt) is in a range of 95–101°.
Viewing first GFN-FF, we find that its predicted torsion angles are quite far away from the reference values. For [Eu.1] the torsion angle is too high, meaning that GFN-FF does no longer characterise this as clear TSAP. For the other three complexes, both ϕtribm and ϕtribt are again significantly higher than the references. GFN2-xTB also shows some clear deviations, especially concerning the ϕtribt in tricapped trigonal complexes. By contrast, all DFT methods predict ϕtribt very accurately for [Eu.L2], and only show a slight overestimation for [Eu.L3] and [Eu.L5]. For [Eu.1], the DFT results are always very close to the reference value. Interestingly, for ϕtribt in the tricapped complexes [Eu.L2/3/5] we find a zigzag pattern where twisting angles using Y as central atom are always higher than when using Eu. This trend is very pronounced for GFN2-xTB but also clearly visible for the DFT methods. Finally, we note that one should not overinterpret the results from Fig. 4. The experimental twisting angles were measured in the solid state at finite temperature whereas our computations represent the solution phase and formally 0 K. As such one cannot expect perfect agreement. However, the clear conclusion remains that the GFN-FF and GFN2-xTB results are of a somewhat lower quality whereas all DFT methods perform satisfactorily.
Summarising this section, we note that the GFN-FF and GFN2-xTB methods provide computationally inexpensive means for getting approximate geometries and that GFN2-xTB performs slightly better when using a Y atom compared to an Eu atom. Improved results are obtained using DFT and within our dataset the most accurate results are obtained for r2SCAN-3c with almost equal performance for using Y and and Eu. We will, therefore, use r2SCAN-3c as the workhorse for the remaining part of this study. Noting the almost equal performance but significant numerical advantages, we will use a central Y atom rather than an Eu atom in the geometry optimisations.
The results of this investigation are presented in Fig. 5. Here we plot the relative energy against the rotational constant (used as a generic parameter to indicate by how much two structures differ). Within Fig. 5 the hand-drawn and crystal structures are shown as a red cross and blue plus sign, respectively. Note that both of these structures are optimised using r2SCAN-3c (using hand-drawn and crystal structure as a starting point). The CREST ensembles using r2SCAN-3c and GFN2-xTB optimisations are shown using green circles and grey diamonds respectively.
Starting with [Eu.1] (shown on the upper left in Fig. 5), we find that that this complex possesses quite a dense distribution of conformers. Indeed, within an energy window of 42 kJ mol−1, a total of 357 structures were found using CREST/GFN2-xTB (shown as grey diamonds). 50 of these structures were further optimised using r2SCAN-3c, as shown using green circles. For [Eu.1] we find that the hand-drawn structure lies very closely with the crystal structure and that both are embedded within the ensemble of CREST structures. CREST produces some low-energy structures with the lowest one being 2.9 kJ mol−1 below the crystal structure when computed at the r2SCAN-3c level. This finding should be understood in the sense that in solution a range of structures are energetically accessible and that the crystal structure is similar in energy to one of the lowest energy conformers but structurally quite different.
Moving to [Eu.L2], we again find a very dense distribution of conformers with 382 structures generated below 42 kJ mol−1. Reoptimisation using r2SCAN-3c leads to a dense set of structures clustered around the crystal structure with only one structure higher up near 40 kJ mol−1. In this case, it is also noteworthy that the lowest energy conformer using r2SCAN-3c corresponds to the 19th conformer from GFN2-xTB highlighting the importance of investigating larger numbers of conformers. Strikingly, our hand-drawn structure lying at 58.4 kJ mol−1 is completely removed from any of the other structures. This finding illustrates the danger of directly using hand-drawn structures without further scrutiny which would, in this case, yield completely unphysical energies. A more detailed analysis of [Eu.L2] is presented in Fig. 6 illustrating the central part of the complex as displayed for the crystal structure, the lowest energy conformer and the hand-drawn structure. The figure illustrates that in terms of, both, energy and RMSD the lowest energy CREST conformer is quite similar to the crystal structure while the hand-drawn structure deviates strongly. It is difficult to spot the underlying structural dissimilarity and only on close inspection one finds a different arrangement of the methyl groups attached to the phosphorus atoms. For the crystal structure and CREST structure, one finds that all three methyl groups are pointing upwards yielding a roughly C3 symmetric arrangement. By contrast, for the hand-drawn structure, only one methyl group points up and the other two point sideways. The conclusion from this analysis is that subtle changes in conformation and stereochemistry can have a massive impact on the energetics. It is not a priori clear what arrangement is favourable and an automated conformer search algorithm can be greatly beneficial.
![]() | ||
| Fig. 6 Central part of the [Eu.L2] complex displayed for its optimised geometry from crystal structure, lowest energy CREST conformer and hand-drawn structure. (Energy is in kJ mol−1 and RMSD in Å). | ||
Resuming with [Eu.L3] in Fig. 5, we find that now a significantly smaller set of low energy conformers is generated using CREST/GFN2-xTB (only 33 below 42 kJ mol−1). The crystal structure coincides with the lowest energy structure from the CREST/r2SCAN-3c optimisation. The hand-drawn structure is slightly higher in energy at 3.4 kJ mol−1. Finally, the simplest case is found for [Eu.L5]. In this case only four low-energy conformers were generated. Out of these conformers, one exactly coincided with both the crystal and hand-drawn structures.
In summary, Fig. 5 highlights the challenges in working with flexible systems like europium(III) complexes. Whereas in some cases, such as [Eu.L5], it is simple to identify the lowest energy conformer, there are other cases such as [Eu.1] and [Eu.L2] where a large number of low-energy conformers are involved. Even more, subtle differences in the way structures are generated can have dramatic effects on the outcomes producing even qualitatively incorrect results as exemplified for [Eu.L2]. Specifically, we have highlighted the challenges of directly using a hand-drawn structure. To obtain more robust results, we suggest to always start with a crystal structure if available. If no crystal structure is available we would recommend a systematic conformer search.
Theoretical determination of accurate binding free energies for such systems is often difficult due to a number of challenges, such as the charged state of molecules, solvation, pH and basis set superposition error (BSSE).59,99–101 To study these complexes, we first performed conformer analysis on the water-bound europium(III) complexes, host–guest complexes (as binding of guests to the macrocyclic hosts is known to cause conformational variations102) and anions. Then, the lowest energy structure (as determined at the r2SCAN-3c level using a Y(III) central ion) was selected to compute the binding free energy. To perform computations on complexes, we have replaced Eu with Y for reasons mentioned before, and geometries of all structures were optimised with the r2SCAN-3c method. A comparison to experimental binding constants was performed following Section 2.3.
The optimised molecular geometries for the lowest energy conformer from CREST/r2SCAN-3c are displayed in Fig. 7. These show the binding of host europium(III) complexes to water, hydrogen phosphate, AMP and ADP. Here, the binding of AMP and ADP is monodentate (binding through only one phosphate oxygen). The guest anions are shown on the right side of the complex, binding to the central cation via one of their negatively charged oxygen atoms. Selected bond lengths and the SAP twist angles are listed below.
Starting with the water-bound [Eu.pB(OH)2]+ complex in Fig. 7a, we note that all Eu–N bonds are in the range of 2.5–2.6 Å. There is more variation with the bonds involving oxygen. The Eu–O(ac) bonds involving the charged acetate groups are the shortest (2.3 Å), followed by the bonds to water (2.4 Å). Conversely, the Eu–O(e) bond involving the ether is significantly longer highlighting the weaker interaction with this group.
Upon binding of HPO42− the binding geometry around the europium atom stays largely intact but some crucial adjustments to bond lengths occur, as shown in Fig. 7b. Most importantly the Eu–O(p) bond (2.24 Å) involving phosphate is significantly shorter than the analogous Eu–O(w) bond involving water was in the previous example (2.39 Å) reflecting the enhanced charge on this atom. As a consequence the Eu–O(a) bonds are slightly elongated. We also find that one of the macrocyclic nitrogens is now farther out at 2.71 Å while the distance to the ether oxygen is now only 2.81 Å reflecting some further changes in the coordination sphere. Viewing the three-dimensional structure in Fig. 7b, we also note that the phenylboronic acid group is now pointing in a different direction as compared to before highlighting the influence of the conformer search procedure.
Moving on to the AMP and ADP complexes (Fig. 7c and d), we again note that the overall coordination geometry remains intact, but also observe some changes in bond lengths. Especially, we note an increase in the Eu–N(q) and Eu–O(e) bond lengths, indicative of steric crowding caused by the larger size of AMP and ADP compared to water and HPO42−. Conversely, the distance of the phosphate oxygen is similar to the HPO42−-bound [Eu.pB(OH)2]+ complex. Interestingly, there is also a change in the twist angles for the lowest energy conformers from −37° to 25° changing the formal classification from SAP to TSAP. Reviewing finally the phenylboronic acid group in all four complexes in Fig. 7, we note that it is arranged in quite divergent directions in all four cases highlighting again the comformational flexibility of this complex.
Next, we proceed to the [Eu.mB(OH)2]+ complex, as shown in the same Fig. 7. We generally observe similar trends as in the case of the para isomer but some notable differences are present. Viewing the water bound complex, we note that the phenylboronic acid group is now pointing away, apparently to form a hydrogen bond with the associated acetate group. In addition, we find that the Eu–O(e) bond is significantly shortened for meta compared to para (2.83 Å vs. 3.08 Å). Comparing the phosphate bound [Eu.mB(OH)2]+ complex in Fig. 7f with its para counterpart, we find a different arrangement of the phenylboronic acid group, which is now pointing toward the phosphate group to form a hydrogen bond. This goes along with a quite dramatic elongation of the formal Eu–O(e) bond, which is now 3.15 Å. Proceeding to the AMP bound [Eu.mB(OH)2]+ complex in Fig. 7g, we note that this stands out due to its particularly short Eu–O(p) bond of only 2.217 Å. Viewing finally the ADP bound [Eu.mB(OH)2]+ complex in Fig. 7h, we find that the coordination geometry is quite similar to the para analogue but that the conformer adopted by ADP is quite different with a more folded arrangement for meta and a more linear arrangement for para.
To summarise the results from Fig. 7, we note that binding of the analytes leaves the coordination geometry around the europium atom largely intact, with only some of the bond lengths being altered. At the same time, the conformational flexibility of the ligands is illustrated, particularly through the phenylboronic acid group, which is moved in various directions possibly to optimise hydrogen bond interactions. The variability in the structures illustrates the importance of using a conformer search algorithm when computing the geometries of host–guest complexes. Finally, we note that the lowest energy conformers as shown here, cannot be assumed to be the only relevant conformers considering the existence of a number of similar energy conformers. Nonetheless, we believe that the presented results provide a reasonable representative binding geometry.
Having discussed the geometries, we were next interested whether it is possible to obtain accurate binding free energies. Experimentally determined values [following eqn (6)] are presented in the first column of Table 1. There is not much variation in these values, all lie around 35 kJ mol−1, with a slightly higher preference of [Eu.pB(OH)2]+ for ADP and somewhat less for HPO42−.
| Complex | Anion | Exp.a | Drawnb | CRESTc | Na+ d |
|---|---|---|---|---|---|
| a Experimental binding energy computed via eqn (6) using binding constants from ref. 46.b Hand-drawn and optimised without conformer search.c CREST conformer search carried out for each component.d Estimation of the effect of Na+ counterions via eqn (7) (including CREST conformer search) | |||||
| Para | [HPO4]−2 | −31.1 | −79.4 | −79.2 | −34.3 |
| AMP | −35.1 | −81.0 | −67.3 | −26.7 | |
| ADP | −37.1 | −81.9 | −67.3 | −15.0 | |
| Meta | [HPO4]−2 | −31.2 | −78.2 | −91.7 | −46.9 |
| AMP | −34.3 | −84.2 | −79.6 | −38.9 | |
| ADP | −34.2 | −49.5 | −71.8 | −18.7 | |
We evaluated three different strategies for the computation of the binding free energies. In the first approach, we optimised the hand-drawn europium(III) complexes directly using r2SCAN-3c. The second method includes performing conformer search on complexes using CREST and then optimising the resulting conformers at the r2SCAN-3c level. Subsequently the lowest energy conformer (at r2SCAN-3c level) is used for calculating the binding free energy. The results from these two approaches, as presented in Table 1 in the columns “Drawn” and “CREST”, are clearly not satisfactory severely overestimating binding energies producing values about twice as large as they should be. This overbinding could be attributed to the fact that charged ions are screened in solution, due to the effect of the solvent molecules, and interact with counterions. To account for this effect, we apply an ad hoc correction by considering the binding of an Na+ counterion to the analyte anion in the computation following eqn (7). Interestingly, this correction does indeed lead to the values being close to experimental results. However, the qualitative trends and relative numbers are still not satisfactory.
Noting that none of the above methods produce satisfactory results, we now investigate the effect of different computational methods. Fig. 8 presents energies with the inclusion of the Na+ counter ion, i.e., via eqn (7). We investigate the same set of methods as above but do not include the small-ECP Eu computations as these produced quite erratic results due to numerical instabilities related to the 6f electron configuration. Conversely, we included two more methods, denoted ωB97M-V/def2-TZVPP(D) and ωB97M-V/6-311(+)G** including diffuse basis functions on all the O- and P-atoms. In Fig. 8, the r2SCAN-3c values represent the results also shown in Table 1. The other results are based on the same geometries using single-point computations with the respective method. We first note that GFN2-xTB produces extremely high binding energies (around 300 kJ mol−1, not shown here). Continuing with D4-B3LYP/def2-SVP and ωB97M-V/def2-SVP, we find that some of the energies are severely overestimated reaching above 70 kJ mol−1 in three cases, each. We can rationalise this overestimation in terms of basis-set superposition error (BSSE) and, indeed, when using the larger def2-TZVPP basis set in conjunction with ωB97M-V, binding energies are strongly reduced, more in line with r2SCAN-3c. Increasing the basis set further and adding diffuse functions (def2-TZVPP(D)) has only a slight further influence. Interestingly, when using ωB97M-V/6-31G* in conjunction with ECP52MWB, we do not observe strong overbinding despite the fact that 6-31G* is of polarised double-ζ quality analogous to def2-SVP. Increasing the basis set to 6-311G** now somewhat increases the binding, which is the opposite trend to what would be expected if the BSSE between Eu on phosphate is the main influence and what was observed with the def2 basis sets. The addition of diffuse functions in the basis set (6-311(+)G**) somewhat lowers binding energies putting them on the right order of magnitude.
![]() | ||
| Fig. 8 Binding free energies of anions with europium(III) complexes (using Na+ counterions) determined with different computational methods. | ||
Finally, we briefly discuss the results obtained when not using a counterion, as shown in Fig. S2. In this case, the binding energies are even more severely overestimated, as also seen in Table 1. Interestingly, GFN2-xTB underestimates the binding energies while all other methods overestimate them.
Reviewing the results presented here, we are left with the conclusion that while geometries can be meaningfully reproduced, none of the presented computational approaches yields satisfactory results for binding energies. We discussed two potential sources of error, (i) use of the wrong conformer and (ii) BSSE. We showed that these can be avoided via (i) an appropriate conformer search algorithm and (ii) larger basis sets and/or appropriate BSSE corrections (such as the gCP correction in r2SCAN-3c). However, even after including these, the results are neither of the correct order of magnitude nor reflect the experimentally observed trends. We hypothesise that the missing contribution lies in the solvation of the multiply charged ions investigated here. Apparently, the relevant solvation effects are not fully described by the SMD continuum model employed. The ad hoc addition of an Na+ ion moves the results in the right direction, but this approximation is too coarse to produce quantitative results. A more realistic solvation environment might be obtained via dynamics with explicit solvent or free energy perturbation approaches. However, such approaches are out of the scope of this work.
Starting with geometries, we find that the composite DFT-based r2SCAN-3c method comes out as the optimal approach being both accurate and computationally efficient. We also find that substituting Eu for Y does not noticeably affect the results while providing significant advantages in terms of numerical stability and computational efficiency. To reduce computational cost, the GFN2-xTB and GFN-FF methods were shown to be effective albeit with the downside of notably increased errors in bond lengths and twisting angles.
Progressing to the influence of conformational flexibility, we illustrated that some of the investigated complexes possess a large number of low-energy conformers highlighting the importance of sampling these appropriately to get quantitative results. Even more, we noticed in one instance that simply using a hand-drawn structure led to qualitatively wrong results and an energy overestimated by more than 50 kJ mol−1. We found CREST to be a suitable conformer search tool noting, however, that the generated structures had to be reoptimised at a higher level for an accurate estimation of the energetic ordering and we used r2SCAN-3c for this purpose.
We concluded this work with a case study on anion binding, considering phosphate, AMP, and ADP, using two previously synthesised probe complexes. First, we studied the host–guest structures obtained with the different anions illustrating changes in bond distances, twist angles and more wide ranging changes in conformer structure upon binding, underscoring the structural variability of these complexes. Subsequently, we investigated the possibility of computing binding energies. Despite investigating a variety of protocols and computational methods, we were not able to pinpoint any approach that yielded satisfactory results. We have highlighted the importance of basis set superposition error and conformational flexibility both of which, when neglected introduce strong errors. However, even after taking both into account appropriately, the binding energies are still strongly overestimated. We attribute the remaining error to challenges of appropriately describing charged anions in solution, especially as far as charge screening and the effect of counterions is concerned. Describing such effects would require dynamics simulations or free energy perturbation approaches, both of which are beyond the scope of this work. We are, thus, left with the conclusion that within standard quantum chemistry approaches the description of binding energies is out of reach in these systems.
To summarise, we conclude that r2SCAN-3c proves to be an effective method for obtaining accurate geometries of europium(III) complexes. In connection with CREST, also conformationally flexible systems can be treated well. Accurate binding energies are unfortunately out of reach with the presented methodologies. As such we believe that the presented methods are beneficial for rationalising and illustrating experimental results. However, at this point it is not possible to predict binding affinities, e.g., to develop anion selective probes. Nonetheless, we hope that the results presented here serve as a good representation of the state of the art and will serve as an inspiration to develop more accurate methods.
Supplementary information (SI): detailed derivation for binding energy (Section S1), structures of anions (Fig. S1), complexation of sodium cation with guests (Fig. S2), and binding energy without sodium cations (Fig. S3). See DOI: https://doi.org/10.1039/d6cp00457a.
| This journal is © the Owner Societies 2026 |