G.
Rugg
and
H. M.
Senn
*
WestCHEM and School of Chemistry, University of Glasgow, Glasgow G12 8QQ, Scotland, UK. E-mail: hans.senn@glasgow.ac.uk
First published on 31st October 2017
To rationalise mechanistically the intriguing regio- and chemoselectivity patterns for different substrates of the non-haem iron/2-oxoglutarate dependent halogenase SyrB2, it is crucial to elucidate the structure of the pivotal [FeIVO] intermediate. We have approached the problem by a combination of classical and QM/MM modelling. We present complete atomistic models of SyrB2 in complex with its native substrate L-threonine as well as L-α-amino butyric acid and L-norvaline (all conjugated to the pantetheine tether), constructed by molecular docking and extensive MD simulations. We evaluate five isomers of the [Fe
O] intermediate in these simulations, with a view to identifying likely structures based on a simple “reaction distance” measure. Starting from models of the resting state, we then use QM/MM calculations to investigate the formation of the [Fe
O] species for all three substrates, identifying the intermediates along the O2 activation/decarboxylation pathway on the S = 1, 2, and 3 potential-energy surfaces. We find that, despite differences in the detailed course of the reaction, essentially all pathways produce the same [Fe
O] structure, in which the oxido is directed away from the substrate.
The NHFe halogenase SyrB215 is the type specimen of the class of radical halogenases, which activate environmentally abundant halide anions (X−) as halogen radicals (X˙) for incorporation into organic substrates. SyrB2 was the first NHFe halogenase to be characterised structurally16 and is by far the most thoroughly studied. About a dozen NHFe halogenases have so far been characterised biochemically, but only four of them also structurally.12 SyrB2 stems from the plant-pathogenic bacterium Pseudomonas syringae, where it is part of the biosynthetic pathway for the production of syringomycin E, a cyclic lipopeptide with phytotoxic and antifungal activity. Like the majority of NHFe halogenases, SyrB2 chlorinates the aliphatic side-chain of an α-amino acid. Specifically, it installs chlorine at the terminal methyl group of L-threonine (Scheme 1), before the modified residue is integrated into the growing product. The enzyme requires oxygen and 2-oxoglutarate (2OG) as co-substrates, which are turned over into CO2 and succinate.
![]() | ||
Scheme 1 SyrB2 catalyses the chlorination of L-threonine at the terminal side-chain carbon. 2-OG = 2-oxoglutarate. |
SyrB2 does not recognise free threonine as a substrate. The amino acid needs to be conjugated to a phosphopantetheine tether (see Scheme 2) and presented to the enzyme by a carrier protein. This requirement has so far prevented the experimental determination of the structure of the enzyme–substrate complex, and thus, the exact set-up and configuration of the active site with bound substrate remain elusive. The same applies to all NHFe halogenases reported to date, with one recent exception: WelO5 (and its close homologue AmbO5) chlorinate freestanding substrates, which enabled the first crystal structure of a NHFe halogenase in complex with its substrate to be obtained.17 While this result will undoubtedly further the elucidation of the mechanism of NHFe halogenases in general and WelO5 in particular, the structural details are not directly transferable. Although all NHFe halogenases share a common make-up of the active-site iron complex, WelO5 belongs to a different family that chlorinates cyclohexane rings in alkaloid scaffolds whereas SyrB2 (and most other NHFe halogenases) work on aliphatic side-chains of α-amino acids.
The mechanism of the NHFe halogenases (Scheme 3) is closely related to that of the NHFe hydroxylases, which is well studied.18 The main difference is the set of first-shell ligands: in the hydroxylases, the iron is coordinated by two His imidazoles and an Asp or Glu carboxylate, which form the 2-His-1-Asp/Glu “facial triad” motif. In the halogenases, the carboxylate is replaced by a halide. In the resting state A-H2O, the Fe(II) centre is octahedrally coordinated by two His, the chelating 2OG, chloride, and water. Substrate binding triggers water dissociation (A), opening up a free coordination site for O2 to bind (B). The O2-bound complex B undergoes O2 activation and decarboxylation (C1 and C2 denoting intermediates before and after decarboxylation, respectively), yielding the FeIV–oxido (ferryl) intermediate D. This intermediate is central for the reactivity, having the capability to abstract an unactivated hydrogen from the substrate. In the subsequent “rebound” step, the substrate radical combines with the chlorido ligand of E to form the chlorinated product.
![]() | ||
Scheme 3 Proposed catalytic cycle in NHFe/2OG dependent halogenases; the residue numbering refers to SyrB2. R is an aliphatic carbon of the substrate; R′ = (CH2)2COO−. |
Except for A-H2O,16 the structures, and in some cases the existence, of the intermediates shown in Scheme 3 are not known experimentally. The [FeO] species D at least is sufficiently long-lived to allow its spectroscopic characterisation in SyrB2 (or the close homologue CytC3), notably by Mössbauer techniques.19–22 Nuclear resonance vibrational spectroscopy (NRVS) combined with DFT modelling22 strongly suggested that D is five-coordinate, with oxido trans to His116 (as drawn in Scheme 3). Previously, it had been assumed that the oxido was formed trans to His235 and that succinate was bidentate.23–27
The overall selectivity of the reaction is determined by two steps: the (slow) H-abstraction D → E controls the position at which the substrate radical is formed and thus the regioselectivity. The (fast) rebound E → F governs the chemoselectivity: the substrate radical can in principle combine with either the OH or the Cl ligand of intermediate E, leading to hydroxylation or chlorination, respectively. With its native substrate THR, SyrB2 has an almost complete selectivity for chlorination, but in reaction with related α-amino acid substrates, it is equally competent at hydroxylation (Scheme 4). ABA (L-α-amino butyric acid or (S)-2-aminobutanoic acid), which lacks THR's OγH group, yields a mixture of hydroxylated and chlorinated product, the substitutions taking place at the terminal methyl only. NVA (L-norvaline or (S)-2-aminopentanoic acid) has an extra methylene group and is converted into the 4-Cl, 4-OH, and 5-OH derivatives.
![]() | ||
Scheme 4 The chemo- and regioselectivity of SyrB2 is dependent on the substrate: THR (L-threonine), ABA (L-α-amino butyric acid), NVA (L-norvaline); Pant = pantetheine. |
The questions as to how the enzyme controls selectivity and what factors determine the outcome for a particular substrate have been actively investigated.20–22,27,28 From these studies emerged the concept that the positioning of the substrate relative to the oxido oxygen in D (and the Cl and OH ligands in E) is decisive. If the C–H bond is close to the oxido in D, the resulting Fe–OH and substrate radical will also be close, favouring hydroxylation. However, this reactivity is suppressed by positioning the C–H further away from the oxido and closer to the Cl, trading efficiency for selectivity. This is consistent with slow H-abstraction (and overall) rates for chlorination (4.2 min−1 for THR) compared to hydroxylation (480 min−1 for 5-hydroxylation of NVA).28
The precise structure of the [FeO] intermediate D is thus pivotal for an understanding of the intriguing selectivity of SyrB2. The lack of an experimental structure with bound substrate, the conformational flexibility of the substrate, the variable coordination geometry of the iron centre, and the interactions of the active-site complex with the protein environment make this a challenging problem for computational modelling. A number of computational studies22–27,29–31 have addressed different aspects of the reactivity of SyrB2. Having established the structure of D from their NRVS/computational results,22 Solomon and co-workers recently investigated in detail the intrinsic electronic factors affecting the reactivity and selectivity of H-abstraction and Cl/OH transfer.30,31 All studies used QM (DFT) cluster models, with selected active-site residues surrounding the iron complex. The only exception are Shaik and co-workers,29 who employed a full QM/MM model. Starting from intermediate D, they delineated the role of specific interactions of the reaction intermediates with active-site residues in controlling the selectivity.
In this contribution, we present complete atomistic models of SyrB2 in complex with THR, ABA, and NVA, constructed by molecular docking and extensive MD simulations. We evaluate isomers of the [FeO] intermediate D in these simulations, with a view to identifying plausible structures based on classical simulations alone. We then use for the first time QM/MM calculations on full-enzyme models and follow the O2 activation/decarboxylation pathway B → D for all three substrates on the S = 1, 2, and 3 potential-energy surfaces. We find that, despite differences in the detailed course of the reaction, essentially all pathways produce the same isomer of D, labelled D1, in which the oxido is trans to His116.
The Gromacs suite of programs41–43 was used throughout, with the Amber ff03 forcefield44 for the protein, GAFF45,46 for the active-site complexes and substrates, and TIP3P water.47 RESP charges for non-standard residues (substrates, active-site complexes) were derived using the Merz–Singh–Kollman scheme.48 Topology and parameter files for use with Gromacs were generated using AmberTools49 and ACPYPE.50,51 The iron atom and the first shell of donor atoms were fixed by harmonic position restraints with a force constant of 104 kJ mol−1 nm−2. After preliminary energy minimisation and pressure equilibration, the simulations were run in the NVT ensemble until deemed equilibrated, followed by production runs of 17–27 ns; see the ESI† for additional details. The simulations included ca. 35000 atoms, with the protein (including cofactors) accounting for ca. 4850, the substrate for ca. 60, and the solvent for the remainder.
QM calculations were done at the DFT level with the B3LYP61–66 exchange–correlation functional (as implemented in Turbomole), complemented by Grimme's D3 dispersion correction.67 Reaction-coordinate scans and optimisations were initially done with the def2-SVP basis set, stationary points were then re-optimised with def2-TZVP.68 Reported results were obtained at the def2-TZVP level unless stated otherwise. The MM parameters were the same as in the MD simulations described above.
The QM region comprised the active-site iron complex with all first-shell ligands (with histidines truncated to 5-methylimidazoles, 2OG truncated to 2-oxopropanoate or acetate + CO2, respectively), the substrate head (truncated at the C(O)–Cα bond), and the acetate side-chain of Glu102. The latter was included to allow for proton transfers across the substrate–NH3+–Glu102 salt bridge. For THR, where the direct salt bridge was replaced by a water-mediated interaction during the MD simulations, the two bridging water molecules were also included in the QM region. In the event, we did not observe any proton transfers in any of the calculations. The QM region included 65 atoms (including 5 link atoms); the MM region contained 6802 atoms: the entire enzyme (including substrate) and a sphere of 650 water molecules within 23 Å of the iron (numbers refer to THR). See the ESI† for further details of the QM/MM setup.
Mössbauer spectroscopic parameters were calculated with ORCA69,70 at the B3LYP61–66/def2-TZVP68 level with the efficient RIJCOSX method.71 The full set of point charges from the QM/MM models was included in the calculations. To ensure an accurate description of the density at the iron nucleus, the CP(PPP) basis72 was used for iron, together with a dense radial integration grid at iron (accuracy parameter 7.0). The Mössbauer isomer shift, δ, is related to the calculated density at the nucleus, ρ0, via the linear relationship δ = α(ρ0 − C) + β, where α = −0.366 a03 mm s−1, β = 2.852 mm s−1, and C = 11810 a0−3 are fit parameters specific to the choice of functional and basis set.73 Alternative fit parameters have been derived from a larger and more diverse training set:74α = −0.424 a03 mm s−1, β = 7.55 mm s−1, and C = 11
800 a0−3.
The docking procedure was based on the X-ray structure of the SyrB2 holoenzyme in the resting state; the Fe-bound water molecule was removed, creating intermediate A. The protein was kept rigid, except for the two side-chain torsions of Phe196 and the Cζ–OH torsion of Tyr272. Flexibility of Phe196, which is located near the entrance of the channel, was deemed to be relevant as its side-chain hinders access to the channel in the X-ray structure and has been suggested to act as a “gate-keeper”.16 The OH group of Tyr272, located at the end of the channel towards the active-site cavity, has been proposed as a possible hydrogen-bonding partner of the substrate head, so rotation about this torsion was also allowed.
Table 1 lists the affinities and RMSDs from the top pose for the ten most favourable docked poses for THR as obtained from the Vina docking procedure. The structures of the three best poses are shown overlaid in Fig. 1. As is evident from the RMSDs, Poses 1–4 are structurally very similar whereas the remaining poses differ significantly. The small structural variability in Poses 1–4 stems mostly from the pantetheine tail end (see Fig. 1), which extended beyond the substrate channel into the solvent and is conformationally less constrained. Only in Poses 1–4 was the substrate's reacting methyl group placed sufficiently close to the iron complex to make a reaction appear feasible. In the other poses, the substrate head was positioned away from the reactive centre or the substrate as a whole was partially retracted from the channel.
Pose no. | Affinity/(kJ mol−1) | RMSD/Å |
---|---|---|
1 | −33.5 | 0.00 |
2 | −32.6 | 1.41 |
3 | −32.2 | 1.71 |
4 | −31.4 | 1.68 |
5 | −29.7 | 9.96 |
6 | −29.7 | 3.29 |
7 | −29.3 | 9.71 |
8 | −28.9 | 9.36 |
9 | −28.9 | 9.70 |
10 | −28.9 | 10.13 |
THR in Poses 1–4 is stabilised by hydrophobic interactions with Phe121 in the active-site cavity and Phe195 and Phe196 in the channel, in agreement with the participation of these residues in substrate stabilisation suggested based on mutation studies.75 The flexible Phe196 benzyl side chain re-oriented as expected so as to grant the substrate access to the active-site channel. Whereas previous studies26,27,29,75 adjusted the side-chain torsions manually to allow access to the channel, keeping Phe196 rigid during subsequent docking, we find that this conformational change is intrinsically favoured by the interactions with the incoming substrate.
Most relevant for the exact positioning of the reacting C–H bond are the potentially strong, directing interactions of the substrate head. The ammonium group is well set up to form a salt bridge with the side-chain carboxylate of Glu102. Indeed, THR formed a strong hydrogen bond/salt bridge between its ammonium group and Glu102 in Poses 1–3 (shown in Fig. 1) while Pose 4 lacks this interaction. This salt bridge was also present in previous docking studies,26,75 and the Glu102Ala mutant was found to be inactive,75 which supports the importance of this interaction.
While any α-amino acid substrate will feature the ammonium group in the same position and therefore be able to form such a salt bridge, THR in addition has the possibility to form a second directing interaction by hydrogen-bonding via its OγH group. We found OγH bonding either to Arg254 (Pose 1) or to Glu102 (Poses 2 and 3); no such hydrogen-bond exists in Pose 4. Previous studies found the OH group bonding to Glu10226 or Asn123.75 Tyr272, whose Cζ–OH bond was treated as rotatable to allow for hydrogen-bonding to the substrate OH or ammonium groups, did not form any such hydrogen bond in any of the poses (nor in the subsequent MD simulations).
The docking of ABA and NVA yielded sets of poses very similar to THR (see Tables S3 and S4, ESI†). Like in the case of THR, the four and five best-scoring poses for ABA and NVA, respectively, place the substrate head sufficiently close to the iron centre to make a reaction plausible. Again, these top poses are structurally highly similar, with small differences in the tail end. Comparing the top poses for the three substrates (see Fig. 2 and Table 2), they are essentially identical, featuring very similar substrate ammonium–Glu102 salt bridges and hydrophobic stabilisation from Phe121, Phe195, and Phe196.
Substrate | Method | Glu102 dSB/Å | Phe121 dRH/Å | Phe195 dRH/Å | Phe196 dRH/Å |
---|---|---|---|---|---|
THR | Docking | 1.92 | 3.64 | 3.46 | 3.36 |
MD | 3.85 | 5.44 | 3.23 | 2.86 | |
ABA | Docking | 2.10 | 3.73 | 3.52 | 3.10 |
MD | 1.82 | 5.25 | 3.10 | 2.74 | |
NVA | Docking | 2.17 | 3.75 | 3.49 | 2.16 |
MD | 1.74 | 4.42 | 2.85 | 2.95 |
The close agreement in terms of substrate conformations and positions – both within the set of best-scoring poses for any one substrate and between the top poses for the three different substrates – means that one can expect a molecular-dynamics trajectory of reasonable length, starting from any of the poses, to explore the full conformational space spanned by the set of relevant poses.
For ABA and NVA, the conformations and characteristic protein–substrate interactions as obtained from docking were largely maintained during the MD simulations; see Table 2. In particular, the salt bridge between the substrate ammonium group and the Glu102 carboxylate, which anchors the substrate head, was stable and preserved throughout.
THR, on the other hand, changed its conformation substantially over the course of the trajectory. The conformational rearrangements appeared to be controlled by interactions of the OγH group. Initially hydrogen-bonded to Arg254, it sampled other bonding partners, including water molecules associated with Tyr272–OH and the Glu102 carboxylate (see Fig. S2, ESI†). The conformational mobility of the THR head was associated with a looser interaction between its substrate ammonium group and Glu102. Instead, Glu102 formed a salt bridge with nearby Arg254. Concomitantly with cleaving the NH3+–Glu102 salt bridge, the THR head group rotated about the C2–C3 bond, which re-oriented the OH and methyl groups attached to C3 compared to the other cases (see Fig. 3). In the equilibrated MD structure, the THR OH group hydrogen-bonded to the free carboxylate tail of 2OG, which rotated towards the substrate head. (For the other substrates, the 2OG carboxylate is part of a dynamic hydrogen-bonding network involving Arg248, Lys106, Thr113, and Ser237; see Fig. S3, ESI.†) Overall, the THR complex differs from the other docked and MD-equilibrated structures in several important respects: (i) instead of the direct, strong salt bridge between the substrate ammonium and Glu102, there is a looser, water-mediated interaction; (ii) the head group with the reacting methyl adopts a different rotamer; (iii) the substrate head as a whole is placed slightly less deeply into the active-site cavity and further away from the iron centre.
![]() | ||
Fig. 4 Representative snapshot from the MD trajectory of THR-D1, illustrating the “reaction distances”, dOH and dCCl. |
As in the simulations with intermediate A before, the position and conformation of the THR head was again controlled by the interactions of its hydroxyl group (see Fig. S6, ESI†). In THR-D2 and THR-D3, where the oxido ligand points “up” towards the substrate, the OγH group is hydrogen-bonding to the oxido oxygen, which leads to a short dOH distance but relatively long dCCl distance. By contrast, in THR-D4 and THR-D5, where the chloride is pointing “up”, OγH instead hydrogen-bonded to the carboxylate tail of the succinate, as it did to 2OG in THR-A (Fig. 3, right); this results in larger reaction distances. In THR-D1, we observed two stable conformations of the substrate head: OγH was hydrogen-bonding either to the succinate tail carboxylate, like in THR-D4 and THR-D5, or to the free arm of the succinate head carboxylate (as shown in Fig. 4). The latter arrangement leads to relatively short values for both dOH and dCCl.
ABA also showed similar conformational behaviour in the simulations with D as it did with A, which in this case means that it essentially maintained the position and conformation already adopted in docking. This conformation affords relatively short reaction distances (see Table 3). The only exception was ABA-D4, where the substrate head broke free of the hydrophobic pocket in which it otherwise sits, without finding a stable conformation over the course of the simulation (see Fig. S7, ESI†).
Like ABA, also NVA largely kept the same conformation in the simulations with D, with the ammonium–Glu102 salt bridge being maintained throughout. However, the extra methylene group of NVA introduces an additional degree of freedom (i.e., rotation about C3–C4), which affects the positions of the reacting carbons C4 and C5. Two C3–C4 rotamers were observed in all NVA simulations (except for NVA-D3, which yielded a single rotamer), which remained stable over extended periods, with occasional switches between them (see Fig. S8, ESI†). At least one of the rotamers afforded relatively short reaction distances with respect to C5 in all cases (Table 3 lists the values for the more favourable rotamer in each case). Consistent with a more facile reaction at C5, reaction distances to C4 are generally longer.
We applied a very simple criterion to assess the viability of the various structural models: a substrate–isomer combination was deemed viable if both reaction distances were below 5 Å, the rationale being that if either distance is too large, the reaction would not be able to proceed without further conformational changes. As a secondary criterion, we preferred shorter reaction distances to longer ones. From this distance-based analysis, we conclude that D1 is the most likely intermediate for THR; D2 or possibly D3 (which both have the oxido ligand pointing towards the substrate) for ABA; D2 for NVA reacting at C4; and D2 or D3 for NVA reacting at C5.
These conclusions, although obtained from a purely classical modelling protocol (i.e., docking followed by MD) and a simple analysis, are nevertheless pertinent. They indicate that the [FeO] intermediate is D1 for THR (five-coordinate, oxido pointing away) and D2 for ABA and NVA (five-coordinate, oxido pointing “up”); for the latter substrates the second arm of the carboxylate may be coordinated (D3). Isomers with chloride pointing “up” can be excluded. These findings agree in essence with Wong et al.'s,22 obtained from sophisticated NRVS experiments and DFT calculations on cluster models. They concluded that the [Fe
O] intermediate must be pentacoordinate and identified D1 as the isomer in the reaction with THR. For NVA (which was not investigated experimentally but included in the modelling), they proposed isomer D2; ABA was not considered in that study.
Table 4 summarises relative energies and spin populations of the O2 complexes B; additional structural data are provided in Fig. S10 and Table S8 (ESI†). THR-B and ABA-B have a septet ground state, the triplet and quintet states lying 40–50 kJ mol−1 higher in energy. For NVA-B, the quintet was the only stable state. Electronically, these complexes are best described as Fe(III)–superoxido (O2˙−) complexes, with the unpaired d-electrons on the iron (one, three, and five in the S = 1, 2, 3 state, respectively) ferromagnetically coupling to the superoxide radical (see Fig. S11, ESI†). (In the complexes with qu(Fe) ≈ 4.2, an additional ca. 0.5 majority spins are localised on the other directly bonded ligand atoms, mostly chloride; these bring the “iron” spin count to nearly five.) NVA-5B differs from the other quintet complexes by having five, instead of three, unpaired d-electrons, antiferromagnetically coupled to O2˙−. THR-3B is an exception altogether in that it resembles more closely an Fe(II)–3O2 adduct, with four unpaired spins on iron antiferromagnetically coupling to the neutral triplet dioxygen. The differences in electronic structure are reflected in the metal–ligand distances (see Table S8, ESI†), the Fe–Op bond being particularly sensitive to the number of unpaired electrons on iron.
Substrate | S/ℏ | ΔE/kJ mol−1 | q u/(ℏ/2) | ||
---|---|---|---|---|---|
Fe | Op | Od | |||
a THR-3B lost O2 during the def2-TZVP optimisations; values refer to the def2-TZVP single point at the def2-SVP optimised structure. b NVA-B was stable only for S = 2. | |||||
THR | 1a | 44 | 3.37 | −0.73 | −0.81 |
2 | 50 | 2.91 | 0.38 | 0.52 | |
3 | 0 | 4.17 | 0.63 | 0.54 | |
ABA | 1 | 50 | 1.09 | 0.49 | 0.50 |
2 | 41 | 2.93 | 0.37 | 0.47 | |
3 | 0 | 4.14 | 0.66 | 0.63 | |
NVA | 2 | —b | 4.15 | −0.22 | −0.49 |
As is to be expected from the different electronic structures of the O2 complexes, the subsequent O2 activation/decarboxylation proceeds via different routes on the different spin surfaces. Schemes 6 and 7 show the reaction profiles for the path B → D on the S = 1, 2, 3 surfaces with THR and ABA substrates, respectively. For NVA, where only the quintet state was stable, we identified the following sequence of intermediates (with relative energies in kJ mol−1):
5B (0) → 5C2a (−183) → 5D1 (−240) |
Representative structures of the intermediates are shown in Fig. 5; structural parameters for all the intermediates are collated in Table S10 (ESI†); and the energy profiles of the coordinate scans are plotted in Fig. S13–S15 (ESI†). Fig. S16 (ESI†) shows the structure of the favoured [FeO] species 5D1 for the three substrates.
![]() | ||
Fig. 5 QM/MM-optimised structures of the intermediates encountered on the O2 activation pathway. C1, C2a, and C2c are taken from ABA (S = 2), C2b from ABA (S = 3). Structural parameters are tabulated in Table S10 (ESI†). |
Broadly speaking, two main variants of the activation pathway can be discerned; one occurring on the triplet or quintet surfaces (“low-spin”), the other on the septet surface (“high-spin”). The “low-spin” mechanism proceeds from the Fe(III)–superoxido complex 3,5B (or the Fe(II)–dioxygen complex in case of THR-3B) via nucleophilic attack of Od on the keto carbon (C2) of the 2OG ligand, forming a peroxy-bridged intermediate 3,5C1. In this intermediate, the C1–C2 bond of 2OG and also the Op–Od bond are still intact. The Fe–Ok bond has shortened, reflecting the change of the oxygen from neutral keto to a (formal) oxyanion.
In 5C1 (but not 3C1), the Fe–Op bond is significantly lengthened. Structure C1 was a stable minimum for THR (S = 1, 2) and ABA (S = 2), but was not found for NVA (S = 2).
C1 decays by cleavage of the C1–C2 bond (i.e., decarboxylation), which is strongly exothermic and leads to the Fe(II)–peroxysuccinate complex C2a, which was identified for all three substrates on the S = 2 surface and also for THR (S = 1). C2a still has an intact peroxy Op–Od bond. One-electron reduction of this bond produces the Fe(III) complex C2c. 5C2c was a stable minimum on the quintet surface for all three substrates when optimised with the smaller def2-SVP basis set. Re-optimising with def2-TZVP removed this minimum, leading directly to 5D1, except for ABA, where ABA-5C2c was stable also with def2-TZVP. This indicates that the minimum around 5C2c, where it exists, is very shallow.
For THR (S = 1, 2), ABA (S = 2), and NVA (S = 2), the final structure of the pathway was the oxido complex 3,5D1. The only exception was ABA (S = 1), which yielded the oxido isomer ABA-3D2 directly from ABA-3B, without any intermediates along the pathway.
The “high-spin” mechanism, found for THR and ABA (S = 3), proceeds from 7B again by attack of Od onto C2. Concomitantly, the C1–C2 bond is cleaved; however, the incipient CO2 does not fully dissociate but remains weakly coordinated to the iron in the resulting intermediate 7C2b. 7C2b structurally resembles 3,5C2a, featuring a peroxysuccinate moiety with regular Fe–Op and Op–Od bonds. However, 7C2b is electronically and energetically a rather different species: the CO2 moiety has radical anion character and the iron centre is oxidised to Fe(III). Energetically, 7C2b is only a few kJ mol−1 higher in energy than the initial superoxido complex. Only in the final step of the “high-spin” mechanism is the O–O bond cleaved, fully oxidising the CO2, which dissociates, and yielding 7D1. This step is strongly exothermic.
Taking these results at face value, the most likely pathway for the formation of the oxido intermediate proceeds on the septet surface: 7B undergoes facile nucleophilic addition to form 7C2b, which reacts (in what probably is the rate-determining step) to 5D1, either via the intermediacy of 7D1 or in a concerted reaction/spin-conversion step. Alternatively, 7C2b may rearrange/spin-convert to 5C2a, from where the reaction proceeds on the quintet surface, that is, by facile O–O cleavage (possibly via5C2c) to 5D1.
This mechanistic scenario agrees well with the conclusions of a recent study by Wójcik et al.,77 who compared a range of mechanistic variants proposed in the literature78–80 for the O2 activation in NHFe/2OG dependent hydroxylases (which have a 2-His-1-Asp/Glu facial triad). They carefully searched for and characterised minima, transition states, and minimum-energy crossing points, using a small QM model at the B3LYP-D3/cc-pVTZ level, validated against CCSD(T)-F12. Their preferred mechanism proceeds on the quintet surface via5B → 5C2a → 5D (with a possible, but slightly less favourable “detour” via5C1), thus resembling our “low-spin” mechanism. However, they found that a “high-spin” (septet) pathway was nearly as favourable. Equally as relevant for our purposes is their conclusion that B3LYP-D3 provides a faithful representation of the delicate spin-state energetics in these systems.
While those findings lend support to our approach and results on the O2 activation mechanism in SyrB2, one should keep in mind that the fine detail of the mechanism – the exact topographies of the energy surfaces of the different spin states – is very sensitive to the choice of method and model. This is illustrated, e.g., by the fact that the existence of intermediate 5C2c depends on the basis set; that is, a seemingly small, “technical” change, which might ordinarily be expected to have a modest effect on the relative energies, qualitatively alters the character of the potential-energy surface.
δ/(mm s−1) | |ΔEQ|/(mm s−1) | ||
---|---|---|---|
a Calculated with B3LYP/CP(PPP) for the full QM/MM models in this work. δ values were obtained using the linear fits of ref. 73 and [ref. 74]. b From ref. 26. Calculated with B3LYP/CP(PPP) for first-shell models extracted from a larger cluster. δ values were obtained using the linear fit of ref. 73. c From ref. 21 (ESI). The ratio of the two species present is given in brackets. | |||
THR-5D1 | Calcda | 0.19 [0.22] | 1.12 |
ABA-5D1 | Calcda | 0.15 [0.18] | 1.11 |
NVA-5D1 | Calcda | 0.17 [0.20] | 1.32 |
THR-5D3 | Calcdb | 0.27 | 1.31 |
THR-5D4 | Calcdb | 0.21 | 0.94 |
THR-5D | Exptlc | 0.30, 0.23 (4![]() ![]() |
1.09, 0.76 |
ABA-5D | Exptlc | 0.28, 0.24 (7![]() ![]() |
0.99, 0.66 |
While the accuracy of isomer shifts calculated with a particular fit is typically very good (∼0.1 mm s−1),73,74 the values obtained using different fits can vary by several tenths mm s−1, as illustrated in Table 5. Quadrupole splittings are predicted less accurately, to within about 0.2–0.3 mm s−1.74 Considering these error bars, the isomer shift calculated for THR-5D1 agrees well with the experimental value for the minority species. However, for ABA-5D1, which is calculated to have a significantly smaller δ, the match is much less good. On the other hand, the calculated quadrupole splittings for both THR-5D1 and ABA-5D1 agree with the experimental values for the majority species. The Mössbauer data thus neither clearly corroborate nor exclude the identification of the [FeO] species with structure D1. By contrast, the Mössbauer parameters calculated in ref. 26 for the six-coordinate isomers D3 and D4 are in remarkable agreement with experiment, which would support the assignment of the two species seen experimentally as D3 and D4. However, these structures are not consistent with the NRVS data.22 At this stage, it is difficult to reconcile the Mössbauer data with the conclusions from the NRVS/computational22 study and the present QM/MM results.
While our results agree with Wong et al.'s22 on the central question of the structure of the [FeO] intermediate for THR, it is instructive to compare their (computational) findings about the mechanism leading to its formation. They suggested that the substrate amino group played a significant role in controlling the structure of D. (Note that in their model, the substrate amino group and also the Glu102 side-chain are neutral.) For THR, both OγH and the amino group were hydrogen-bonded to Glu102, which held the amino group away from the peroxy oxygens in intermediates C, allowing the oxido oxygen to adopt the position trans to His116, thus forming D1. By contrast, for NVA, the amino group hydrogen-bonded to Op during O2 activation, holding it in the position trans to His235, which led to the formation of D2.
In our simulations, however, the direct ammonium–Glu102 interaction in THR is broken during the MD simulations, facilitated by the other interactions of OγH. In NVA, on the other hand, the ammonium–Glu102 salt bridge is maintained throughout, even when the oxido oxygen is well accessible as a bonding partner (in D2 or D3). We did not observe hydrogen-bonding between the ammonium and either of the peroxy oxygens in any of the systems. Considering the sensitivity of the mechanism, it is therefore notable that despite the differences in structure and setup (QM cluster model vs. full QM/MM, neutral vs. ionised residues, BP86/def2-SVP vs. B3LYP/def2-TZVP), the primary outcome (i.e., the formation of D1) is the same.
The QM/MM study by Shaik and co-workers,29 which started from an MD-equilibrated structure of THR-D3, considered isomerisation to D1 and D4, but the subsequent hydrogen-abstraction was found to proceed from the initial D3. The resulting [FeCl(OH)] species E was proposed to isomerise before transferring preferably the Cl onto the substrate. This is compatible with the present study in so far as D1, being the primary product of O2 activation/decarboxylation, could isomerise to D3 before the hydrogen-abstraction step. However, unless H-abstraction was faster than the D1 → D3 conversion (which it is not according to ref. 29), it is difficult to reconcile this scenario with Wong et al.'s NRVS results,22 which identify the (relatively) long-lived species D as being D1.
Another interesting comparison is with an experimental study on the structure of the O2-bound complex B that appeared while this work was already in progress. Using advanced EPR techniques and NO as a non-reactive surrogate of O2, Martinie et al.85 determined distances and angles between the NO nitrogen (standing for Op), iron, and specific hydrogens of the substrates THR, ABA, and NVA in SyrB2. While the experimental distances agree well with the structures of the QM/MM-optimised complexes B, the angles do not (see Table S9 and Fig. S12, ESI†). Based on simple docking models, Martinie et al. proposed that O2 might not bind trans to His235, i.e., at the position of the water present in the X-ray structure of the holoprotein, but trans to His116. Substrate binding, water dissociation, or O2 binding would thus trigger a rearrangement, resulting in structure B′. Precedent for such a rearrangement exists in clavaminate synthase, a 2-His-1-Asp NHFe/2OG enzyme.86
However, to the best of our knowledge, all mechanistic models of O2 activation in NHFe/2OG enzymes, the halogenases in particular, are based on structure B. (Note that B and B′ are enantiomers, so models that do not include any environment residues would not be affected.) This raises interesting new questions about the structural and electronic course of O2 activation in these enzymes.
In this contribution, we have approached the problem by a combination of classical and QM/MM modelling. In addition to the native substrate THR, we included ABA and NVA, which have been used in previous experimental and computational studies of SyrB2. Using molecular docking and classical MD simulations, we constructed complete atomistic, equilibrated models of SyrB2 in complex with these substrates and five isomers of D. We evaluated each substrate–isomer combination based on a simple “reaction distance” criterion and identified isomers D1 and D2 to be likely intermediates.
We also built equilibrated models of the substrate-bound resting state A. These served as starting points for a QM/MM investigation to identify the intermediates along the O2 activation/decarboxylation pathway B → D for all three substrates on the S = 1, 2, and 3 potential-energy surfaces. The primary outcome is that all pathways (with one exception) yield the [FeO] isomer D1 in the quintet state, in which the oxido oxygen points away from the substrate. The details of the O2 activation/decarboxylation steps are very sensitive to the choice of method and model, and their full elucidation may require further developments, especially in the area of multi-reference methods capable of treating larger systems.9,87 However, in the meantime, the present conclusions appear sufficiently robust to inform further computational and experimental efforts aimed at uncovering the factors that control the unique reactivity and selectivity of SyrB2 and related NHFe halogenases.
Footnotes |
† Electronic supplementary information (ESI) available: Additional computational details and supplementary results. See DOI: 10.1039/c7cp05937j |
‡ The trigonality index τ5 is 0.65 for D1/D2 and 0.49 for D5. τ5 = (β − α)/60°, where β is the largest of the ten valence angles around a five-coordinate centre; α is the second-largest. For a perfect trigonal bipyramid, τ5 = 1 (β = 180°, α = 60°); for an ideal square pyramid, τ5 = 0 (α = β ≤ 180°). |
This journal is © the Owner Societies 2017 |