Guangcai Maa,
Na Chenga,
Hao Sua and
Yongjun Liu*ab
aSchool of Chemistry and Chemical Engineering, Shandong University, Jinan, Shandong 250100, China. E-mail: yongjunliu_1@sdu.edu.cn; Fax: +86 531 885 644 64; Tel: +86 531 883 655 76
bKey Laboratory of Tibetan Medicine Research, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai 810001, China
First published on 22nd December 2014
WlbB, one of the enzymes required for the biosynthesis of UDP-2,3-diacetamido-2,3-dideoxy-D-mannuronic acid (UDP-ManNAc3NAcA), is an N-acetyltransferase that catalyzes the N-acetylation of UDP-2-acetamido-3-amino-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NA) to form UDP-2,3-diacetamido-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NAcA). In this paper, based on the crystal structure, the detailed reaction mechanism of WlbB has been studied by using a combined QM/MM method. In particular, six snapshots taken from MD trajectories were used as the computational models to investigate how the starting geometries influence the calculation results. Our calculations suggest that the WlbB-catalyzed process involves two sequential steps. The nucleophilic attack of the C3-amino group of the substrate on the carbonyl carbon of acetyl-CoA occurs in concert with the departure of CoA from acetyl-CoA, generating a negatively charged CoA and a positively charged intermediate, which is inconsistent with the previous proposals that the catalytic reaction undergoes an oxyanion tetrahedral intermediate. Subsequently, the sulfur anion of CoA accepts the proton of the positively charged intermediate to yield the final product. Although Asn84 is not essential, it is important for promoting the catalysis by forming a hydrogen bond with the C3-amino group to position the lone pair of the electrons of the C3-amino group in an ideal orientation for nucleophilic attack and stabilize the transition states and intermediate. The cautious selection of initial geometries was found to be important for exploring the enzymatic mechanism and getting reliable energy barriers of the reaction pathways.
Up to now, enormous quantities of NATs have been identified, which can be sorted into two major classes: the Gcn5-related N-acetyltransferase (GNAT) and the left-handed parallel β-helix (LβH) superfamilies. The former usually function as dimers, and each monomer exhibits a structurally conserved fold composed of a mixed β-sheet flanked by irregular α-helices.7,8 Members of LβH superfamily typically function as trimers and adopt a conserved LβH structure with rare left-handed crossover connections and repeated isoleucine-rich hexapeptide motif, which was first observed in UDP-N-acetylglucosamine (UDP-GlcNAc) acyltransferase.9,10 In recent years, a number of members of the LβH superfamily have been structurally and mechanistically characterized, which often operate on dTDP-, UDP- or GDP-linked sugar substrates.11–17
ManNAc3NAcA, a reasonably rare di-N-acetylated deoxysugar, is found in the outer membranes of some Gram-negative pathogenic bacteria, such as Bordetella petrii, Pseudomonas aeruginosa and Bordetella pertussis.14,18,19 In these bacteria, the precursor of ManNAc3NAcA is UDP-ManNAc3NAcA, which is synthesized by the actions of five distinct enzymes, starting from UDP-GlcNAc. The fourth enzyme, also referred to as WlbB in Bordetella petrii, is an N-acetyltransferase that catalyzes the biotransformation of UDP-2-acetamido-3-amino-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NA) into UDP-2,3-diacetamido-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NAcA) by transferring an amino group from acetyl-CoA to UDP-GlcNAc3NA, as shown Scheme 1.
The X-ray crystal structure of WlbB in complex with either UDP and acetyl-CoA or substrate and CoA has been determined to a high resolution of 1.43 Å.14 On the basis of amino acid sequence analysis, WlbB has been identified as a member of LβH superfamily of NATs. Similar to other LβH members, WlbB functions as a trimer, as displayed in Fig. 1a, and each active site is constructed at the interface of the two subunits. Structural analysis of WlbB reveals that only the side chain of Asn84 and the backbone amide group of Arg94 lie within hydrogen bonding distances to the substrate sugar ring.14
![]() | ||
| Fig. 1 (a) Crystal structure of WlbB in the presence of CoA and substrate UDP-GlcNAc3NA. (b) Selected model for the QM/MM calculations in the present study. | ||
Although all members of LβH superfamily have similar left-handed parallel β-helix structures, their active sites are remarkably different.11–16 So far, two possible N-acetylation mechanisms have been proposed. In some members, such as PglD and PerB, it has been suggested that an active site histidine residue acts as a general base to abstract the proton of the amino group of substrate to catalyze the formation of an oxyanion tetrahedral intermediate, and site-directed mutagenesis has confirmed its importance for catalysis.12,16 However, inspection of the active site structures of QdtC and WlbB demonstrates that there were no potential proton acceptors.13,14 In the active site of AntD, although Asp94 is located at hydrogen bonding distance to the sugar amino group, site-directed mutagenesis suggested that it may only play role in binding substrate, but does not serve as a catalytic base.15 In view of the experimental results, Holden and co-workers proposed that the QdtC-, WlbB- and AntD-catalyzed reactions proceed through a substrate-assisted mechanism, that is, the sulfur of CoA acts as the catalytic base to deprotonate the amino group of substrate.15 Note that in QdtC, WlbB and AntD, the substrates have been assumed in their deprotonated states to bind the active sites.13–15
Several theoretical studies have been done to explore the reaction mechanisms of human arylamine N-acetyltransferases and histone acetyltransferases of NATs.20–23 Although some experimental advances have been made in understanding the structures and mechanisms of the LβH superfamily, to the best of our knowledge, no theoretical study was reported yet. Furthermore, some key questions regarding the reaction mechanism of catalysis still remain unresolved: (1) what is the detailedreaction pathway? Whether the proposed oxyanion tetrahedral intermediate exists and whether the sulfur of CoA serves as the proton acceptor? (2) what are the energetics of the whole catalytic cycle? (3) how do the protein environments, especially some key residues in the active site influence the catalytic reaction? Answering the above questions can help us to understand the reaction mechanism of the LβH superfamily. Therefore, further theoretical study at the atomistic level is highly deserved.
In the present work, the detailed acetylation mechanism of WlbB from Bordetella petrii has been studied by using combined quantum mechanical/molecular mechanical (QM/MM) method. In recent years, QM/MM methodology has been successfully employed to investigate the chemical reactions, especially in the biomolecular systems.24–27 Previous theoretical studies of some enzymatic reactions have revealed the importance of selection of starting structure used for the QM/MM calculations.28,29 From the MD simulation, a large number of snapshots were usually observed; however, many of them might be irresponsible for the “reactive” conformations. Lonsdale et al. found that the unreasonable enzyme–substrate complex brings about unrealistically high energy barriers that are not representative of the true enzymatic reactivity.30 Actually, the QM/MM energy barriers are quite sensitive to small difference of the starting geometries. Thus, extensive analysis of the MD trajectories and careful screening of the snapshots are crucial to uncover the enzymatic reactions. Based on our calculations, the previously proposed mechanism was revised, and the effect of starting geometries on the reaction pathways was explored by extracting a series of snapshots. In addition, the energy barriers of the overall reaction, all the stable states and transition states along the reaction pathway as well as the specific role of Asn84 have been illuminated.
120 atoms, including 5086 TIP3 water molecules, as shown in Fig. S1.†
As mentioned above, the choice of initial structure can largely influence the accuracy of the QM/MM calculation results.28–30 To test how the initial structure influences the calculation results, six snapshots were randomly chosen as the initial models for the following QM/MM calculations from the MD trajectories at an interval of 3 ns from 5 ns to 20 ns, which were labeled as NAT-1 to NAT-6, respectively. These snapshots were used to explore the detailed reaction mechanism and the effects of starting structures on the reaction pathways, and to model the most reactive structure.
Unless noted otherwise, the QM region contains the glycosyl moiety of substrate UDP-GlcNAc3NA, a part of acetyl-CoA and the side chain of Asn84 (56 atoms), as shown in Fig. 2. The remaining atoms of the solvated model were assigned to the MM region. In some additional calculations, to investigate how the selection of QM region influences the calculation results, the Asn84 was left out (47 atoms) or the backbone amino group of Arg94, Gln59 and three solvent water molecules (85 atoms) were added to the QM region. During the QM/MM calculations, we defined an active region including the QM region and the MM atoms within 15 Å of the N3 atom of substrate. The active region was fully optimized, whereas all the remaining atoms were fixed. Geometry optimizations were performed with the hybrid delocalized internal coordinates (HDLC) optimizer,44 in which the quasi-Newton limited memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) method45,46 was used to locate minima and the partitioned rational function optimization (P-RFO) algorithm47 was used for transition state searches.
![]() | ||
| Fig. 2 The QM region (in ball and stick model) that contains the sugar ring of UDP-GlcNAc3NA, a part of acetyl-CoA and the side chain of Asn84. | ||
Considering six MD snapshots were used to investigate the detailed reaction pathways, we calculated the average values of the energy barriers of the pathways to analyze the results of the QM/MM optimizations. The averages were calculated according to the Boltzmann-weighted average equation:
![]() | (1) |
Energetic data for the reaction pathways of the six models are listed in Table 1. The energy barriers range from 16.9 to 22.1 kcal mol−1 for nucleophilic attack process and from 6.6 to 12.0 kcal mol−1 for proton transfer process. Ignoring the possible errors in the QM/MM calculations, the wide fluctuation of the energy barriers for these models is mainly due to the conformational variability of snapshots. The Boltzmann-averaged energy barrier is 17.9 kcal mol−1 for nucleophilic attack process and 10.4 kcal mol−1 for proton transfer process. Based on the Boltzmann-weighted average equation, the lowest barrier contributes most to the average. Note that a simple sample of six snapshots should not be expected to provide an accurate average. The effects of starting geometries on the reaction pathways will be discussed in Section 3.3. The calculated barriers show that the nucleophilic attack step is rate-limiting.
| Model | ES | TS1 | EI | TS2 | EP |
|---|---|---|---|---|---|
| NAT-1 | 0.0 | 22.1 | 10.9 | 14.1 | −11.2 |
| NAT-2 | 0.0 | 19.8 | 12.0 | 13.7 | −10.3 |
| NAT-3 | 0.0 | 20.3 | 11.4 | 13.6 | −11.1 |
| NAT-4 | 0.0 | 18.9 | 8.6 | 10.7 | −15.1 |
| NAT-5 | 0.0 | 16.9 | 6.6 | 9.4 | −13.6 |
| NAT-6 | 0.0 | 19.7 | 11.0 | 13.1 | −11.0 |
For NAT-5, the barriers are obviously smaller than those of the other models, suggesting this model should be more preferred to catalyze the acetylation reaction. Here, we utilize model NAT-5 as the example to illustrate the detailed reaction mechanism of WlbB. Optimized structures of reactant complex (ES), transition states (TS1 and TS2), intermediate (EI) and product complex (EP) for NAT-5 are shown in Fig. 3. In ES, the distances of R(N3–C) and R(C–S) are 2.90 and 1.80 Å, respectively. The acyl O atom of Asn84 forms a hydrogen bond with H1 atom of C3-amino group with a distance of 2.07 Å. This hydrogen bond induces the amino nitrogen to position its lone pair of electrons in an ideal orientation for nucleophilic attack. The detailed discussion on the specific role of Asn84 is described in Section 3.2. Two-dimensional potential energy surface (PES) for the nucleophilic attack process has been mapped out using the distances of R(N3–C) and R(C–S) as reaction coordinates (see Fig. 4). On this contour diagram, the horizontal axis characterizes the formation of N3–C bond, and the vertical axis characterizes the cleavage of C–S bond. On the basis of the optimized ES, the R(N3–C) was scanned with a decrement of 0.05 Å and the R(C–S) was scanned with an increment of 0.05 Å. In TS1, the distance of R(N3–C) is 1.73 Å, while the distance of R(C–S) is 1.97 Å. This suggests that the nucleophilic attack process proceeds through a concerted mechanism, in the transition state the N3–C bond was partially formed and simultaneously the C–S bond was partially broken. The energy barrier of this step is calculated to be 16.9 kcal mol−1, which is smaller than those of the other models. The calculated barrier is slightly overestimated compared with the experimental estimations of 15–16 kcal mol−1, which is deduced from the kcat values of 8 and 72 s−1 for the QdtC- and AntD-catalyzed N-acetylation reactions, respectively.13,15 Another local energy minimum located in the PES corresponds to the intermediate EI. In EI, the C–S bond has been completely broken, and the negatively charged sulfur of CoA adjusts its position to form a hydrogen bond with H2 atom of the protonated amino group with a distance of 2.13 Å. Compared with ES, this intermediate is quite unstable. The calculation results confirm that the sulfur of CoA functions as a general base to remove the H2 proton via a low-energy transition state (TS2). In TS2, the distance of H2–S is shortened to 1.73 Å and the distance of N3–H2 is changed to 1.21 Å. The calculated barrier of proton transfer is 2.8 kcal mol−1 relative to EI, suggesting this step is easy to occur. The final product (EP), UDP-GlcNAc3NAcA, has been demonstrated to be the substrate of 2-epimerase (WlbD), which is the last enzyme required for the biosynthesis of UDP-ManNAc3NAcA.14
![]() | ||
| Fig. 3 Optimized structures of reactant (ES), transition states (TS1 and TS2), intermediate (EI) and product (EP) for model NAT-5. | ||
![]() | ||
| Fig. 4 PES calculated at the B3LYP/6-31G** level for the nucleophilic attack process using distances R(N3–C) and R(C–S) as reaction coordinates. | ||
Based on the experimental observations and our QM/MM calculation results, we come to a conclusion that although Asn84 does not directly take part in the WlbB-catalyzed reaction, it is important for promoting the activity of the enzyme by forming hydrogen bond with the C3-amino group of the substrate. On the one hand, the hydrogen bond stabilizes the transition states and intermediate along the reaction pathway. On the other hand, the presence of Asn84 positions the lone pair of electrons of C3-amino group in a right orientation for nucleophilic attack. As mentioned above, mutation of Asn84 into Ala84 remarkably reduces the catalytic efficiency.
| ΔEQM/MM | ΔEQM+ptch | ΔEQM | ΔEMM | |
|---|---|---|---|---|
| NAT-1 | 368.1 | −17.2 | 0.6 | 385.3 |
| NAT-2 | 402.3 | −14.3 | 2.2 | 416.6 |
| NAT-3 | 215.7 | 9.3 | 0.5 | 206.4 |
| NAT-4 | 445.1 | 2.2 | −0.5 | 443.0 |
| NAT-5 | 0.0 | 0.0 | 0.0 | 0.0 |
| NAT-6 | 135.9 | 8.4 | −1.0 | 127.5 |
As aforementioned, the calculated barriers for nucleophilic attack processes range from 16.9 to 22.1 kcal mol−1 and from 6.6 to 12.0 kcal mol−1 for proton transfer processes (Table 1). This illustrates that the initial geometries remarkably influence the individual energy barriers of the WlbB-catalyzed processes. Key distances in the optimized structures of all species during the reactions for the six models are given in Table S1.† Comparison of the enzyme–substrate complexes (ESs) reveals that in NAT-1 the distances of R(N3–C) and R(O–H1) are 2.96 and 2.18 Å, respectively, which are longer than those of the other models. This result in the NAT-1 desires more nucleophilic activation energy than the others. In contrast, NAT-5 exhibits more suitable N3–C distance and stronger hydrogen-bonding interaction between Asn84 and the amino group, thereby inducing the nucleophilic attack reaction to occur more readily. Whereas in NAT-2 the distance of R(N3–C) is only 0.01 Å shorter than that of NAT-5, the hydrogen-bonding distance of R(O–H1) is much longer, which increases the energy barrier for nucleophilic attack process and in turn affirms the importance of Asn84 for catalysis. Previous QM/MM calculations have revealed that not only the active pocket residues but also the residues in considerable distance to active center have important influence on the reaction barriers.48 Although no residue is involved in the acetylation reaction, the residues and water molecules that locate within hydrogen bonding distances to the substrate and cofactor are important for substrate and cofactor binding. Undoubtedly, unreasonable conformational changes of these residues may more or less influence the enzymatic reaction. In these six models, NAT-5 shows the relatively lower energy barriers for both nucleophilic attack and proton transfer processes (16.9 and 9.4 kcal mol−1) than those of the others. Furthermore, the total energy of NAT-5 is significantly lower than that of the other models. All these results suggest that the reaction path described by NAT-5 should be the most preferred one. Given the conformational complexity of the enzyme–substrate complex and the unexpected errors in the QM/MM calculations, the careful selection of MD snapshots is vital for exploring the reaction mechanism and getting accurate results.
Six snapshots taken from MD trajectories were used as the initial models to explore the effects of starting geometries. The calculation results insist on the fact that the selection of initial geometries significantly affects the individual energy barriers of the enzymatic reaction, in other words, a reasonable starting geometry is crucial for exploring the enzymatic activity.
Despite the fact that WlbB demonstrates a lack of a catalytic base to deprotonate the amino group of substrate, some other members of LBH superfamily, such as AntD from Bacillus cereus, PerB from Caulobactercrescentus and PglD from Campylobacter jejuni, have an active site residue probably functions as the potential proton acceptor. However, whether the residue is directly involved in enzymatic reaction is still open to debate, which deserves further theoretical studies.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c4ra13278e |
| This journal is © The Royal Society of Chemistry 2015 |