Open Access Article
Wentao
Xu
,
Tianyu
Guo
and
Haibin
Su
*
Department of Chemistry, Laboratory of Theoretical and Computational Chemistry, The Hong Kong University of Science and Technology, Hong Kong, China. E-mail: haibinsu@ust.hkhaibinsu@ust.hk; Tel: +86 852 2358 7388
First published on 21st January 2026
The conformational transition of the receptor binding domain (RBD) of the SARS-CoV-2 spike (S) protein from the “closed” to the “open” state is an important dynamic event influencing the infectivity of the virus. However, how accumulating mutations reshape these conformational dynamics during viral evolution remains to be clarified. In this study, we observed the motion of the RBD of fully glycosylated S protein trimers from wild type to Omicron (BA.2, BA.4&5) variants by using all-atomic molecular dynamics simulations. Our study indicates that, although a fully converged free energy landscape in all motion directions was not obtained, analysis of the molecular dynamics trajectories reveals significant differences in the RBD motional patterns among variants due to mutations. Studies have shown that specific surface mutations of the Omicron variant exhibit a strong coupling effect with glycan dynamics. This effect remodels the “glycan gate” conformation mediated by N165 and N343 glycans and subsequently regulates the motion of the RBD. These findings elucidate the molecular mechanism between viral surface mutations and glycan dynamic regulation, providing a new theoretical basis for understanding the evolutionary adaptability and biological function of the virus.
Various experimental and computational methods have been employed to characterize the dynamic behavior of the S protein. Among them, the single-molecule fluorescence resonance energy transfer experiment observed and quantified the dynamic equilibrium of the RBD, revealing that viral variants (such as D614G) significantly shifted the conformational equilibrium towards an open state that is more conducive to binding than the wild-type (WT) strain.10,11 Meanwhile, cryo-electron microscopy studies also provided structural evidence that the D614G variants favor a more open conformation by disrupting an inter-protomer salt-bridge contact present in the WT strain.12–15 Other key single mutations, like K417N, were also shown to impact the stability of the S protein's open state.16 Furthermore, the hydrogen deuterium exchange mass spectrometry technique was used to analyse the flexible region of the S protein and further indicated that the Delta variant's spike tended to have an open conformation, but it was also found that the RBD of the early Omicron variant unexpectedly leaned more towards the closed conformation.17
Molecular dynamics (MD) simulation combined with enhanced sampling methods and free energy calculation is a powerful tool for studying the conformational dynamics of S proteins at the atomic level.18–24 For example, free energy calculations via 2D umbrella sampling have been used to explore the impact of a single mutation on this transition process, providing insights into the dynamic evolution of spike proteins, even when the specific mutations studied did not become prevalent variants.19 Other work focused on the influence of distant residues, demonstrating that the analysis of residue motion synergy from MD trajectories can provide profound insights for predicting the outcomes of distant mutations.18 Furthermore, research on the surface glycans of the spike protein has innovatively revealed their key regulatory role in the conformational dynamics of the RBD.25,26 Studies combining conventional molecular dynamics simulations with experimental validation first demonstrated that N-glycans at sites N165 and N234 play an essential structural role by stabilizing the RBD in the open conformation.26 Subsequently, a distinct mechanistic contribution was uncovered through weighted ensemble simulations, which showed that the N343 glycan acts as a “glycan gate”, actively facilitating the closed-to-open transition by interacting with the RBD surface.25 These findings highlight two different aspects of glycans' influence: stabilization of the open state and facilitation of the opening transition. This understanding has been quantitatively refined by a 2D free energy surface constructed using replica exchange umbrella sampling, which showed that glycans increase the energy barrier to opening and highlighted the roles of N165 and N343 in stabilizing both the up and down states.21 Further investigations have explored how the specific type and size of glycans at these key sites (N165, N234, N343) fine-tuned the stability of the open RBD, and examined the evolutionary implications of changes in the glycan shield topology, such as the loss of the N370 glycosylation site in SARS-CoV-2.22 These important studies have established that factors, from key mutations such as D614G to specific glycans such as N165, can play crucial regulatory roles in the S protein conformational dynamics.
However, virus evolution is not a simple accumulation of isolated factors.27,28 The challenge of current research lies in how these mutations can work in synergy with the dynamic regulation of glycans to jointly reshape the conformational landscape of the S protein of the variants of concern. Solving this challenge is precisely the key to answering the fundamental question mentioned in the beginning. In this study, we used all-atom metadynamic simulations of the fully glycosylated SARS-CoV-2 spike protein trimer to investigate how the RBD opening mechanism has evolved across major variants (from wild type to Omicron BA.2 and BA.4/5), and to analyze how surface mutations and glycan dynamics collectively shape this process, particularly at N165 and N343, reshapes the conformational landscape of the spike protein. Through this analysis, we aim to elucidate the connection between surface mutations and glycan dynamics, thereby establishing a mechanistic basis for understanding the structural and functional evolution of the virus.
With these defined sequences, we then generated the structures through homology modeling. Initially, utilizing wild-type models from the CHARMM-GUI COVID-19 Archive29,30 as templates, we incorporated the VOC-specific amino acid mutations using the MODELLER software package.31 For glycosylation patterns, we integrated recent comprehensive analyses by Shajahan et al.,32 which comparatively examined N- and O-glycosylation sites across wild-type, Delta, and Omicron spike proteins. In contrast to the wild-type strain, the N17 site in Delta and the N17, N74, and N149 sites in Omicron BA.2 were maintained in non-glycosylated states. Detailed information of mutations and glycan sequences are stored in 3_Additional_Information.xlsx. These structural modifications were implemented through the glycan reader & modeler module of the CHARMM-GUI online platform. The initial coordinates for the fully-glycosylated Spike protein system were prepared using the CHARMM-GUI. The protonation states of all titratable residues were assigned assuming a physiological pH of 7.0. The system was then solvated in a periodic boundary box with TIP3P water molecules, ensuring a minimum distance (20 Å) between the protein and the box edges. Potassium and chloride ions were added to neutralize the total system charge and to achieve a physiological salt concentration of 0.15 M. Both the protein and the attached N-glycans were described using the all-atom CHARMM36 force field.
We conducted three independent repeated simulations for each variant, with each simulation lasting for 400 nanoseconds (400 ns * 3 replicas). Particularly for the BA.2 variant that exhibits a unique open mechanism, we performed an additional five repeated simulations to further verify its behavior. A detailed PLUMED file has been uploaded to 3-METAD.dat in the github repo.
![]() | (1) |
![]() | (2) |
![]() | (3) |
is the inverse thermal energy and V(s,ti) is the bias potential deposited at time.
0.9 kcal mol−1) on the reconstructed free energy surface (such as the WT protein as shown in Fig. 1B) as accurate kinetic parameters is inappropriate.
To this end, we adopted a more robust analysis strategy, shifting the focus from the pursuit of absolute accuracy in energy values to capturing the topological differences in the conformational energy landscapes among different variants. We interpret the “low energy” regions as corresponding to thermodynamically stable and highly sampled conformations. Therefore, FEL is regarded as a powerful tool for analysing the motion patterns of RBD in MD trajectories, rather than a quantitative indicator for summarising the conformational changes of RBD. Such an analytical strategy has also been demonstrated in previous studies on the dynamics of RNA conformation.48 Based on this, the minimum energy path connecting the closed state and the open state is defined as the main pathway of conformational opening (opening path, see Fig. S3). We believe that even if the free energy values do not fully converge, significant changes in the FEL topological structure (such as the position, shape, and relative depth of the energy wells) can reliably reveal the differences in conformational intermediates and dynamic behaviors among different variants. To achieve standardised comparisons among variants, we further project FEL onto a two-dimensional space defined by the conformations and the RMSD of the reference “open” and “closed” states (WT shown in Fig. 1C). This projected space constitutes the basis for our cross-variant comparative analysis (Fig. 2).
Based on RMSD FELs, the focus of our analysis is on the occupancy ratio of the “open” and “closed” states. This metric is defined as the proportion of low-energy conformations within each energy basin (the state boundaries are shown in Fig. S1) on the RMSD free energy surface. It directly quantifies the thermodynamic preference of the system for each macroscopic state and the diversity of the conformational ensemble. The quantitative occupancy ratio results of each variant have been labelled on the respective free energy surfaces in Fig. 2. By focusing on the relative changes in state occupancy rather than the absolute energy barriers, the optimisation mechanism of the dynamic behavior of the S protein under evolutionary pressure can be reliably elucidated.
Although variants like Delta, which are pre-Omicron variants, evolved through an optimized conservative kinetic pathway, the emergence of Omicron introduced a major innovation – a new path on the open pathway that features the appearance of an intermediate state. As shown in the geometric FEL of BA.2 (Fig. S2), a stable intermediate state that is far away from both closed and open states was formed, and it can be inferred that this intermediate state significantly reduced the transition barrier from closed toward open. However, this kinetic advantage is deceptive and is accompanied by a thermodynamic cost. Although this intermediate state is easy to enter, the system is difficult to escape from this “trap” to reach the fully open conformation. This is quantitatively reflected in the low energy occupancy rate of the open state of BA.2, being approximately 9.8% (Fig. 2), which is comparable to that of the original WT strain. This mechanism difference provides a reasonable explanation for the puzzling finding observed in the HDX-MS17 experiment, where the early Omicron variants have a lower tendency to be open compared to their predecessors. The conformational dynamics of BA.4&5 represent an integrated feature, resolving the dilemma of BA.2. The key difference from BA.2 lies in the topological structure of its energy landscape: the geometric FEL of BA.4&5 exhibits a smoother energy slope (Fig. S3), which facilitates a gradual transition to the open state. The occupancy rate of its open state has climbed to over 60% (Fig. 2), also strongly supporting this. By combining the kinetic accessibility provided by the intermediate state driving path with a thermodynamically favourable landscape conducive to extensive exploration, the BA.4&5 variants achieve truly efficient RBD opening. This suggests that the overall topological features of the landscape during the optimization of dynamic properties, including the smoothness of the pathway and the presence of intermediate-state traps, perhaps play a more critical role in determining the efficiency of opening. Next, we will attempt to explain the changes in the opening mechanism of different variants through protein structure analysis.
The representative opening pathways, illustrated in the right panel of Fig. 2, provide primary structural insights into the dynamic differences, the core of which lies in the dynamic behavior of key glycans surrounding the RBD. In both Delta and BA.4&5, as the RBD opens upward, the N165-glycan forms dynamic contacts with the N343-glycan on the lower side of RBD, establishing a “bridge” structure, a key feature of the “with N165–N343 glycan contact” mechanism illustrated in Fig. 4. This observation of N165 and N343 glycans cooperatively stabilizes the open state, building upon and refining previous findings that highlighted their individual and collective roles in modulating the S protein's conformational energy landscape.21,26 In contrast, during the opening process in BA.2, the N165-glycan remains adhered to the RBD surface and fails to establish effective interactions with the N343-glycan, defining the “without N165–N343 glycan contact” mechanism (Fig. 4). The absence of this bridging structure likely contributes significantly to the thermodynamic instability observed in the BA.2 open state. This suggests that viral amino acid mutations can reshape RBD opening dynamics through allosteric modulation of the protein–glycan interaction network, indicating that the well-known “glycan gating” phenomenon plays a central role in this evolutionary trajectory.21,25,26
To investigate the molecular mechanisms driving the differential glycan dynamics among the variants, we focused on key amino acid mutations on the RBD surface. Detailed analyses of the glycan–RBD interactions for the Delta, BA.2, and BA.4&5 variants are organized in Fig. S3, Fig. 3, and Fig. S4, respectively. These complex dynamic data can be distilled into two distinct, variant-specific RBD opening mechanisms, which are illustrated as intuitive models in Fig. 4. These models, classified as “with N165–N343 glycan contact” and “without N165–N343 glycan contact”, demonstrate how mutations directly regulate the motional pathway of the N165-glycan by remodeling the local amino acid–glycan interaction network. The quantitative evidence supporting these mechanistic models comes from our detailed analysis of interaction frequencies, presented as histograms in Fig. S6. This figure quantifies the evolving dynamic contacts between key mutated residues and the N165-glycan (top, blue) or N343-glycan (bottom, orange) along the RBD opening pathway, providing the underlying data for the mechanisms shown in Fig. 4.
In the BA.2 variant, the Q493R mutation at the edge of the receptor-binding motif (RBM) imparts a stage-dependent regulatory effect on the N165-glycan, showcasing a mechanism of interaction during the RBD opening process. This finding reveals that a specific mutation alters spike dynamics through a glycan interaction, thereby reinforcing the view that glycans play important functional roles beyond simply providing surface shielding. In the early stages of RBD opening, the newly introduced arginine residue (R493) acts as a potent electrostatic anchor. Its positive charge forms a specific and strong salt bridge with the negatively charged terminal sialic acid of the N165-glycan. This initial interaction functions as a transient but strong tether, significantly restricting the glycan's conformational freedom and guiding its initial movement along with the RBD surface (see Fig. S5).
However, as the RBD transitions towards a more open state, this specific, localized tethering effect diminishes. The increasing distance and altered orientation between R493 and the glycan weaken the initial salt bridge. The interaction then shifts to a weaker, more diffuse electrostatic attraction. This secondary interaction is governed by the overall electrostatic complementarity between the positive surface potential of the RBD, enhanced by the R493 mutation, and the glycan surface (see Fig. S5A–C). Our additional simulation on BA.2 spike with oligomannose-N165 glycan (see Fig. S5E) demonstrates that this principle holds for different glycoforms, including both sialylated complex and oligomannose types, which provides a structural basis for understanding why the N165 glycan can tolerate significant structural diversity across variants while still performing its crucial role in modulating RBD dynamics. This transition from a strong, localized interaction to a weaker, delocalized one is supported by our electrostatic potential surface calculations and analysis (see Fig. S5).
Consequently, the Q493R mutation does not permanently anchor the glycan but rather modulates its dynamics in a two-fold manner: an initial strong restriction followed by a gentler, more global guidance. This effect collectively ensures the N165-glycan remains associated with the RBD surface, preventing the large-scale swinging motions required for bridge formation. This overall coupled motion is further corroborated by a principal component analysis of the N165-glycan and the RBD in the BA.2 variant (Fig. 3, lower right corner), which reveals a high degree of correlation between their respective centers of mass. To test the reliability of this proposed tethering mechanism, we performed two additional, independent metadynamics simulations of the BA.2 variant: one with re-initialized glycan conformations and another where the N165 glycan was remodelled to the oligomannose-type (Man5) also prevalent in Omicron. Crucially, the key features of this N165-glycan restriction and the resulting thermodynamic instability of the open state remained consistent across all simulations. This consistency suggests that the observed mechanism is a feature of the system and not an artefact of a specific initial model.
The Delta and BA.4&5 variants tried to employ a similar strategy: by introducing a positively charged arginine residue via the L452R mutation to increase the surface's positive charge and attract the N165-glycan. This successfully anchors the root of the glycan, thereby balancing the competing needs of stabilizing the RBD intermediate state and maintaining tip flexibility for N343 bridge formation (see Fig. S3 and S4's lower right corner). However, they diverge in the stabilization effect on the RBD opening intermediate state, revealing how BA.4&5 selectively inherited and refined traits from its predecessor. In the Delta variant, while the L452R mutation anchors the root of the N165-glycan, the presence of the negatively charged glutamic acid at position 484 (E484) contributes to a more diffuse negative electrostatic potential on the surrounding RBD surface. This creates a subtle, yet persistent, unfavorable interaction with the overall partial negative charge of the N165-glycan, thereby weakening the global affinity between the glycan and the RBD. This attenuated attraction makes it difficult for the Delta variant to stabilize a distinct intermediate state during the RBD opening process, unlike what is observed in its predecessors.
BA.4&5 meticulously refines this mechanism. While retaining L452R, it eliminates the unfavorable electrostatic interference through the E484A mutation, leading to a markedly enhanced contact between the N165-glycan and the RBD surface. This strengthened and uniformly distributed interaction network strikes an exceptional balance: it is strong enough to efficiently stabilize the critical intermediate state along the opening pathway, yet it avoids the excessive tether-like restriction of BA.2's Q493R. By retaining the native Q493, whose neutral side chain forms weaker hydrogen bonds compared to the potent electrostatic attraction from arginine's positively charged group, it preserves the conformational freedom necessary for the glycan tip to form the bridge. By resolving the electrostatic hindrance encountered by Delta, BA.4&5 resolves the dilemma between kinetic accessibility and thermodynamic stability, demonstrating a fitness advantage.
Furthermore, the persistent mutational trend observed at site 477/478 throughout the evolution of SARS-CoV-2 implies a critical function for this region in modulating glycan dynamics. We note that the T478K mutation, which became dominant in the Delta variant, and the S477N mutation, prevalent in the Omicron family, could both further modulate the triangular competition network by altering the local chemical environment. Chemically, the T478K mutation substitutes a neutral, polar threonine with a positively charged, long-chain lysine. We hypothesize that this newly introduced positive charge center could more effectively “capture” the polar glycan chains via long-range electrostatic attraction, thereby enhancing the stickiness of this site as a network node and potentially affecting the frequency and duration of N165 or N343 glycan contacts. On the other hand, the S477N mutation replaces a serine with an asparagine; although both are neutral polar residues, asparagine possesses a different hydrogen bond donor–acceptor pattern (due to its amide group) and a longer side chain.
Such a seemingly subtle adjustment could potentially optimize the conformational landscape of the hydrogen-bond network with the glycan chain, achieving a delicate “fine-tuning” of the interaction. This leads to an intriguing hypothesis: these subsequent mutations may not serve to create entirely new mechanisms. Instead, their role could be to seek a more advantageous dynamic equilibrium for RBD opening and closing by optimizing interaction strengths at critical nodes within the pre-existing dynamic framework. This hypothesis provides a new perspective for understanding variant evolution strategies, and its specific mechanisms await further experimental and computational validation. This “fine-tuning” hypothesis is also supported by another of our findings. Our simulations show that in BA.2, the R346 residue is also involved in electrostatic anchoring of the N165 glycan. Notably, the R346T mutation frequently occurs in subsequent, more widely spread Omicron subvariants (such as BA.2.75), strongly suggesting that this site is also a key fine-tuning node for the virus's adaptive evolution to optimize glycan dynamics.
We also recognize that constructing a precise and comprehensive description for a system as large and complex as the spike glycoprotein is a significant computational challenge. On one hand, although we cross-validated the simulations using multiple standard methods, fully satisfying the ideal standard of multiple reversible transitions for slow, large-scale conformational changes remains a Frontier challenge. On the other hand, our current analysis focuses on direct glycan–protein interactions. We fully acknowledge that water-mediated hydrogen bonds, or “water bridges”, play a crucial role in the interaction landscape and conformational dynamics. Indeed, we observed traces of structural water molecules within our trajectories (as shown in Fig. S5), yet fully characterizing their role requires additional extensive work. A comprehensive analysis of these water bridges, which involves dissecting the intricate coupling between solvent, flexible protein loops, and glycans, represents a substantial undertaking that we are pursuing as an independent, in-depth follow-up study. Therefore, we position the FELs in this study as semi-quantitative physical pictures that reveal key mechanistic divergences. We believe that, building upon this foundation, future work incorporating both longer simulation times and explicit analysis of solvent effects will be able to refine the absolute values of the energy barriers, thereby deepening our understanding of the dynamic mechanisms of viral evolution.
This work provides insights into the physicochemical principles underlying viral adaptive evolution, highlighting how distant residues can allosterically reshape conformational energy landscapes to alter function. Crucially, the detailed mechanisms suggested here offer a robust framework for interpreting perplexing virological data. For instance, the context-dependent RBD dynamics we describe, which are intrinsically tied to the full trimer conformation, provide a plausible mechanistic framework for recently reported discrepancies between the binding affinities of isolated RBDs and intact spike trimers for emerging variants like JN.1.49 By illuminating these sophisticated evolutionary strategies, our findings offer new targets and refined principles for the rational design of future antiviral drugs and vaccines.
Minimum energy paths in free energy landscapes, and interactions between key residues and adjacent glycans can be found in supplementary information (SI). See DOI: https://doi.org/10.1039/d5cp04391c.
| This journal is © the Owner Societies 2026 |