Open Access Article
Alessandro Mossa
* and
Giorgia Brancolini
*
Center S3, CNR Institute of Nanoscience, via Campi 213/A, 41125 Modena, Italy. E-mail: alessandro.mossa@nano.cnr.it; giorgia.brancolini@nano.cnr.it
First published on 24th November 2025
This study employs a multiscale approach to investigate the interaction dynamics between the West Nile virus (WNV) envelope (E) protein and gold nanoclusters functionalized with DNA aptamers. By integrating aptamer secondary and tertiary structure prediction, structural docking and atomistic molecular dynamics simulations, we reveal the structural, dynamic and electrostatic factors governing the aptamer-target WNV E protein recognition at the nanoscale. Two gold nanoclusters, Au144(SR)60 and Au314(SR)96 (SR = thiolate ligand), were examined to assess how nanoparticle size and surface functionalization influence aptamer anchoring and binding stability under physiological salt conditions. This approach enhances aptamer functionalization strategies for detecting West Nile virus and creates a flexible framework for aptamer-based diagnostics for other emerging pathogens, with implications for diagnostics across a range of viral and protein biomarkers.
The interaction between proteins and functionalized gold nanoparticles (AuNPs) plays a fundamental role in optimizing surface properties necessary for biosensor applications.3 Utilizing monolayer-coated AuNPs that incorporate DNA aptamers as functional groups allows fine-tuning of surface characteristics to enhance interactions with biological targets.4 In particular, controlling the density, orientation, and binding affinity of biomolecules, such as proteins, is critical to improve the performance of monolayer-coated AuNP systems.5
This study aims to address a fundamental gap by developing label-free plasmonic nanoparticles (NPs)-based bioactive platforms for the spectroscopic recognition and quantification of pathogens. Specifically, it focuses on the WNV's envelope (E) protein, a key glycoprotein that plays a pivotal role in the virus's pathogenesis,6 thus advancing our understanding of virus-nanomaterial interactions at the molecular level.
Although substantial progress over the past decade has advanced our understanding of sensing methods to detect viruses belonging to the flaviviridae family,7 such as Dengue virus (DENV), Zika virus (ZIKV), Yellow Fever virus (YFV), and Tick-borne encephalitis virus (TBEV), methods for detecting West Nile virus (WNV) using nanoparticle-based nanosensors remain underdeveloped. Currently, the available diagnostic techniques for DENV and WNV primarily rely on cell culture,8 polymerase chain reaction (PCR),9 and Enzyme-Linked Immunosorbent Assay (ELISA).10,11 These conventional diagnostic methods often necessitate sophisticated medical facilities and trained personnel, which are not suitable for resource-limited settings.
The development of nanoparticle-based biosensors represents a promising approach by exploiting the fundamental physicochemical interactions between nanoparticles and biomolecules to enhance sensitivity and specificity in viral detection. Recent advances in aptamer-based biosensors, or aptasensors, highlight the importance of understanding these interactions at the molecular level, providing a powerful platform for disease diagnostics through precise and selective biomolecular recognition.12 Aptamers, which are single-stranded oligonucleotides, possess several advantages including high stability, ease of chemical functionalization, low cost, and minimal batch-to-batch variation.13
Notably, colorimetric-based aptasensors have emerged as favorable analytical methods, utilizing nanomaterials such as nanozymes and plasmonic nanoparticles for enhanced signal detection.14,15 However, electrochemical-based aptasensors have gained dominance due to their superior signal transduction capabilities.12,16 Despite these advancements, existing methods often suffer from a lack of signal amplification, leading to sensitivity limitations.17
In recent studies, novel aptamer designs have shown promise for specific viral detection. For example, Lee and Zeng14 developed Zika NS1-binding aptamers with high affinity, demonstrating their potential for sensitive viral detection in ELISA-based assays. Similarly, Matsunaga et al.15 utilized DNA aptamers containing unnatural bases to enhance specificity for detecting Dengue virus NS1 serotypes. Morais et al.17 successfully selected single-stranded DNA (ssDNA) aptamers specific to the Zika virus through whole-virus Systematic Evolution of Ligands by Exponential Enrichment (SELEX),18 allowing Zika to be differentiated from Dengue virus. In this work, the term single-stranded oligonucleotides refers to short, single-stranded nucleic acid sequences. ssDNA denotes oligonucleotides composed exclusively of deoxyribonucleotides, typically used as the structural basis for DNA aptamers. The term aptamer is reserved for a subset of ssDNA (or RNA) sequences that, through the SELEX process, fold into defined three-dimensional conformations capable of specifically recognizing and binding to a target molecule with high affinity.
Aptamers act as synthetic analogs of antibodies, exhibiting high affinity and specificity toward a wide variety of targets, including proteins, viruses, and small molecules. Emerging methodologies incorporating artificial intelligence (AI) have further streamlined the aptamer design process, enhancing the efficiency of SELEX.19,20 By integrating computational techniques such as molecular dynamics simulations and machine learning, researchers can optimize aptamer-target binding predictions and improve the overall design of aptamers for diagnostic applications.
In recent years, computational modeling—particularly molecular dynamics (MD) simulations—has become a powerful complement to experimental approaches for elucidating aptamer behavior in biosensing systems. A typical in silico workflow involves aptamer structure prediction, target docking, and MD refinement to evaluate the stability and binding energetics of aptamer–ligand complexes.21
Several MD studies have provided detailed insight into aptamer–nanomaterial and aptamer–protein interactions relevant to biosensing.22,23 For instance, simulations of RNA aptamers immobilized on thiol-functionalized gold surfaces have revealed how spacer length, aptamer orientation, and interfacial ion distributions affect target recognition efficiency.4 Similarly, atomistic simulations of aptamer–protein complexes, such as anti-MUC1 systems, have shown how subtle sequence or structural variations modulate binding affinity and conformational stability.24 Other examples include MD analyses of ssDNA aptamers adsorbed on citrate-capped gold nanoparticles, highlighting the electrostatic nature of aptamer adsorption and the influence of divalent ions on complex formation.25 These studies collectively demonstrate the central role of MD simulations in connecting nanoscale structure, surface chemistry, and binding performance, thereby guiding the rational design of high-efficiency aptasensors.
In light of recent advances by Park et al.,26 who experimentally demonstrated the Tr-WNV-37 truncated aptamer in a rapid, cost-effective electrochemical biosensor for West Nile virus (WNV) detection, we use the Tr-WNV-37 sequence as the foundation for a computational exploration of aptamer performance when anchored to gold nanoclusters. We design and optimize high-affinity DNA aptamers targeting the WNV envelope (E) protein by predicting their secondary and tertiary structures, linking them via a T6 spacer, and modeling their immobilization on hexanol-thiol-functionalized gold nanoclusters. This computational approach enables the identification of key structural determinants governing aptamer binding and provides insights that complement and extend the experimental findings.
Using state-of-the-art tools we perform in silico simulations to enhance binding specificity. Selected aptamer sequences are anchored onto hexanol-thiol-functionalized gold nanoclusters of varying sizes to assess their stability and binding affinity.
Our methodology involves the development of tailored force field parameters to simulate these systems accurately,27 revealing that an increase in aptamer density on Au314(SR)96 enhances additional interactions without disrupting primary binding, whereas Au144(SR)60 shows less promising stability. Thiol angle analysis highlights variability in anchoring orientations due to nanoparticle diffusion, suggesting a flexible and robust binding interface between the aptamer and E protein.
The computational findings are further supported by free energy calculations, demonstrating a lower ΔG of binding upon the aptamer anchoring to gold nanoclusters, which is indicative of a more stable aptamer-protein interaction. This approach not only validates the experimental stability observed in Park et al.'s work,26 but also highlights the fact that the aptamer immobilization on surfaces does not weaken its binding affinity for the target protein.
To summarize, the aim of the present work is to develop a computational strategy to engineer gold nanoparticle (AuNP), aptamer conjugates that maintain and potentially enhance binding affinity under high ionic strength conditions, environments that typically compromise biosensor performance.3,27 Through advanced molecular dynamics simulations, we present the first detailed characterization of the structural dynamics, surface anchoring behavior, and electrostatic interactions within the Tr-WNV-37–AuNP–protein complex. This theoretical framework elucidates the molecular determinants of aptamer functionality in realistic biosensing settings and establishes design principles for next-generation aptamer-based nanobiosensors targeting WNV and related flaviviruses.
![]() | ||
| Fig. 1 (A) Three-domain structure of the E protein of the West Nile virus. (B) Atomic model of the aptamer Tr-WNV-37 used in the docking procedure. (C) Best-scoring aptamer-protein complex as determined by docking. All images of molecular models in this paper have been created with UCSF ChimeraX.30 | ||
The protonation state of the E protein was computed using the web server H++31 under the conditions 150 mM l−1 NaCl, pH 7.4, external and internal dielectric constants valued 80 and 10, respectively. The total charge of the E protein is then −2e, its mass 42.1 kDa. All simulations were conducted at a physiological ionic strength of 150 mM, consistent with the salt concentrations typically used in aptamer folding buffers,26 and biosensing assays.3,27
The structures of gold nanoparticles Au144(SR)6032 and Au314(SR)9633 have been retrieved from the webserver NanoModeler 2.0,34 which was also used to coat the NPs with thiolate ligands SC6H12OH. The use of such self-assembled monolayers (SAMs) in electrochemical biosensors has been shown to improve both the formation and stability of aptamer–gold interfaces, thus improving sensor performance.35 The functionalization with 6-mercapto-1-hexanol self-assembled monolayer (SAM) was chosen for its chemical stability and its widespread use as the preferred functionalization in gold electrochemical biosensors. This selection is supported by multiple studies,22,36 demonstrating the effectiveness of functionalized gold surfaces in biosensing applications via atomistic simulations. In particular, in our recent work we discussed the role of SAMs in stabilizing nanostructured interfaces,35 by performing atomistic simulations compared with SPR experiments. Moreover, Imanparast et al.37 showed that 6-mercapto-1-hexanol facilitates functionalization while preserving structural stability and enabling targeted conjugation strategies. Furthermore, Ruan et al.4 conducted molecular dynamics simulations of an RNA aptasensor immobilized on a hexanol-thiol-functionalized gold surface, demonstrating that the aptamer remains oriented parallel to the surface, preserving its structural integrity and binding affinity–features critical for biosensing performance.
Special attention was paid to domain III (DIII) of the E protein, a well-characterized epitope known to be targeted by neutralizing antibodies.38 Aptamer design was directed specifically toward this region to ensure high-affinity and selective binding.
The modeling pipeline begins with UNAfold,39 which predicts the most thermodynamically stable 2D secondary structure of the aptamer based on its sequence. This structure is then inputted into RNAComposer40,41 to generate a 3D atomic model. The resulting RNA model is converted into its DNA counterpart (Fig. 1(B)), energy-minimized, and subsequently used for docking simulations against the E protein. Docking allows for the identification of the most favorable binding orientation and key interaction sites between the aptamer and its target.
One may wonder if there are quicker alternatives to the time-consuming modeling pipeline describe above. Even though some measure of success has recently been reported42 on using directly AlphaFold 343 to predict aptamer 3D structure, this particular sequence belongs to the class of synthetic oligonucleotides underrepresented in the databases used for training, with the result that the predicted structure has in this case very low confidence and is in fact hardly believable in its attempt to reduce the aptamer's shape into a regular double-helix.
Pons et al.44 demonstrated, using surface plasmon resonance (SPR), that the introduction of a spacer group during aptamer immobilization significantly influences the sensor response. Specifically, they observed that a spacer length of six thymidines (T6) enhanced aptamer–target affinity compared to no spacer (T0), as evidenced by a marked increase in SPR signal amplitude. This improvement was attributed to the spacer's ability to reduce steric hindrance and promote an optimal orientation of the aptamer on the sensor surface, thereby facilitating target accessibility and binding efficiency. Guided by these findings, we incorporated a T6 spacer in our design to achieve a similar effect–enhancing aptamer–target interactions by minimizing surface-induced conformational constraints. This choice is further supported by the molecular dynamics simulations of Ruan et al.,4 which showed that a T6 spacer preserves the aptamer's structural integrity and orientation upon immobilization, thereby maintaining its binding affinity. The rationale for employing the T6 spacer is thus firmly grounded in established principles of aptamer immobilization and SPR-based biosensing.
The sequence of the aptamer Tr-WNV-37 modified by the insertion of six thymines at the 3′ end is 5′-GTACC TGCTC TACAC CGTAC CTGCT CTAAG CACGC CAAGG GACTA TAGAG CAGGT TTTTT-3′.
The Tr-WNV-37 aptamer and the West Nile virus (WNV) envelope (E) protein were used as docking partners. Domain III (DIII), a well-characterized target known for its high antigenicity and critical role in receptor binding, has been extensively studied as a binding epitope for both aptamers and antibodies.28 Residues on domain III (DIII) previously identified as directly involved in aptamer binding were defined as active, meaning that the scoring algorithm assigns a penalty if these residues are not interacting within the docked configuration. Residues from domains I and II (DI and DII) were defined as passive: they were included in the docking calculations and could contribute to the interaction through steric and non-bonded effects, but the scoring function does not enforce their participation. Importantly, DI and DII were not truncated or removed from the protein structure prior to docking. For the aptamer, all nucleotides were marked as active to enable full exploration of possible binding interfaces.
Rigid-body docking was carried out to efficiently explore initial binding orientations according to a well-established computational protocol,36,47,48 followed by enhanced-sampling MD refinement to incorporate the intrinsic flexibility of both the aptamer and the protein at the interface. When rigid-body docking was carried out, it generated multiple aptamer–protein complexes. Despite differences in the number of top-ranked solutions per run, the 6 best models were selected based on their HADDOCK scores. These models were further evaluated by cluster analysis, ranking them according to their Z-scores and energy terms.
The most representative cluster was selected based on its relative population, accounting for 60% of all models (Cluster 1), indicating it was the most recurrent among the generated solutions. A second cluster was found to account for 20% of the resulting poses (Cluster 2), while the remaining association complexes were distributed among four different outlier clusters (Clusters 3, 4, 5, and 6), each representing only 5% of the structures. Additionally, the most representative cluster had the lowest Z-score (−2.1). This combination of high recurrence and favorable scoring suggests that this cluster represents a statistically robust and high-quality docking solution. The energy breakdown revealed that the primary driving force for the initial association complex was electrostatics, followed by van der Waals interactions. The representative structure corresponding to this most favorable Z-score and dominant population was chosen as the starting configuration for molecular dynamics (MD) simulations (see Fig. 1(C)), ensuring a stable and energetically favorable model for further refinement and analysis.
The hydrogen bonds present in the starting aptamer-protein configuration are detailed in Table 1. They can be compared with the map of most frequent contacts in Fig. 16 to assess that most (but not all) of the initial interactions are preserved during the simulation, a signal that the initial docking analysis is sound, and that MD has had enough time to correct the picture, when needed.
| Type | Id | Type | Id | Notes |
|---|---|---|---|---|
| SER | 16 | Cyt | 10 | |
| LEU | 349 | Cyt | 20 | |
| LEU | 349 | Cyt | 21 | |
| THR | 350 | Cyt | 20 | |
| PHE | 379 | Cyt | 13 | |
| ARG | 388 | Thy | 22 | Double |
| ARG | 388 | Ade | 38 | |
| GLN | 392 | Ade | 38 |
![]() | ||
| Fig. 2 Initial configuration of the systems that have been simulated in the absence of the E protein. The number of aptamers has been chosen to have matching aptamer density (see Table 2). | ||
| Data set | Mass | Charge | Apt. density |
|---|---|---|---|
| [kDa] | [e] | [nm−2] | |
| Au314(SR)96 | 74.6 | 0 | 0.0000 |
| Au314(SR)96 + 1 apt. | 92.9 | −59 | 0.0468 |
| Au314(SR)96 + 3 apt. | 125.5 | −177 | 0.1404 |
| Au314(SR)96 + 6 apt. | 184.5 | −354 | 0.2808 |
| Au144(SR)60 | 36.4 | 0 | 0.0000 |
| Au144(SR)60 + 1 apt. | 54.7 | −59 | 0.0719 |
| Au144(SR)60 + 2 apt. | 70.3 | −118 | 0.1438 |
| Au144(SR)60 + 4 apt. | 104.2 | −236 | 0.2876 |
All systems in Table 2, except the two consisting of NPs without aptamers, have also been simulated in the presence of the E protein, always starting from a configuration in which the relative position between the protein and one of the aptamers is initially the same resulting from the docking procedure detailed in section 2.1.2. In all of the MD traces this first aptamer remains close to the E protein throughout the entire simulation, even if a rearrangement of their relative position is quite common. If presents, also other aptamers can, and often do, get close to the protein, either for a while, or until the end of the trajectory. A representative final frame from each of these simulations is depicted in Fig. 3. An additional set of simulations was run for the aptamer alone (not connected to any NP) in interaction with the E protein.
Once the complete aptamer–SAM–AuNP system was constructed using an in-house script, force-field parameters were derived to ensure full compatibility with amber99bsc1. RESP charges for the linker atoms were derived at the HF/6-31G* level of theory using the PyRED server,51 following the RESP-A1 scheme, which is fully consistent with AMBER standards. The derivation workflow, comprising geometry optimization and electrostatic potential (ESP) fitting, was based on procedures previously validated against experimental data.3,27,48
A detailed table containing the RESP charges computed by PyRED and the adjusted values we used in our simulations is available in section S2 of SI.
Velocities for all atoms are randomly generated according to a Maxwell distribution at temperature 300 K, then the system undergoes two rounds of equilibration with position restraints preventing the displacement of all heavy atoms of the solute: first, 100 ps of isothermal dynamics, under velocity-rescaling thermostat,57 followed by other 100 ps of isothermal-isobaric dynamics with added Parrinello–Rahman pressure coupling at 1 bar.58,59 Finally, position restraints are turned off and we perform 500 ns of isothermal-isobaric molecular dynamics, with standard leap-frog algorithm and 2 fs integration step. Each simulation is repeated 5 times, with different random seeds for the velocity generation. A snapshot of the system containing the position of each atom is saved every 0.1 ns; each MD trajectory comprises therefore 5000 frames.
The very small variance of the radius of the sphere, with no discernible trend in its time evolution, is a confirmation that the force field for the gold core of the NP is very stable for the entire duration of the simulation. In order to quantify the quality of the spherical approximation, we can compute the Root Mean Square Deviation (RMSD) of the sulfur atoms from the optimized spherical surface. It turns out that the spherical approximation is excellent both for the smaller NP (average RMSD is 0.20 Å, only 2.2% of the radius) and for the larger NP (average RMSD is 0.38 Å, which represent a 3.2% of the radius).
Nonetheless, certain distinctive traits do emerge. Residues from 3 to 12 are complementary to residues from 55 to 46; this is somehow underestimated by the prediction software we used to create a starting configuration for the aptamer (see section 2.1.1). Even if they are not initially in contact (Fig. 1(B)), during the simulation matching base-pairs tend to find each other on times of the order ∼100 ns and form an almost full turn of double-helix. Once formed, this helix is generally conserved until the end of the MD run. By analogy with the nomenclature commonly used for DNA hairpins, we call this region the stem. In contrast, the central portion of the aptamer (residues from 13 to 54), exibit sparse and complementary pairing, a highly heterogeneous backbone configuration, and nucleobases that frequently protrude from the backbone, making them available for interactions with the surrounding environment. We refer to this region as the bulge: note that the nomenclature of secondary structure motifs in nucleic acid aptamers is not yet standardized.42 A typical structure exemplifying these characteristics is represented in Fig. 4.
It should be kept in mind that although the maximum likelihood estimator used by us outperforms other competing methods, it is still affected by a rather large intrinsic uncertainty (its theoretical variance is 1/3 of the real value of D61), essentially due to the fact that we are estimating an ensemble average from a single—and not so long—trajectory. Moreover, the number of simulations in each data set is small (10 for the NPs without aptamers, 5 for the others). In this light, anomalies like the unexpectedly large values of D for Au144(SR)60 and Au314(SR)96 with 3 aptamers should not be considered particularly significant.
However, despite these limitations, the correlation between the radius of gyration Rg and the hydrodynamic radius Rh (i.e., the radius of a spherical particle which has the same diffusion coefficient as the functionalized NP) shown in Fig. 6 is reasonably linear, and the trend for smaller and larger NPs is very similar. This suggests a way to give a very rough estimate of Rh starting from a measure of Rg, which, unlike Rh, requires modest computational resources.
![]() | ||
| Fig. 6 Comparison between the hydrodynamic radius Rh and the radius of gyration Rg for the NPs of Fig. 2. Error bars on Rg are too small to be visible for the NPs without aptamers. The slope of the linear regression lines for larger and smaller NPs are similar: 1.4 and 1.5, respectively. | ||
In the analysis of the distribution of ligand angles, we have found it useful to distinguish between the ligands on Au144(SR)60 and the ligands on Au314(SR)96, and between ligands to which an aptamer is attached (which we indicate with the residue name ET6) and ligands with a free end (residue name ETO). The four resulting distributions are represented in Fig. 7. As shown in Table 3, the mean values are very similar and around 35°; the standard deviations are also quite similar and around 15° for all data sets. The main difference is found in the right tail of the distributions: of course, free ligands find it easier to reach large values of ϑ than ligands with an attached aptamer. Less obvious is the fact that ligands on the small NP have a significantly better chance of bending close to the surface than similar ligands on the large NP. The variation in ligand angle reflects a curvature-related effect, influencing local surface accessibility and ion interactions. While standard deviations are large, the trend is consistent across simulations, highlighting a systematic impact on nanoparticle stability and binding behavior relevant for design considerations.
| Data set | Mean | St. dev. | p(ϑ > 60°) |
|---|---|---|---|
| ETO Au144 | 37.0° | 16.1° | 0.080 |
| ETO Au314 | 35.2° | 15.2° | 0.053 |
| ET6 Au144 | 34.9° | 14.9° | 0.047 |
| ET6 Au314 | 34.9° | 13.9° | 0.021 |
![]() | ||
| Fig. 8 Distribution of ion distances to the NP Au314 with 6 aptamers obtained by summing all frames with t > 50 ns from 5 independent trajectories. The inset is a magnified detail of the same graph. | ||
To answer this question, we restrict our analysis to the cations of the highest peak, those that establish contact with the NP, which we define by the condition d < 3.0 Å. For each cation in contact with the NP, we record the closest residue; the resulting histogram is represented in Fig. 9.
We can clearly distinguish three regions: residues 1–144 are gold atoms of the NP core, 145–202 are ligands of the NP coating, 203–263 and 264–324 are two aptamers. The rare cations that reach the gold core tend to stay there for extended periods, kinetically trapped by the ligands surrounding them, hence leaving a sizable trace in the histogram of contacts. In comparison, more cations establish a contact with one of the thiolate ligands, but such contacts tend to be shorter-lived. All other data sets exhibit the same features (see Fig. S10 in SI).
Remarkably, the pattern of contacts established with each aptamer appears to be repeated. This suggests the idea of focusing on the cations in contact with an aptamer, thus observing the relative propensity of different aptamer residues to be in contact with positive ions. Where more than one aptamer is present, this is achieved by classifying contacts by aptamer residue (instead of by NP residue, as in Fig. 9), thus summing together the histogram portions that refer to all aptamers.
In order to be able to compare different data sets, it is necessary to normalize the respective histograms. If we divide the tally by the number of frames multiplied by the number of aptamers, we measure the frequency of ion contacts as a function of the aptamer residue. Note that, as a single aptamer residue can happen to be the closest part of the NP for more than one ion at the same time, here the word “frequency” should not be intended as a quantity necessarily less than one. The plots thus obtained for each individual data set are available as Fig. S11 in SI, while the cumulative histogram that takes into account all data sets is represented in Fig. 10.
It is clear that certain residues are systematically more capable than others of retaining positive ions in their proximity, and this effect does not depend on the nucleotide base (see Fig. S12 in SI). An attempt to link such propensity to an easily computed property such as the solvent-accessible surface area (SASA, computed using the algorithm introduced by Eisenhaber et al.62 as implemented in the GROMACS library of analysis tools) yields very poor correlation coefficients, as shown in Fig. S13 in SI.
![]() | ||
| Fig. 11 Frequency of cation contacts with the aptamer residue Ade 38 computed in all MD trajectories. | ||
However, if we extend this analysis to the other key aptamer residues in Fig. 16, the situation is not so clear (see Fig. S15 in SI). Instead of focusing on the cation contacts of a single aptamer residue, it is more informative to look at the global modifications undergone by the histogram in Fig. 10 when the protein is present, as illustrated in Fig. 12.
![]() | ||
| Fig. 12 Frequency of cation contacts by aptamer residue, as in Fig. 10. Effect of the presence of the protein. | ||
It can be seen that the presence of the protein does not really affect the frequency of cation contacts for the residues in the lower stem of the aptamer (residue numbers less than 10 or more than 50). For the central residues, the protein induces a certain displacement of the cations in contact with the aptamer, but this effect tends to be localized in a small region: as shown in Fig. 13, the probability of establishing contact with a cation lost by Ade 38 and Ade 39 is gained by the neighboring residues.
![]() | ||
| Fig. 13 Detail of Fig. 12, showing the region around the residue Ade 38, which is the most engaged in interactions with the E protein. | ||
In summary, our ion distribution analysis reveals that positively charged ions initially placed near the aptamer backbone remain stably associated with the NP system throughout the simulations. While ions at greater distances exhibit a random, bulk-like distribution, a significant population of cations remains tightly localized near the NP surface, forming distinct peaks at short distances. These tightly bound cations primarily associate with the gold core and thiolate ligands, with some exhibiting long-lived interactions indicative of kinetic trapping.
Remarkably, specific aptamer residues consistently attract cations, highlighting a non-uniform charge distribution along the DNA strand. This behavior is not correlated with solvent-accessible surface area (SASA), suggesting that local electrostatic environments and aptamer conformation play a more dominant role in ion association.
When the E protein of West Nile virus is present, the ion distribution remains mostly unchanged in the bulk region, but slight shifts in local cation binding are observed. Notably, residues involved in protein binding, particularly Ade 38 and Ade 39, exhibit reduced cation interactions, in line with their involvement in protein contacts. The displaced cation interactions are redistributed to nearby residues, suggesting a localized reorganization of the electrostatic environment upon protein binding.
Overall, these findings emphasize the dynamic and selective nature of cation association in aptamer-functionalized nanoclusters and underscore its relevance for biosensor performance, where surface charge modulation can significantly influence both target binding and signal readout.
In the following, we report statistics about the frequency of contacts by protein residue, aptamer residue, and specific pair of protein-aptamer residues. We will call primary contacts those formed by the aptamer originally bound to the protein (to which we will refer as the first aptamer), and secondary contacts those established in the course of the simulation by other aptamers (if present). Such secondary contacts are by no means a rare occurrence in our simulations: in fact, the mean number of secondary contacts per frame (computed over all traces where more than one aptamer is present) is 1.03, which suggests that finding other aptamers in interaction with the E protein is more likely than not.
000 time snapshots. The residues with a mean number of contacts larger than 0.5 are listed in Table 4.
| Type | Id | Mean | Type | Id | Mean |
|---|---|---|---|---|---|
| ARG | 388 | 2.571 | PRO | 351 | 0.859 |
| VAL | 304 | 1.186 | LYS | 337 | 0.747 |
| LEU | 349 | 1.068 | SER | 16 | 0.723 |
| LYS | 36 | 1.003 | PRO | 339 | 0.697 |
| THR | 350 | 0.929 | GLY | 387 | 0.609 |
| SER | 306 | 0.900 | GLU | 390 | 0.570 |
| LYS | 307 | 0.881 | SER | 341 | 0.533 |
The residues most likely to be in contact with the first aptamer belong to domain DIII of the protein (residues from 298 to 403, see Fig. 1(A)), with the exception of a few residues in DI, as shown in Fig. 14. The residue ARG 388 stands out as by far the most important for interactions with the first aptamer; its very exposed position enables it to regularly engage in more than one contact simultaneously.
![]() | ||
| Fig. 14 Reference structure of the E protein color-coded by average number of primary contacts with the aptamer. | ||
Secondary contacts, on the other hand, involve almost exclusively the domain DII, and no individual residue stands out as particularly more attractive than others (see Table 5). We can notice, however, a prevalence of secondary contacts with the positively charged (ARG, LYS) or the polar (SER, THR, ASN, GLN) amino acids, a feature that is not unexpected due to the negatively charged nature of the aptamer. As DII is normally connected to the virus envelope, it is not as available in vivo to interact with aptamers as is in our simulations. However, we can expect secondary contacts to be common and to play a role in in vitro experiments where the E protein is detached by the rest of the virus.
| Type | Id | Mean | Type | Id | Mean | Type | Id | Mean |
|---|---|---|---|---|---|---|---|---|
| ASN | 199 | 3.2% | GLN | 94 | 2.6% | VAL | 96 | 1.8% |
| LYS | 110 | 3.1% | ARG | 234 | 2.5% | SER | 175 | 1.7% |
| ARG | 193 | 3.0% | THR | 177 | 2.4% | SER | 230 | 1.6% |
| LYS | 249 | 2.9% | ARG | 93 | 2.2% | TYR | 176 | 1.6% |
| ARG | 215 | 2.6% | LYS | 132 | 2.1% | TRP | 225 | 1.6% |
| THR | 231 | 2.6% | ARG | 85 | 2.0% | ARG | 57 | 1.6% |
| Type | Id | Mean | Type | Id | Mean |
|---|---|---|---|---|---|
| Ade | 38 | 5.470 | Cyt | 10 | 0.864 |
| Ade | 29 | 2.192 | Cyt | 15 | 0.795 |
| Gua | 39 | 1.304 | Gua | 30 | 0.791 |
| Ade | 14 | 1.098 | Ade | 28 | 0.584 |
| Ade | 37 | 1.066 | Thy | 22 | 0.547 |
| Cyt | 21 | 0.969 | Thy | 11 | 0.533 |
As ARG 388 was pre-eminent among the protein residues active in forming contacts, so is Ade 38 among the aptamer residues. The fact that in most of the time snapshots this adenine sticks clearly out of the DNA backbone allows it to establish contacts with as much as 8 amino acids at the same time (see Fig. S18 in SI, the panel about run 2), thus creating a very strong binding to the protein. The other top residues belong mostly in the bulge, with the exception of a group of them at around half of the aptamer length, as shown in Fig. 15.
![]() | ||
| Fig. 15 Reference structure of an aptamer color-coded by average number of primary contacts with the E protein. | ||
Residues involved in establishing secondary contacts are considerably less localized, as they can be found along the entire length of the aptamer (see Table 7).
| Type | Id | Mean | Type | Id | Mean | Type | Id | Mean |
|---|---|---|---|---|---|---|---|---|
| Cyt | 35 | 5.1% | Cyt | 20 | 3.8% | Gua | 39 | 3.1% |
| Gua | 1 | 4.9% | Ade | 14 | 3.7% | Gua | 50 | 3.0% |
| Ade | 29 | 4.4% | Cyt | 51 | 3.6% | Cyt | 21 | 2.7% |
| Cyt | 15 | 4.0% | Thy | 27 | 3.6% | Ade | 3 | 2.6% |
| Ade | 49 | 3.9% | Ade | 38 | 3.4% | Cyt | 16 | 2.5% |
000 frames analyzed from all trajectories. If we consider a subset of the contact map containing the most frequently observed primary contacts (Fig. 16), it is evident the key role played by nucleotide Ade 38.
![]() | ||
| Fig. 16 A portion of the contact map showing the 19 most frequent contacts between protein and aptamer. | ||
Apart from the contacts established by the aptamer originally bound to the protein, in the simulations of the NPs with more than one aptamer it is quite common to observe the formation of other, non-specific, contacts. A portion of the contact map large enough to contain the 15 most frequent non-specific interactions is shown in Fig. 17.
Comparison with the data set labeled apt where the aptamer is not attached to any NP shows that immobilization does not spoil the interaction with the protein. If anything, it slightly improves the binding; in Fig. 19 we compare two distributions: one represents the values of ΔG computed in all trajectories belonging to the apt data set, where the aptamer is free. The other is the distribution of ΔG values obtained in all trajectories where the aptamer is immobilized on a NP, and no secondary contacts are observed (the white dots of Fig. 18). The mean values are −90 kcal mol−1 and −104 kcal mol−1, respectively, but the clear bimodality of the second distribution suggests that further distinctions could be pursued in this case.
On the other hand, it is very clear that the presence of secondary contacts strengthens the interaction between the functionalized NP and the protein, as shown in Fig. 20. That this effect is actually due to secondary contacts is confirmed by the Pearson correlation coefficient −0.74 between the estimated ΔG and the mean number of secondary contacts per frame (the color scale in Fig. 18).
At the molecular level, our simulations demonstrate that the Tr-WNV-37 aptamer retains strong and specific interactions with the E protein upon immobilization, primarily through residue Ade 38 engaging ARG 388 in domain III, with secondary contacts involving domain II. This multi-point binding mechanism suggests a stable and cooperative recognition mode, which could enhance target capture efficiency and selectivity in practical sensing configurations.
The development of ad hoc force-field parameters for aptamer-functionalized AuNPs ensured accurate representation of hybrid bio-nano interfaces, establishing a transferable computational framework for future studies. Together, these results identify nanoparticle size, aptamer surface density, and ionic environment as key design variables for tuning sensor performance and reliability.
This work represents a critical step toward designing an aptamer-based diagnostic platform for West Nile virus (WNV) detection, to be validated via Surface Plasmon Resonance (SPR) measurements. Recent studies, such as those by Petronella et al.,66,67 have shown the sensitivity of plasmonic nanostructures for label-free pathogen detection. These molecular-level details, difficult to access experimentally, will be directly compared with forthcoming SPR data to ensure strong correspondence between theory and experiment. Ongoing work focuses on deriving coarse-grained (CG) models from our atomistic simulations, following established methodologies for protein–nanoparticle systems,68,69 to investigate aptamer–E protein recognition and selectivity under realistic, crowded in vitro conditions.
Overall, our work demonstrates that gold nanocluster size, aptamer flexibility and surface coverage all play critical roles in modulating interaction dynamics with protein targets. The results highlight key design parameters for optimizing aptamer-functionalized nanomaterials in biosensing platforms. This integrated computational approach provides a powerful framework for predicting and rationalizing aptamer-target interactions and can be extended to other viral or protein biomarkers to guide the development of next-generation diagnostic nanodevices.
| This journal is © The Royal Society of Chemistry 2026 |