Judith
Escrig
,
Juan Carlos
Gil-Redondo
,
Alejandro
Valbuena
* and
Mauricio G.
Mateu
*
Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain. E-mail: mauricio.garcia@uam.es; mgarcia@cbm.csic.es; avalbuena@cbm.csic.es
First published on 18th August 2025
Protein nanoarrays and other protein-based nanostructures are being developed for many biomedical and technological applications. However, many protein assemblies are prone to disruption under load, and susceptible to material fatigue. Acquisition of a fundamental knowledge on the relationship between structure and mechanical properties of protein assemblies may guide the engineering of protein nanostructures with a higher strength and resistance to fatigue. The capsid protein (CA) of the human immunodeficiency virus can self-assemble into a single molecule-thick, flexible nanoarray that can coat large surfaces. In this study, a genetic engineering strategy was used to individually remove in the CA protein array the side chain of 7 amino acid residues per subunit involved in intermolecular interactions. The effects of eliminating those side chains, and their intermolecular interactions, on the equilibrium dynamics, stiffness, strength, and resistance to fatigue of the CA nanoarray were quantified using atomic force microscopy. The results revealed that removal of different types of attractive intermolecular interactions (van der Waals contacts, hydrogen bonds, and/or ionic bonds), increased the conformational flexibility of the protein array, and decreased its stiffness and strength. In contrast, removal of a naturally occurring, repulsive charge–charge intermolecular interaction in the nanoarray actually led to dramatic increases in strength against point loads (by ∼60%) and resistance to fatigue (by ∼50%), without increasing its stiffness. These findings suggest a general genetic strategy to increase the intrinsic strength and resistance to fatigue of nanostructured protein materials, based on the optimization of the ionic interactions between subunits.
Protein-based nanostructures are endowed with a number of features that make them potentially advantageous relative to other materials for many biomedical or industrial applications.7 Their strengths include: (i) spontaneous and efficient self-assembly and disassembly under controlled conditions; (ii) monodispersity and well-defined composition and architecture; (iii) inbuilt chemical and biological functionalities (e.g., a high affinity and specificity for a particular ligand molecule, or programmed entry into a particular cell or cell compartment); (iv) propensity to undergo controlled conformational rearrangements; (v) adequacy for engineering structural or functional modifications to suit a particular application, using chemical approaches and/or genetic strategies (site-directed mutagenesis or directed evolution); (vi) reproducible and economical large-scale production; (vii) biocompatibility, absence of toxicity, and biodegradability.
However, protein-based materials may also present some serious weaknesses for many intended applications. In particular, many protein assemblies are “soft” materials,8 held together by weak intermolecular non-covalent interactions and, thus, prone to disruption by point loads or mechanically induced material fatigue.9 Material fatigue is the most common cause of failure of many materials,10 and different protein assemblies appear indeed to be highly sensitive to fatigue, elicited even by weak forces.9,11–13 From a practical standpoint, resistance to compression and tension stress and material fatigue should be taken into account, for example, when designing protein-based biocompatible coatings of implants for cell tissue regeneration, as the implant will be repeatedly bent and stretched.14 Some natural protein assemblies do show a very high mechanical strength. They include fibrous (1D) proteins such as fibroin, keratin, or collagen, but also 3D complexes such as some bacteriophage capsids.9 These and other naturally evolved protein assemblies provided the first indication that protein complexes can be made stronger through genetic mutation. While some natural proteins may be mechanically strong enough, they may not possess the adequate architecture or functionality required for the intended applications in biomedicine or the industry. These and other considerations encourage the acquisition of the fundamental knowledge needed to guide the engineering of natural and artificial protein-based nanostructures with high enough mechanical strength and resistance to fatigue.15,16
Protein-based 2D nanostructures in particular are being developed for many uses, including functionalized, biocompatible coatings of inert materials for cell tissue regeneration, antifouling coatings, molecular nanosieves with pores of a precisely defined size, and biosensors.17–21 Natural protein-based 2D nanostructures include S-layers, which are biologically relevant envelopes of certain microorganisms.22 An example of artificial protein-based 2D nanostructures is provided by computationally designed nanosheets self-assembled from engineered protein STM4215 hexamers from S. typhimurium.23 The capsid protein (CA) of human immunodeficiency virus type 1 (HIV-1) can readily and efficiently self-assemble onto an inorganic substrate as a single-molecule thick, 2D nanostructure that can coat very large surfaces24 (Fig. 1). This extended nanocoating is arranged as a regular, hexameric protein lattice that reproduces the fundamental CA lattice of the authentic HIV-1 capsid,25 but without the pentameric “defects” that allow, in the maturing virion, the closure of the lattice into a cone-shaped nanoparticle. The CA-based protein monolayer constitutes an interesting addition to the still limited, but rapidly growing number of available 2D protein nanostructures.
![]() | ||
Fig. 1 The nanostructured CA protein lattice. (A) Ribbon model of the CA protein monomer (PDB ID: 4XFX). The NTD and the CTD are respectively colored orange and blue. The 7 amino acid residues chosen for mutational analysis are depicted as space-filling models, labelled, and color coded (red, green, and purple, residues that respectively participate in NTD-NTD, NTD-CTD and CTD-CTD interfaces) (B) atomic structure of the CA protein lattice (PDB ID: 4XFX). Ovals identify NTD-NTD (red), NTD-CTD (green), and CTD-CTD (purple) interfaces. (C) High-resolution AFM image of the CA protein lattice. The scale bar is 10 nm long. |
There are few studies on the molecular determinants of the mechanical properties of protein nanostructures in general,9 and of 2D protein nanoarrays in particular. Regarding HIV-1, Dr Itay Rousso and colleagues found that immature virions are stiffer than mature virions, revealing that softening of the virion is required for infection.26 The differences in stiffness were due to the effect of the C-terminal domain of the Env protein, which is embedded in the viral lipid envelope and does not form a part of the virus capsid. No comparison of mechanical properties between the immature, spherical retroviral capsid made of the Gag polyprotein and the cone-shaped mature capsid made of CA is available. While both Gag-based and CA-based 2D nanoarrays could provide interesting protein-based materials, conditions for assembly have been found for the CA-based array only. Our previous mechanical analyses have shown that the natural CA protein 2D lattice is highly sensitive to mechanical disruption and material fatigue,13,24 as also observed for other protein-based assemblies.9,11,12 The CA protein lattice provides, thus, an excellent model system to investigate, at the atomic level, the structural determinants of the mechanical properties of protein-based nanostructures and, more specifically, 2D protein arrays. It also provides a model to investigate different strategies to engineer the mechanical properties of a protein assembly. The knowledge acquired may help guiding the development by rational engineering of modified protein-based materials with increased mechanical strength and resistance to fatigue.
The HIV-1 CA protein monomer consists of a N-terminal domain (NTD) and a C-terminal domain (CTD) connected by a flexible peptide segment (Fig. 1A). In the hexagonal CA lattice, CA monomers are organized as hexamers in which the monomers are bound through NTD-NTD and NTD-CTD interfaces, with each hexamer bound to the 6 neighbor hexamers through CTD-CTD interfaces (Fig. 1B).25 The CA monolayer (Fig. 1C) interacts with a negatively charged (mica) substrate through charge–charge interactions, and its structural elements are relatively free to deform and undergo lateral movements on the solid substrate.13,24
The starting hypothesis for the present study was that the mechanical stiffness, strength and resistance to fatigue of a 2D nanostructured material made of identical protein subunits may largely depend on the number and types of intermolecular interactions established between its subunits. We have individually removed by genetic means, following a protein engineering approach, the side chain (beyond the Cβ) of 7 amino acid residues of each CA monomer that are involved in different intermolecular interactions in the HIV-1 CA protein lattice. The effects of removing those side chains, and the inter-protein interactions they establish in the CA lattice, on the equilibrium dynamics, stiffness, strength, and resistance to material fatigue of this 2D nanostructure were quantified using atomic force microscopy (AFM).
y = y0 + D exp(−(2(a − ai)/ab)) | (1) |
SA = SA0 + (SAF − SA0)/(1 + (t50/t)n) | (2) |
The ratio between the time required, under the conditions of the experiment, for disruption of half the indented mutant monolayer and half the indented wt monolayer, tmutant50/twt50, was taken as a measurement of the resistance to material fatigue of the mutant CA relative to non-mutated CAwt.
CA mutant | Intermolecular interactions established by the original residuea | Conservation in HIV-1 of the original residueb (%) | ||
---|---|---|---|---|
H-bond | vdW (C–C) | Ionic | ||
a The interactions of different types in the atomic CA lattice structure (PDB 4XFX,39) that involve the side chain of the original residue (beyond Cβ), and that would be removed by mutation to alanine, were identified using the WHATIF software. The number of van der Waals (vdW) interactions between carbon atoms that contribute to protein–protein association through the hydrophobic effect are indicated in parenthesis. The E180A substitution actually increased the CTD-CTD dimerization affinity because the E180 of both monomers are involved in a mutual charge–charge repulsion.27 Cutoff distances considered are 3.8 Å between electronegative atoms for hydrogen bonds (H-bonds); the sum of the vdW atomic radii plus 0.5 Å for vdW interactions; and 5 Å for ionic (charge–charge) interactions. b Percent conservation of each residue among the HIV-1 variants whose CA sequences were reported within the last 10 years. Values refer to absolute conservation, except those labelled with an asterisk, which correspond to chemical conservation (S or T for 178; E or D for 180). | ||||
P38A | 3 (2) | 100% | ||
E45A | 1 | 2 | 1 attractive | 99% |
Q63A | 1 | 1 | 100% | |
V165A | 2 (2) | 99% | ||
S178A | 1 | 2 | 99% (*) | |
E180A | 1 | 7 | 1 repulsive | 97% (*) |
Q192A | 3 (1) | 99% |
The strategy followed was based on the individual and specific removal, through site-directed mutagenesis to alanine, of the targeted amino acid side chains (beyond the Cβ), without introducing any other chemical group. In this way, the intermolecular interactions established by each tested side chain would be removed without introducing other interactions, and with a minimal probability of substantially distorting the protein main chain conformation.
The CA protein mutants and the non-mutated CAwt protein control were expressed, purified and used to build two-dimensional, one molecule-thick, flat protein assemblies that reproduce the hexameric CA lattice in the authentic HIV-1 capsid.24,25 The CA monolayers were allowed to self-assemble in a physiological buffer (phosphate-buffered saline, PBS, pH = 7.4) at ambient temperature (25 °C) onto a suitable solid substrate (exfoliated mica).24 The assembled monolayers were imaged by AFM to assess their supramolecular architecture and check their structural integrity. All 7 self-assembled mutant CA monolayers showed the exact same hexagonal protein lattice as the CAwt control (Fig. 1C). The monolayers virtually covered 100% of the substrate surface, and no lattice defects were discerned. The intact CA monolayers, always in PBS at room temperature, were then subjected to mechanical analysis as described below.
![]() | ||
Fig. 2 k s CA mutant/ks CAwt ratios for engineered CA monolayers. The reference ks value determined for CAwt was ks = 0.37 ± 0.01 N m−1. The standard error values are indicated. |
CA protein | N |
k
s![]() |
p
k
s![]() |
---|---|---|---|
a Number of Fz traces analyzed. b Average value ± standard error. Standard errors actually ranged between 0.005 and 0.009 and have been rounded to two decimal values. c p-Value obtained in a Student t-test for the difference in ks between each mutant CA and CAwt using α = 0.05. | |||
CAwt | 100 | 0.37 ± 0.01 | — |
P38A | 83 | 0.31 ± 0.01 | 3.97 × 10−7 |
E45A | 73 | 0.32 ± 0.01 | 1.13 × 10−9 |
Q63A | 81 | 0.30 ± 0.01 | 9.86 × 10−5 |
V165A | 60 | 0.32 ± 0.01 | 4.19 × 10−3 |
S178A | 79 | 0.33 ± 0.01 | 2.01 × 10−3 |
E180A | 78 | 0.36 ± 0.01 | 0.284 |
Q192A | 63 | 0.30 ± 0.01 | 2.94 × 10−9 |
All 7 tested amino acid substitutions in CA led to a decrease in the stiffness of the protein material (as revealed by a lower ks value). This decrease ranged from a nearly negligible effect (3%) for E180A, up to a substantial one (23%) for Q63A and Q192A. These results indicate that removal of different types of intermolecular interactions at any of the different intermolecular interfaces can decrease the stiffness of this nanostructured protein material.
The distances between many pairs of neighbor hexamers in each mutant CA lattice and the CAwt control were determined from high-resolution AFM images of the corresponding protein monolayer.24 Depending on the CA variant, between 249 and 528 distance measurements were obtained. From this data set, the equilibrium (average) distance between the centers of neighbor hexamers (the lattice parameter ai) was determined for each mutant CA lattice, and compared to the ai value obtained for the CAwt lattice control in the same set of experiments (Fig. 3A). The ai values for the different mutants ranged between 2% lower and 2% higher than the non-mutated control. Thus, none of the mutations had a significant effect on the equilibrium distance between CA hexamers in the protein lattice, whose architecture remained unaltered.
Then, the data set of distance values between hexamer pairs for each CA variant was fitted to a Gaussian distribution as described in Experimental. The width of the distance distribution, or breathing amplitude Ab, was determined.24 The ratio between the Ab value for the CA mutant and the Ab value for CAwt was then obtained as a parameter related to the effect of each tested mutation on the lateral flexibility of the CA lattice at equilibrium24 (Fig. 3B).
As observed for the decreased stiffness determined by indentation of the protein lattice, all 7 tested mutations led to an increase in the breathing amplitude of the lattice, albeit to different extents. The range of increased breathing amplitude Ab for the ensemble of mutants (4%–21%) was similar to the range of decreased stiffness (3%–23%). The precision of these measurements is not enough to assess whether there is a quantitative correlation between a decrease in ks and an increase in Ab. However, the results do indicate that removal of different types of intermolecular interactions decreases the stiffness along the vertical dimension to fairly comparable extents than it increases the equilibrium conformational flexibility along the horizontal dimension of this protein material.
Depending on the CA variant, between 44 and 86 Fz traces were analyzed. In each case, the Fr1 and Fr2 values obtained were separately averaged, and the ratio between the average Fr1 or Fr2 value for the CA mutant and the corresponding average Fr1 or Fr2 value for the CAwt control was obtained (Table 3 and Fig. 4).
CA protein | N |
F
r1![]() |
p
F
r1
![]() |
F
r2![]() |
p
F
r2![]() |
---|---|---|---|---|---|
a Number of Fz traces analyzed. b Average value ± standard error. c p-Value obtained in a Student t-test for the difference in Fr between each mutant CA and CAwt using α = 0.05. | |||||
CAwt | 86 | 0.33 ± 0.01 | — | 0.63 ± 0.01 | — |
P38A | 44 | 0.30 ± 0.01 | 2.51 × 10−2 | 0.58 ± 0.01 | 8.20 × 10−3 |
E45A | 37 | 0.28 ± 0.01 | 7.96 × 10−6 | 0.58 ± 0.01 | 2.69 × 10−3 |
Q63A | 53 | 0.22 ± 0.01 | 1.18 × 10−17 | 0.47 ± 0.01 | 1.45 × 10−15 |
V165A | 47 | 0.30 ± 0.01 | 9.77 × 10−3 | 0.57 ± 0.01 | 7.11 × 10−5 |
S178A | 64 | 0.28 ± 0.01 | 7.83 × 10−6 | 0.53 ± 0.01 | 1.13 × 10−10 |
E180A | 84 | 0.53 ± 0.02 | 6.79 × 10−16 | 1.03 ± 0.03 | 1.01 × 10−21 |
Q192A | 81 | 0.36 ± 0.01 | 4.13 × 10−2 | 0.69 ± 0.01 | 5.46 × 10−5 |
The effects of each tested amino acid substitution on the Fr1 value were very similar to the effects on the Fr2 value. Most (5 out of 7) tested mutations led to a decrease in the strength of the protein material, albeit to significantly different extents. Depending on the CA mutant, the Fr2 value was between 8% and 25% lower than the wt control. In contrast, the two remaining substitutions led to significant increases in the mechanical strength of the CA monolayer, 10% for Q192A, and as much as 63% for E180A. This constitutes a particularly revealing result, as the E180A and Q192A mutations were previously shown to increase, instead of decrease, the association affinity between CA subunits. Moreover, an experimental thermodynamic double mutant cycle had shown that the E180 residue is involved in charge–charge repulsions between CA subunits, which would be eliminated by the E180A mutation.27 The Q192A mutation may also alleviate some CA-CA repulsions.27 These results indicate that removal of different types of attractive intermolecular interactions in any of the different interprotein interfaces can decrease the mechanical strength; and that, conversely, the removal of repulsive charge–charge intermolecular interactions can actually increase the mechanical strength of this nanostructured protein material.
The resistance of the engineered E180A CA protein monolayer to material fatigue was compared to that of the CAwt monolayer control by scanning 200 × 200 nm areas of the monolayers, with a maximum applied force set to 40 pN, and an indentation frequency of 0.7 min−1. The percent monolayer surface area subjected to fatigue that remained assembled as a function of time (or number of indentations) (Fig. 5) was determined using the Flooding tool included in the WSxM software.28 The ratio between the time required, under the conditions of the experiment, for disruption of half the indented mutant monolayer and that needed for disruption of half the indented wt monolayer, tmut50/twt50, was determined. This parameter was used for a quantitative measurement of the resistance to material fatigue of the mutant CA lattice relative to non-mutated CAwt lattice.30
Seven independent experiments were performed with E180A monolayers and CAwt monolayer controls. Fig. 5A shows images of the two monolayers taken at different times during a representative experiment. The CA lattice surface area remaining as a function of time in three representative experiments is shown in Fig. 5B. When compared to the CAwt monolayer, the E180A mutant monolayer showed, in each independent experiment, a substantially increased resistance against disruption by material fatigue (by 44–76%). The moderate variation in the absolute t50 values between experiments were likely due, in part, to a difference in the number of undetectable, minor initial defects in the individual monolayers analyzed. The tmut50/twt50 ratio (average plus standard deviation), obtained as described in Experimental, are indicated in Fig. 5C. On average, the engineered E180 monolayer was 55% (±15%) more resistant to fatigue than the wt monolayer.
A second series of 4 experiments was performed under similar conditions to those used for the first series of experiments described above. This first and second series of experiments yielded very similar results (averaged increase in resistance to fatigue 55% and 40%, respectively). The ensemble of results show that mutation E180A in CA, which removes a repulsive charge–charge interaction between subunits in the CA monolayer, increases to a remarkable extent (by ∼50%) the resistance to fatigue of this nanostructured protein material.
However, some other results were less predictable. H-bonds are directional, whereas vdW interactions, including “hydrophobic” carbon–carbon contacts, are not. Thus, a higher stiffness decrease could be predicted if H-bonds are removed instead of vdW contacts. In fact, amino acid substitutions E45A, Q63A, and S178A removed one hydrogen bond each (plus 1–2 vdW contacts) between pairs of protein subunits, but they decreased the stiffness of the array to a similar extent than P38A, Q165A, and Q192A, which removed 2–3 vdW contacts only. Another remarkable result is that substantial (up to 23%) stiffness decreases were achieved by removing few, relatively weak non-covalent interactions between each protein pair in the array (one H-bond and/or 1–2 C–C hydrophobic contacts, together with 1–2 additional vdW interactions). Also, the decrease in stiffness of the CA monolayer by any tested mutation, as determined by indentation along the perpendicular to the protein lattice plane, was accompanied by an increase in the conformational flexibility of the monolayer along the lattice plane.
The above results are in line with those we obtained when studying the effect of amino acid substitutions on the stiffness and conformational dynamics of small, spherical nanoparticles from HIV-1-unrelated viruses.40,41 The available evidence indicates that removal of even a few interactions in a viral protein-made nanostructure, irrespective of the type(s) of interaction removed, may elicit quite subtle conformational rearrangements. Such rearrangements may increase the conformational dynamics of the particle at equilibrium, leading to a corresponding decrease in stiffness as determined under a point load.16
It must be emphasized that those results were obtained with natural protein nanostructures that form the capsids of different viruses and have been, thus, subjected to selective pressures and biological evolution. There is experimental evidence that the inter-related changes in stiffness, conformational flexibility and propensity for conformational rearrangements of viral capsids have been optimized through structural changes elicited by mutation to provide a selective advantage for virus survival.9,16 It remains to be investigated whether the individual removal of intersubunit interactions in artificial protein arrays, whose structure and physical properties have not been subjected to biological evolution, would lead to changes in stiffness similar to the ones determined in this study.
No correlation, however, was observed between the decrease in stiffness and the decrease in mechanical strength of the monolayer as a consequence of the different amino acid substitutions tested and intermolecular interactions removed. Whereas all mutant CAs formed arrays of decreased stiffness relative to the CAwt control, a much higher variability was observed regarding their relative mechanical strength. In particular, CA mutants E180A and Q192A self-assembled into protein arrays with the same architecture, but showed a higher mechanical strength than the CAwt control. We had already observed this lack of correlation between changes in stiffness and strength for unrelated, small spherical virus nanoparticles.42 These two mechanical properties of protein nanostructures rely on different physico-chemical foundations.16 Thus, although for most tested mutations a decrease in stiffness of this nanostructured array was accompanied by a decrease in its strength, these two properties are not intrinsically linked in this or other protein-based materials.
An experimental, rigorous thermodynamic double mutant cycle revealed a mutual coulombic repulsion between the negatively charged carboxylates of the E180 residues at the CTD-CTD interface between CA monomers.27 The E180A mutation eliminates the E180 carboxylates and their mutual repulsion, increasing the free energy of CA-CA association through the CTD-CTD interface.38 As a consequence, a higher mechanical force could be expected to be required for disruption of the CA array, as experimentally observed in the present study. The Q192A mutation also increased the free energy of association between CA monomers through the CTD-CTD interfaces. This mutation could create a cavity at the CTD-CTD interface, leading to a local conformational rearrangement that would increase the distance between the positively charged residues R154 of one monomer and K199 of the other monomer, thus reducing their mutual repulsion.38 Consistent with these observations, the Q192A mutation, like the E180A mutation, also increased the strength of the CA array, albeit to a lower extent. To sum up, the increased mechanical strength of the E180A and Q192A mutant arrays was associated to an increased chemical affinity between CA subunits and could be traced, at least in the E180A case, to the removal of repulsive interactions between subunits in the CA protein lattice.
The mutual coulombic repulsion between E180 residues of neighbor subunits in the CA protein lattice would favor the enlargement of the intermolecular “cracks” being created under cyclic load, leading to a relatively high susceptibility to material fatigue. The absence of those repulsive interactions in the E180A mutant lattice would facilitate the recovery of some of the intersubunit interactions being disrupted under cyclic load, leading to the observed, substantially increased resistance of this engineered protein nanostructured material to fatigue.
The results of this study are relevant also from an applied perspective. We have recently shown that the mechanical strength and resistance to fatigue of the CA-based nanostructured coating can be increased by specific binding of some small organic molecules, including antiviral or proviral agents.30 However, from a practical standpoint it may be clearly unfeasible to rely on the non-covalent binding of a small molecule to increase the mechanical resistance of a biomaterial. Irreversible modification of the material through chemical reactions (e.g., by covalent crosslinking) may also present serious issues, such as batch-to-batch reproducibility, increased production costs, and complex quality control. In another recent study, we showed that genetic introduction through cysteine substitutions of a “chain mail” based on covalent disulfide bonds between protein subunits led to a CA-based nanostructured coating that showed increased mechanical strength and outstanding resistance to material fatigue.43 Still, while not an expected serious issue, preservation of the chain mail requires the control of redox conditions for maintaining an oxidizing environment. In contrast, introduction of a genetically-encoded structural modification in mutant E180A results in an intrinsically strong protein material that could be reproducibly obtained, eliminating batch-to-batch variability and simplifying production and quality control. The E180A CA protein, like the non-mutated CA, can be readily obtained in large quantities and has no tendency to form unspecific aggregates even at very high concentrations, far superior than those needed for its assembly. In addition, the 2D nanosheet can fully cover large substrate areas provided enough protein is used. This study provides proof-of-concept for an alternative genetic strategy to the introduction of disulfide bonds to increase the intrinsic mechanical strength and resistance to fatigue of the same nanostructured protein material. This strategy was based on the optimization of the ionic interactions between subunits, especially through the removal of any coulombic repulsions present in the starting material.
This journal is © The Royal Society of Chemistry 2025 |