Ruichao Maoa,
Han Zhanga,
Lihua Biea,
Lu-Ning Liu*bc and
Jun Gao*a
aHubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China. E-mail: gaojun@mail.hzau.edu.cn
bInstitute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK. E-mail: luning.liu@liverpool.ac.uk
cFrontiers Science Center for Deep Ocean Multispheres and Earth System, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
First published on 27th February 2023
Protein–protein interface interactions dictate efficient excitation energy transfer from light-harvesting antennas to the photosystem II (PSII) core. In this work, we construct a 1.2 million atom-scale model of plant C2S2-type PSII-LHCII supercomplex and perform microsecond-scale molecular dynamics (MD) simulations to explore the interactions and assembly mechanisms of the sizeable PSII-LHCII supercomplex. We optimize the nonbonding interactions of the PSII-LHCII cryo-EM structure using microsecond-scale MD simulations. Binding free energy calculations with component decompositions reveal that hydrophobic interactions predominantly drive antenna–core association and the antenna–antenna interactions are relatively weak. Despite the positive electrostatic interaction energies, hydrogen bonds and salt bridges mainly provide directional or anchoring forces for interface binding. Analysis of the roles of small intrinsic subunits of PSII suggests that LHCII and CP26 first interact with small intrinsic subunits and then bind to the core proteins, whereas CP29 adopts a one-step binding process to the PSII core without the assistance of other factors. Our study provides insights into the molecular underpinnings of the self-organization and regulation of plant PSII-LHCII. It lays the framework for deciphering the general assembly principles of photosynthetic supercomplexes and possibly other macromolecular structures. The finding also has implications for repurposing photosynthetic systems to enhance photosynthesis.
Photosystem II (PSII) is the only enzyme that catalyzes water splitting in oxygenic photosynthesis and has been a primary target in designing biomimetic photosynthetic systems.4 Our knowledge about the assembly principles of the PSII core has been greatly improved by studies on high-resolution structures (up to 1.9 Å) and functions.5–7 Moreover, substantial developments have been made recently in PSII-based hybrid systems and the utilization of PSII assemblies for photocatalytic water splitting.3,8,9 For example, a hybrid natural-artificial photosynthetic platform was constructed by wiring a PSII core and a silicon photoelectrochemical cell, which could perform the water-splitting process under solar irradiation.8
Plant PSII dimeric core associates with membrane-spinning light-harvesting antenna complexes (LHCII) to form PSII-LHCII supercomplexes. The dynamic interactions between LHCII and PSII have been the major challenge in the high-resolution structural characterization of PSII-LHCII supercomplexes.10 This has been drastically improved by the technological advances of cryo-electron microscopy (cryo-EM), demonstrating the power in revolving large biomolecular assemblies.11–18 However, the resolved 3D structures reflect the static states of multiprotein supercomplexes under specific experimental conditions.19 They could not delineate dynamic protein associations and the assembly process of photosynthetic PSII-LHCII supercomplexes. Indeed, cryo-EM structures have suggested a higher degree of flexibility of atomic positions at the antenna-PSII core interface and dynamic associations of LHCII with the PSII core,11 which are fundamental for state transitions and PSII repair.20–22 Moreover, PSII-LHCII undertakes a stepwise assembly pathway, and the attachment of LHCII to the dimeric PSII core appears to occur at the final step of PSII-LHCII assembly.23 Some protein subunits, including PsbW, PsbZ, and PsbH, were suggested to be involved in the antenna-PSII core assembly.11,23–26 The detailed protein–protein interactions and assembly mechanisms that govern the association of LHCII with the PSII core remain elusive.
Molecular dynamics (MD) simulations provide a powerful toolkit to investigate in silico the dynamics of biomolecular assemblies with atomic resolutions at a time scale ranging from femtoseconds to milliseconds.19,27 Large-scale MD simulations have been applied in studying the structures and functions of photosynthetic macromolecular complexes and organelles at different scales.17–19,28–30 Here, we construct a million-atom scale model of plant C2S2-type PSII-LHCII supercomplex embedded in the solvated membrane, based on the cryo-EM structure,11 and perform microsecond-scale MD simulations to study the protein interactions and assembly process of the sizeable PSII-LHCII supercomplex in a near cellular environment. Our results indicate the general binding principles and atomistic details of the PSII-LHCII assembly.
Histidine residues in the system were singly protonated on Nε, except those coordinating to non-heme FE, HEM or CLA via Nε (and were thus protonated on Nδ). In addition, the two disulfide bonds contained in each monomer (C112–C135 of PsbO and C17–C26 of PsbTn) were explicitly considered. The AMBER ff14SB force field parameter set34 was selected for standard amino acids residues. For the ten cofactors (PHO, BCR, PL9, LHG, SQD, LMG, DGD, LUT, XAT, NEX) (full names are shown in Table S1†), the generalized Amber force field (GAFF) parameter set was adopted.35 The parameters of CLA and CHL were taken from those developed by Ceccarelli et al. for bacteriochlorophyll a.36 The parameters of HEM were taken from the AMBER parameter database.37 The parameters of POPC were taken from previous work.38 For the non-heme Fe clusters, we obtained the parameters using the MCPB.py program.39 The atomic charges of the cofactors were determined by fitting the electrostatic potential around these molecules by using the RESP model.40 The atomic charges of the OEX complexes were assigned according to the redox states of its atoms in the dark-adapted (S1) state as follows: Mn1–Mn2, +3; Mn3–Mn4, +4; O1–O5, −2, Ca, +2. According to Ogata et al.,41 the equilibrium bond lengths, bond angles, and torsion angles were set as the average value of OEX in the two monomers in the cryo-EM structure. The force constants for bond lengths, bond angles, and torsion angles were appropriately set at 1500 kcal mol−1 Å−2, 500 kcal mol−1 Å−2, and 100 kcal mol−1 Å−2, respectively, to maintain the configuration of OEX during the simulations.
The MM/PBSA method calculates the binding free energy of the complex (ΔGbind) by eqn (1). In eqn (1), −TΔS is the contribution of entropy to the system's free energy, which mainly involves the energy changes caused by conformational changes such as translation and rotation of the protein. Given that there is no significant change in protein subunits conformation, and the analysis mainly focuses on the electrostatic and hydrophobic contributions of the system rather than the absolute binding free energy, the contribution of the entropy (−TΔS) was not calculated as that in recent literature.48–50 ΔGgas is the sum of the electrostatic interaction energy ΔGelec and the van der Waals interaction energy ΔGvdW in vacuum (eqn (2)), representing the contribution of the molecular potential energy. The solvation energy includes the electrostatic solvation energy ΔGPB and the non-polar solvation energy ΔGnp (eqn (3)).
ΔGbind = ΔGgas + ΔGsol − TΔS | (1) |
ΔGgas = ΔGelec + ΔGvdW | (2) |
ΔGsol = ΔGPB + ΔGnp | (3) |
Gnp = γ × SASA + b | (4) |
For the solvated PSII-LHCII complexes with lipid bilayers, the binding free energy of the protein–protein interfaces was calculated using the MM/PBSA approach with the MMPBSA.py module in AmberTools20. For the implicit membrane model, the heterogeneous dielectric membrane model is used here because it describes the membrane environment more accurately than the single dielectric membrane model.47 The spline fitting was adopted, and the implicit membrane thickness was obtained by calculating the average explicit membrane thickness of the last 800 ns trajectory. This was accomplished by calculating the location of the center of mass of the N and P atoms in the phosphatidylcholine headgroups at the top of the membrane, performing the same calculation on the bottom of the membrane, and taking the difference between them. Periodic boundary conditions were used here so that the value of the electrostatic solvation energy ΔGPB is always 0, as it is part of the vacuum electrostatic interaction energy ΔGelec. Other PB settings are consistent with Greene et al.47 The internal dielectric constant settings for protein in the implicit membrane model and the implicit solvent model will be discussed in the next section. ΔGnp is calculated by eqn (4), where the values of constant γ and constant b are set to 0.005 kcal mol−1 Å−2 and 0.92 kcal mol−1, respectively. Solvent accessible surface area (SASA) was calculated using a water probe radius of 1.4 Å.
The free energy contributions of protein residues could be divided into polar (ΔGpolar) and nonpolar (ΔGnonpolar) interactions according to eqn (5) where each part is the sum of two energy terms, as shown in eqn (6) and (7). In the following analysis, ΔGpolar is considered as the contribution of electrostatic interactions, while ΔGnonpolar is considered as the contribution of hydrophobic interactions.48,49
ΔGbind = ΔGpolar + ΔGnonpolar | (5) |
ΔGpolar = ΔGelec + ΔGPB | (6) |
ΔGnonpolar = ΔGvdW + ΔGnp | (7) |
According to the previous literature,51 the hot spots are identified as those whose absolute free energy value is larger than 1.5 kcal mol−1. 8000 snapshots were extracted at intervals of 100 ps along the trajectory. It should be mentioned that since the PSII-LHCII complex is a dimer, the binding free energy values are discussed using the average of the two monomers.
The conformations from the 1 μs MD production run were compared to the cryo-EM structure to check the quality of MD simulations. The average structure was obtained over a 1 μs trajectory, and the snapshot with the smallest RMSD value from the average structure was adopted as the representative structure. We found that the representative structure has only slight differences with the cryo-EM structure (Fig. S2†). First, Ramachandran plot analysis from PROCHECK53 revealed that the MD structure has a wider conformational distribution space and most of the backbone in the allowed regions of Ramachandran plot, compared with the cryo-EM structure (Fig. 1B and C). The number of residues in disallowed regions was reduced from 0.6% to 0.2%, and the number in generously allowed regions was reduced from 0.6% to 0.3% (Table 1). The number of residues in the most favored regions was similar (88.1% for MD structure and 88.3% for cryo-EM structure). These results confirmed the good quality of the cryo-EM structure and indicated that MD simulations only slightly improved the stereochemistry of the cryo-EM structure. Then, we examined the quality of nonbonded interactions using ERRAT program.54 The structures were evaluated based on the quality factor, which depends on the statistics of nonbonded atomic interactions in the 3-D protein structure. The results showed that the overall quality factor of the MD structure (90.2%) was greater than that of the cryo-EM structure (79.8%) (Table 1), indicating that MD simulations did improve the quality of nonbonded interactions from the PSII-LHCII cryo-EM structure. Since the antenna–core binding mainly depends on nonbonded interactions between the protein interfaces, the improvement in the accuracy of nonbonded interactions here is very important for the subsequent accurate calculation of protein interface affinities.
Structure | Ramachandran plot statistics (%) | ERRAT (%) | |||
---|---|---|---|---|---|
Most favored | Additionally allowed | Generously allowed | Disallowed | Overall quality factor | |
Cryo-EM | 88.3 | 10.5 | 0.6 | 0.6 | 79.8 |
MD | 88.1 | 11.4 | 0.3 | 0.2 | 90.2 |
The RMSD values of the peripheral antennas LHCII, CP29, and CP26 (ranging from 1.50 Å to 3.07 Å, Fig. 1D) were generally more significant than those of the core proteins (ranging from 0.58 Å to 2.31 Å, Fig. 1E), suggesting that the light-harvesting antennas possess larger conformational fluctuations than the PSII core subunits, consistent with the experimentally observed high mobility of antenna proteins.10 To further characterize the fluctuation features of protein residues, we investigated root mean square fluctuation (RMSF) of the protein backbone Cα atoms around their average positions. The volatility of extrinsic subunits (average RMSF value of 1.91 Å, Fig. 2A) and antenna proteins (average RMSF value of 1.71 Å, Fig. 2B) was markedly higher than that of the core complex (average RMSF value of 1.16 Å, Fig. 2B), consistent with RMSD results (Fig. 1). In addition, the fluctuation of pigments was strongly correlated with the fluctuation of binding proteins (Fig. S4†). For individual antennas, CP26 has a notably higher RMSF value (average value: 2.22 Å) than LHCII monomer adjacent to the core (LHCII(A), average value: 1.36 Å) and CP29 (average value: 1.45 Å). The greater fluctuation was mainly ascribed to the loop regions of CP26, which is away from the SCP26/core interface; likewise, the fluctuating regions of CP29 and LHCII(A) are also not at the antenna–core interfaces (Fig. 2C–E). Consequently, although the peripheral antennas showed overall conformational fluctuations, SLHCII/core, SCP26/core, and SCP29/core exhibited relatively stable conformations (Fig. 2B), presumably favoring the association of antennas with the PSII core and energy transfer. The analysis also verified the reliability of MD simulations using the constructed structural model.
Based on the heterogeneous implicit membrane model, free energies of the three antenna–core protein interfaces were obtained from equilibrium trajectories (i.e., 200–1000 ns) (Table 2). To verify the convergence of the free energy calculations, we performed convergence tests for SLHCII/core (−58.0 ± 7.2 kcal mol−1, −57.0 ± 7.9 kcal mol−1, −59.4 ± 7.2 kcal mol−1, −56.4 ± 13.6 kcal mol−1), SCP26/core (2.3 ± 5.5 kcal mol−1, 1.5 ± 5.6 kcal mol−1, 2.0 ± 5.6 kcal mol−1, 3.2 ± 6.4 kcal mol−1) and SCP29/core (−111.6 ± 9.6 kcal mol−1, −112.7 ± 9.7 kcal mol−1, −116.1 ± 9.4 kcal mol−1, −116.1 ± 9.8 kcal mol−1) using trajectories of 800–1000 ns, 600–1000 ns, 400–1000 ns and 200–1000 ns, respectively. In addition, non-equilibrium trajectories (0–50 ns) were also used to calculate the binding free energies of SLHCII/core (−2.7 ± 14.8 kcal mol−1), SCP26/core (12.2 ± 5.1 kcal mol−1) and SCP29/core (−84.5 ± 8.7 kcal mol−1), the results are clearly different from the above calculations using equilibrium trajectories, indicating that a long simulation scale is necessary for accurate free energy calculations of the antenna–core interface.
Interface | ΔGvdW | ΔGnp | ΔGeleca | ΔGnonpolarb | ΔGpolarc | ΔGbindd |
---|---|---|---|---|---|---|
a Includes the electrostatic solvation energy, see methods.b Formula (6).c Formula (7).d Formula (5). | ||||||
SLHCII/core | −103.5 ± 12.5 | −11.4 ± 1.0 | 58.5 ± 2.5 | −114.9 | 58.5 | −56.4 ± 13.6 |
SCP26/core | −46.5 ± 5.3 | −6.2 ± 0.5 | 55.9 ± 3.0 | −52.7 | 55.9 | 3.2 ± 6.4 |
SCP29/core | −167.4 ± 8.5 | −18.5 ± 0.7 | 69.8 ± 3.8 | −185.9 | 69.8 | −116.1 ± 9.8 |
SCP29/core (−116.1 kcal mol−1) has the largest total free energy among the three interfaces, which is double of SLHCII/core (−56.4 kcal mol−1), indicating the strong binding affinity at SCP29/core. The electrostatic interaction energies (ΔGelec) of SLHCII/core, SCP26/core, and SCP29/core were 58.5 kcal mol−1, 55.9 kcal mol−1 and 69.8 kcal mol−1. In contrast, their hydrophobic interaction energies (ΔGnp) were −114.9 kcal mol−1, −52.7 kcal mol−1, and −185.9 kcal mol−1, respectively. As described in Methods (eqn (6) and (7)), the binding free energies can be divided into ΔGpolar and ΔGnonpolar (hydrophobic interaction). Our analysis suggested that ΔGnonpolar provides the dominant contribution to the binding of the antenna to the PSII core. Meanwhile, the ΔGpolar values of all interfaces are positive, indicating that polar or electrostatic interactions between the antenna and PSII provide repulsive forces unfavorable for binding. This conclusion was also supported by the calculation using the implicit water model (Table S3†). In contrast, the binding free energies of the two antenna–antenna interfaces, SLHCII/CP26 and SLHCII/CP29, were both positive values (Table S4†), indicative of the very weak antenna–antenna association (Table 2).
For SLHCII/core, 23 amino acids were identified as hot spots (Fig. 3E and Table S5†). Among them, 20 hot spots had negative free energy values, and their total free energy contribution was −42.22 kcal mol−1, accounting for 65% of the total binding free energies (−64.5 kcal mol−1). Three residues in LHCII, E175, K179, and D215, have positive free energy contributions. All ΔGnonpolar of residues are negative, but ΔGpolar of 13 residues are positive. For SCP26/core, only 5 amino acids were identified as hot spots (Fig. 3E and Table S5†), consistent with the relatively low interface affinity (Table 2). Interestingly, ΔGpolar of most of the residues are positive (Fig. 3E and Table S5†). A large number of hot spots at SCP29/core (Fig. 3J and Table S6†), compared with those at SLHCII/core and SCP26/core, is consistent with its higher interface affinity (Table 2). 26 of the 29 hot spots contribute negative free energies. Their total free energy contribution was −79.54 kcal mol−1, accounting for 68% of the total binding free energy. In contrast, 19 hot spots have positive ΔGpolar contribution. The free energy decomposition indicated that the ΔGpolar contribution (i.e. electrostatic interactions) is primarily positive, representing unfavorable interactions, consistent with our free energy analysis (Table 2). It should be noted that the residues with hydrogen bonds may contribute negative ΔGpolar; we will discuss it in the next section.
Fig. 3 Binding free energy and corresponding component contribution of hot spots on SLHCII/core, SCP26/core and SCP29/core and their localization. (A and B) Localization of hot spots on LHCII and the core complex (at the SLHCII/core), respectively. (C and D) Localization of hot spots on CP26 and the core complex (at the SCP26/core), respectively. The protein surface map colored according to the free energy contribution of each hot spot. The yellow dotted lines in panels (B) and (D) correspond to the positions of the small intrinsic subunits PsbH and PsbZ, respectively. The unit of energy is kcal mol−1. (E) Binding free energy and corresponding component contribution of hot spots, including hydrophobic (red), electrostatic (blue) and total (yellow) contribution. Color scheme is presented as the same as shown in Fig. 1A. The dotted box lines with different colors represent different subunits, which are CP43 (tan), PsbW (purple), LHCII (lime), PsbZ (silver) and CP26 (pink). Hot spots that participate in the formation of hydrogen bonds or salt bridges are marked with *. See Table S5† for more details. (F and H) Localization of CP29 and corresponding hot spots. (G and I) Are the localization of core complex (at the SCP29/core) and corresponding hot spots. The yellow dotted lines in panel (i) correspond to the positions of the small intrinsic subunits PsbH. (J) Binding free energy and corresponding component contribution of hot spots. The dotted box lines with different colors represent different subunits, which are CP47 (ochre), D1 (ice blue), PsbH (mauve), PsbL (green), and CP29 (cyan), and the colors correspond to Fig. 1A. See Table S6† for more details. |
We also determined the spatial distribution of hot spots. At SLHCII/core, 12 hot spots are located in LHCII, including 4 hot spots (L164, L166, E175, K179) at the stromal side and 8 hot spots (F81, L113, L85, V90, G89, L213, P216, D215) at the luminal side. Eleven hot spots are located in the core (Fig. 3A and B), including 3 hot spots (L127, G126, and S125) at the stromal side and 8 near the luminal side. At SCP26/core, 3 hot spots are located at CP26, and 1 hot spot is in the core complex, all of which are near the luminal side or the stromal side (Fig. 3C and D). At SCP29/core, all hot spots are concentrated at the stromal side: 16 hot spots were localized in the core complex, and 13 were localized at CP29 (Fig. 3F–I). Moreover, the hot spots in the core complex are widely distributed in multiple protein chains: 7 of the 16 hot spots are located at PsbH, 4 hot spots are at CP47, 3 hot spots are at D1, and the remaining one is localized at PsbL (Fig. 3I). At CP29, all the 13 hot spots are localized at Motif II (Pro42–Phe87) of the N-terminal long loop chain, implying its role of CP29–core complex binding. Collectively, the hot spots are mainly distributed at the luminal or stromal side, indicating that the main driving force in these interfaces' binding originates from the luminal or stromal side.
Hydrogen bond | Distance (Å) | Occupancy (%) | PSII subunits | Cryo-EM structuresa | ||
---|---|---|---|---|---|---|
Donor | Acceptor | |||||
a Interfacial HBs/SBs present in the cryo-EM structure of spinach (PDB ID: 3JCU, 3.2 Å), pea (PDB ID: 5XNL, 2.7 Å) and arabidopsis (PDB ID: 7OUI, 2.79 Å). | ||||||
SLHCII/core | VAL_90@N | ASN_103@Oδ | 2.96 ± 0.16 | 64 | PsbW | |
ASN_88@Nδ | PRO_97@O | 2.88 ± 0.14 | 38 | PsbW | ||
ASN_103@Nδ | LEU_113@O | 2.92 ± 0.15 | 95 | PsbW | ||
SER_205@N | ALA_214@O | 2.92 ± 0.15 | 66 | CP43 | ||
ASN_103@Nδ | VAL_90@O | 2.94 ± 0.15 | 88 | PsbW | ||
GLY_126@N | GLU_175@Oε | 3.03 ± 0.19 | 88 | PsbW | ||
TRP_107@Nε | TYR_112@Oη | 3.14 ± 0.17 | 32 | PsbW | ||
SER_101@N | ASN_88@O | 2.99 ± 0.16 | 37 | PsbW | ||
SER_125@Oγ | GLU_175@Oε | 2.64 ± 0.11 | 34 | PsbW | ||
SCP26/core | ARG_32@Nη | SER_143@O | 2.90 ± 0.17 | 37 | CP43 | |
TRP_33@Nε | GLY_38@O | 2.92 ± 0.16 | 61 | PsbZ | 5XNL, 7OUI | |
SER_59@Oγ | LEU_231@O | 2.78 ± 0.17 | 31 | PsbZ | 3JCU, 5XNL | |
LYS_37@Nζ | LEU_39@O | 2.88 ± 0.15 | 40 | PsbZ | 3JCU, 5XNL |
Hydrogen bond | Distance (Å) | Occupancy (%) | Proteins in PSII | Cryo-EM structuresa | |
---|---|---|---|---|---|
Donor | Acceptor | ||||
a Interfacial HBs/SBs also present in the cryo-EM structure of spinach (PDB ID: 3JCU, 3.2 Å), pea (PDB ID: 5XNL, 2.7 Å) and arabidopsis (PDB ID: 7OUI, 2.79 Å). | |||||
TYR_48@N | GLY_31@O | 2.94 ± 0.14 | 99 | PsbH | 3JCU, 5XNL |
ASN_56@Nδ | GLU_8@O | 2.92 ± 0.15 | 96 | PsbL | |
GLN_47@Nε | TYR_30@O | 2.92 ± 0.16 | 81 | PsbH | 3JCU |
GLN_55@N | ASN_7@Oδ | 2.99 ± 0.18 | 81 | PsbL | |
LEU_46@N | VAL_33@O | 2.94 ± 0.14 | 87 | PsbH | 5XNL, 7OUI |
TYR_50@Oη | SER_132@Oγ | 2.85 ± 0.16 | 59 | CP47 | |
ASN_56@Nδ | ASN_5@O | 2.84 ± 0.12 | 56 | PsbL | |
ASN_56@N | PRO_6@O | 2.98 ± 0.16 | 48 | PsbL | |
GLN_55@Nε | THR_10@O | 3.03 ± 0.19 | 34 | CP47 | 3JCU, 5XNL, 7OUI |
ASN_60@Nδ | ARG_476@O | 2.90 ± 0.14 | 50 | CP47 | 5XNL |
GLN_55@Nε | ASN_230@Oδ | 2.97 ± 0.17 | 45 | D1 | 5XNL, 7OUI |
ASN_56@Nδ | PRO_6@O | 2.92 ± 0.16 | 33 | PsbL | |
SER_84@Oγ | GLU_29@Oε | 2.68 ± 0.13 | 32 | PsbH | |
VAL_33@N | LEU_46@O | 2.91 ± 0.14 | 99 | PsbH | 7OUI |
ASN_230@Nδ | ASN_56@Oδ | 2.90 ± 0.15 | 95 | D1 | 5XNL, 7OUI |
GLU_229@N | LEU_57@O | 3.02 ± 0.17 | 83 | D1 | 5XNL |
GLN_9@Nε | ASN_56@Oδ | 2.90 ± 0.14 | 88 | PsbL | 5XNL |
ASN_230@N | GLN_55@O | 3.03 ± 0.16 | 59 | D1 | 3JCU, 7OUI |
GLN_223@Nε | TYR_48@Oη | 3.07 ± 0.18 | 48 | CP47 | |
ASN_14@Nδ | LEU_53@O | 2.97 ± 0.17 | 40 | CP47 | 7OUI |
ARG_230@Nη | ASN_60@Oδ | 2.94 ± 0.16 | 77 | CP47 | |
ARG_230@Nη | ASP_51@Oδ | 2.80 ± 0.11 | 48 | CP47 | |
LYS_137@Nζ | GLN_81@Oε | 2.84 ± 0.13 | 86 | CP47 | |
ARG_476@Nη | LEU_53@O | 2.98 ± 0.18 | 30 | CP47 | 3JCU |
ARG_230@Nε | ASP_51@Oδ | 3.08 ± 0.18 | 30 | CP47 |
Salt bridge | Distance (Å) | Occupancy (%) | Proteins in PSII | Cryo-EM structuresa | |
---|---|---|---|---|---|
Acidic | Basic | ||||
GLU_85 | LYS_32 | 3.80 ± 0.45 | 95 | PsbH | 3JCU, 7OUI |
ASP_49 | LYS_130 | 5.34 ± 0.39 | 55 | CP47 | 3JCU |
GLU_71 | ARG_127 | 4.32 ± 0.93 | 43 | CP47 | |
ASP_51 | ARG_230 | 4.47 ± 0.29 | 71 | CP47 | |
ASP_477 | LYS_59 | 5.35 ± 0.33 | 39 | CP47 | 3JCU, 7OUI |
ASP_483 | LYS_59 | 4.07 ± 1.23 | 79 | CP47 | 3JCU, 5XNL |
The total numbers and spatial locations of HBs/SBs found at SLHCII/core, SCP26/core, and SCP29/core are consistent with those of the hot spots identified. The numbers of HBs/SBs are 25/6 for SCP29/core, 9/0 for SLHCII/core, and 4/2 for SCP26/core (Table 3). Eight of the 9 HBs at SLHCII/core are mediated by PsbW, and 7 are distributed at the luminal side (Fig. 4A and Table 3). Three of the 4 HBs and all SBs at SCP26/core are mediated by PsbZ, and 3 HBs/SBs are located at the stromal side (Fig. 4B and Table 3). All the 25 HBs and 6 SBs at SCP29/core are distributed at the stromal side and are mediated by Motif II of CP29. Residues of Motif II form HBs/SBs with multiple protein subunits of the PSII core (PsbH: 5/1; PsbL: 6/0; CP47: 10/5; D1: 4/0) (Fig. 4C and Table 4).
Fig. 4 Hydrogen bond and salt bridge interactions at SLHCII/core (a), SCP26/core (b), and SCP29/core (c). Hydrogen bonds and salt bridges are represented by yellow and orange dashed lines, respectively. Colour scheme is presented as the same as shown in Fig. 1A. |
The differences between average binding free energies of the HB/SB-involved hot spots and all hot spots were −0.38 kcal mol−1 for SLHCII/core, −0.04 kcal mol−1 for SCP26/core, and −0.66 kcal mol−1 for SCP29/core (Fig. S5†), which accounted for 26%, 1.6%, and 27% of the average binding free energies of all hot spots, respectively. This suggests that the formation of HBs/SBs predominately modulates hot spots' binding free energy contribution and enhances their binding ability. Moreover, the change in binding free energies was dominated by the reduction in electrostatic interaction energies (Fig. S5†). At SLHCII/core, 8 of the 23 hot spots were involved in HB formation (Fig. 3A–E, residues marked with *), and 5 hot spots, including V90 and L113 (LHCII), S125 and G126 (PsbW), and S205 (CP43), provide negative electrostatic interaction energies. At SCP26/core, 3 of the 5 hot spots (W33, L39, L231) were involved in HB formation and W33 contributes negative electrostatic interaction energies (Fig. 3C–E). At SCP29/core, 19 of the 29 hot spots were involved in HB/SB formation (Fig. 3J, residues marked with *), and 8 hot spots, including N230 (D1), R127 and R230 (CP47), V33 (PsbH), Q9 (PsbL), as well as L46, N56, and K59 (CP29), contribute negative electrostatic interaction energy. Overall, our results revealed that HBs/SBs are favorable for the binding of the interfaces, which is consistent to a certain extent with the previous study60 that indicated that electrostatic interactions were the main driving force to mediate the binding of antennae to the PSII core. However, our analysis showed that hydrogen bonds contribute only about 20% of the binding energies (Fig. S5†). In contrast, most of the binding energies come from ΔGnonpolar (hydrophobic interaction). It is likely that hydrogen bonds mainly provide directional interactions and the anchoring of the interface.
Fig. 5 Binding free energy changes caused by the deletion of key subunits and a schematic diagram of the inferred assembly process. (A) Display of the small intrinsic subunits PsbW, PsbZ, PsbH, and Motif II of the N-terminal loop region of CP29, the colors of protein subunits correspond to Fig. 1a, and the Motif II is shown in yellow. (B) Affinity changes of the antenna–core interface in the absence of PsbW, PsbZ, PsbH and Motif II. |
Most of the hot spots and residues involved in HB/SB formation of CP29 were identified on Motif II of CP29, which represents a long loop region (Pro42–Phe87) and is responsible for binding to the core proteins.11 In the absence of Motif II, the binding free energy increased from −116.1 ± 9.8 kcal mol−1 to 56.6 ± 4.1 (Fig. 5B), implicating that CP29 can not bind to core proteins without Motif II. Moreover, CP29 interacts with multiple subunits including PsbH, CP47, D1, and PsbL. This differs from LHCII and CP26, which interact with core proteins via small intrinsic subunits, i.e., PsbW and PsbZ. The absence of PsbH only led to a relatively small percentage (32.9%) of binding free energy lose, suggesting that the function of PsbH in binding with CP29 and the core may not be as significant as expected.
In our study, the binding free energy calculations with corresponding component decompositions revealed that the antenna–core binding is a competitive process between electrostatic energy and hydrophobic energy, in which the electrostatic interactions provide mainly repulsive energy. In contrast, the hydrophobic contributions serve as the main driving force to facilitate antenna–core association (Fig. 3). This may imply a common mechanism underlying the protein–protein interactions and assembly of membrane macromolecular complexes, in which hydrophobic interactions play an essential role. In addition, compared to the cryo-EM structure, MD simulations reveal the HBs/SBs interactions at the antenna–core interface more comprehensively (Fig. 4). Electrostatic interaction energy analysis further indicated that the formation of HBs/SBs enhances the binding affinity of hot spots and is thought to play a positioning role during antenna–core assembly.
Fig. 6 Schematic diagram of the antenna–core assembly process of the C2S2-type PSII-LHCII. LHCII is represented in green, CP26 in pink, CP29 in blue, PsbW in purple, and PsbZ in gray, these colors correspond to Fig. 1A. Motif II of CP29 is shown with a yellow coil. Brown and maroon show dimerized PSII cores. Arrows with numbers represent potential assembly steps. |
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ra08240c |
This journal is © The Royal Society of Chemistry 2023 |