Open Access Article
Newton A. Ihoeghian
,
Usman L. Abass,
Ibrahim Imam and
Qing Shao
*
Department of Chemical & Material Engineering, University of Kentucky, Lexington, KY 40506, USA. E-mail: qshao@uky.edu
First published on 24th December 2025
Apolipoprotein E (APOE) plays a significant role in determining the risk of Alzheimer's disease (AD). Three mutations—APOE3–R136S, APOE3–V236E, and APOE4–R251G—have been reported to reduce the risk of AD. Unveiling the molecular mechanism behind this reduction could lay a foundation for developing therapeutics for AD. To shed light on this subject, we investigate the mutation-induced variation in structural and dynamic properties of APOE3–R136S, APOE3–V236E, and APOE4–R251G in explicit solvent using molecular dynamics simulations. The APOE2, APOE3, and APOE4 were used as the reference. The analysis unveiled that the three protective mutations may exert protection through different mechanisms. The R215G mutation makes the flexibility of APOE4 proteins more similar to that of APOE2 and APOE3. In addition, this mutation reduced the exposure area of the oligomerization region by 5–16%. Such a reduction could alleviate the aggregation tendencies of the APOE proteins with amyloid-forming peptides. On the other hand, the R136S and V236 mutations alter the exposure area of the hydrophobic amino acid residues in the lipidation region. Their protective mechanisms may be due to the alteration in the lipidation capability of the APOE3 protein.
Studies have identified rare protective APOE mutations that could reduce the risk of AD.6,7 The APOE3-Jacksonville (V236E), APOE3-Christchurch (R136S), and APOE4–R251G variants are among the discovered protective mutations.6,8–10 Large-scale genetic studies have confirmed that V236E and R251G are associated with a 2- to 3-fold decrease in AD risk.6 The V236E carriers show a 10.5-year delay in AD onset, and R251G carriers exhibit a 6-year delay.6 Notably, R251G is always inherited with APOE4, while V236E is always inherited with APOE3, suggesting that their protective effects may stem from isoform-specific conformational modifications.8
Unveiling the mechanism behind these protective mutations is crucial for understanding their protective. Furthermore, the unveiled mechanism could guide the development of therapeutics mimicking these mutations. Protein functionality depends on its structure and dynamics.2,11,12 Thus, mutation-induced variations in protein structure and dynamics of APOE could be responsible for protection. One APOE protein consists of three domains: an N-terminal domain (RES 1–167), which mediates LDLR binding; a hinge region (RES 168–205), which provides flexibility; and a C-terminal domain (RES 206–299), which facilitates lipid binding.2,11 Nuclear magnetic resonance (NMR) studies show that the C-terminal domain folds against the N-terminal domain in lipid-free APOE, influencing its structural stability and receptor interactions.13 APOE4 adopts a conformation that enhances interdomain interactions, which may increase its aggregation propensity and impair lipid transport.5,14
Knowledge gaps still exist regarding the three protective variants APOE3-Jacksonville (V236E), APOE3-Christchurch (R136S), and APOE4–R251G, particularly in terms of their conformational flexibility and domain interactions with APOE.1,6,9,10,15–19 Molecular dynamics (MD) simulations are a powerful tool for understanding the conformation and dynamics of proteins and their function at the atomic level. MD simulations have provided valuable insights into the molecular mechanisms underlying APOE functions, including lipid binding20 and isoform-dependent effects on AD risk.21 Most of these simulations focus on APOE2, APOE3, and APOE4, three widely occurring variants. The MD simulations of Williams et al.22 revealed an APOE4-specific misfolded intermediate that may disrupt lipid transport and protein aggregation in AD. Henry et al.23 used cross-linking mass spectrometry and MD simulations to model lipidated APOE4. Their findings suggest an activation mechanism involving structural rearrangements for receptor recognition. The computer simulations of Gursky and his team24 showed that APOE undergoes conformational changes to bind amyloid-β fibrils. Mamchur et al.25 used MD simulations to show that the APOE2 isoform retained C-terminal structure while APOE4 showed unwinding at RES 260–266. Stuchell-Brereton et al.26 combined single-molecule spectroscopy and MD simulations to study monomeric APOE4. They found that APOE4 exhibits more disordered structures than previously known, maintaining conformational heterogeneity even when lipid-bound. These studies iterate that a small change in amino acid sequence may result in significant variation in the structure and dynamics of the APOE protein, which could alter its functions. Thus, it is crucial to unveil how the conformation and dynamics of APOE proteins may alter upon the three protective mutations.
The goal of this work is to explore the variation in APOE structure and dynamics induced by the three protective mutations: APOE3–R136S, APOE3–V236E and APOE4–R251G mutations using MD simulations. The three widely occurring isoforms, APOE2, APOE3 and APOE4 will be used as the reference. The analysis will focus on the lipidation and aggregation (oligomerization) domains of the APOE protein. The lipidation domain regulates the ability of APOE to bind to lipid nanoparticles and perform its function to deliver lipids to the desired place. The aggregation domain regulates the tendency of APOE to self-assemble into a dimer. This work will not consider the properties of the receptor binding domain because the receptor binding usually occurs after the APOE protein binds to the lipid nanoparticle. The rest of the paper will be organized as follows: Section 2 will detail the deployed model and simulation procedure. Section 3 will focus on the results and discussions. Section 4 will present the conclusion based on the results and discussions.
The lipidation region (RES 192–272)30,31 is responsible for lipid binding. The amphipathic helices of APOE interact with lipid surfaces.32 Efficient lipid redistribution is vital for the metabolic function of APOE in both circulation and the brain. The oligomerization region (RES 248–299)30,33 is responsible for APOE to oligomerize (aggregate). The lipidation and oligomerization regions overlap on RES 248–272.12 To facilitate the analysis, we divide RES 192–299 into three regions: lipidation-only: RES 192–247; overlapping: RES 248–272, the oligomerization-only: RES 273–299 (Fig. 1a).
The simulation system was built by surrounding the selected APOE protein with a 3.0-nm water molecule shell. This gave a water-shelled protein of volume 9877.47 nm3 and dimensions x − 26.90 nm, y − 19.30 nm, and z − 19.03 nm when placed in a box (Fig. 1b). This water-molecule shell is a balance between the accuracy and efficiency for simulating protein systems. First, it ensures that the APOE protein is immersed in an aqueous reservoir, a widely accepted model setting for proteins in the human body. Second, this water-molecule shell helps accelerate the MD simulation by reducing the number of water molecules in the system. The TIP3P water model34 was used to represent water molecules. Appropriate counterions Na+ ions were introduced to neutralize the net charge of the simulation box.35
The AMBER14 force field with the OL15 modification36 was used to describe the nonbond and bond interactions in the simulation systems. This field has been validated for accurately reproducing the structural and dynamic properties of proteins.37 Non-bonded interactions were calculated using a combination of short-range Lennard-Jones 12-6 potential for van der Waals interactions and long-range Coulombic potential for electrostatic interactions, as represented in eqn (1):
![]() | (1) |
APOE variants were parameterized with the AMBER ff14SB protein force field (as distributed in amber14sb_OL15.ff in GROMACS). ff14SB provides updated protein backbone and side-chain torsions relative to ff99SB and is widely used for protein simulations.36 Each protein was centered in a cubic periodic box and solvated with a spherical water shell of 3.0 nm thickness. The solvation step produces a droplet of explicit water around the protein within a larger periodic box; periodic boundaries were applied in x, y, and z directions for all runs. Using a solvent shell rather than filling the entire box minimizes system size while preserving the first hydration layers of the solute; this approach and related stochastic/spherical boundary methods are established in the literature for biomolecular MD when bulk solvent is not essential,41,42 and more recent implementations in QM/MM shells.43 Water molecules were taken from the AMBER-compatible TIP3P model for consistency with ff14SB (Amber water models overview). The systems were neutralized by replacing solvent with counterions, which adds the minimum number of ions needed for electroneutrality. Before dynamics, each system was minimized by steepest descent (tolerance emtol = 10 kJ mol−1 nm−1, step size emstep = 0.01 nm) to remove bad contacts. All dynamics were conducted in the canonical (NVT) ensemble because the systems are solvent droplets in vacuum. Temperature was controlled with the stochastic velocity-rescaling thermostat (v-rescale) at 310 K with a relaxation time of 1.0 ps.44 After minimization, we performed a 300 ns NVT equilibration to ensure the system was well equilibrated. This was followed by NVT production (up to 1 µs per replica).45–49 Coordinates were saved every 1 ns for analysis (nstxtcout = 500
000 at dt = 0.002 ps).
We used a 2 fs timestep (dt = 0.002 ps) with all covalent bonds involving hydrogen atoms were constrained. The stability and accuracy of v-rescale constraints are established Bussi et al.44 and LINCS efficiently maintains constraints at this timestep.50 Recent protein simulations at 310 K commonly use 2 fs with constraints—for instance, AD-relevant TREM2/ApoE systems (neighbor cutoff 1.4 nm; 2-fs timestep).45 Short-range Lennard-Jones (LJ) and real-space Coulomb interactions used 1.4 nm cutoffs with the Verlet cutoff scheme. The neighbor list was updated every 40 steps (80 fs). A long-range analytical dispersion correction was applied to account for truncated LJ tails, per GROMACS recommendations (long-range vdW corrections). The PME method was used for long-range electrostatics with 4th-order B-spline interpolation (pme-order = 4) and grid spacing 0.12 nm, consistent with standard, well-validated PME settings51 and SPME refinements.52 The 1.4-nm neighbor cutoff and 2-fs timestep are also common in recent AD-related protein MD.45
![]() | ||
| Fig. 2 RMSF of the backbone Cα atoms in the lipidation-only (RES 192–247), overlapping (RES 248–272), and oligomerization-only (RES 273–299) regions. The APOE4-specific peak at 248–260 lies within the Mahley et al. canonical lipid-binding segment (244–272).57 Peak attenuation by V236E and R251G mirrors reported increases in lipidation and reduced self-aggregation.6,17 | ||
The R136S, V236E and R251G mutations of the APOE proteins attenuate the differences from APOE2 and APOE3 in the lipidation-only region. The lipidation-only region shows two differences in flexibility among APOE2, APOE3 and APOE4. The first is around RES 192–202: both APOE2 and APOE3 RMSF curves exhibit a peak with a height of 0.5–0.6 nm, while the APOE4 curve presents a peak of 0.4 nm. Frieden et al. pointed that APOE3 (as determined by Chen et al.20) presents large intrinsic disordered region (IDRs) around RES 210–223 (particularly 199–209 and 224–239).2 From Fig. 2, we identify that APOE3 presents a flexibility up to 0.57 nm compared to 0.4 nm of APOE4 around RES 192–202. The APOE4 flexibility difference (0.17 nm) relative to APOE3 amount to 26.5% reduction (RMSD = 0.129). This implies that APOE4 may possess fewer IDRs within this segment. Frieden et al. highlighted the importance of IDRs. IDRs allow helices and structured portions of proteins to move relative to one another. The extent of IDR flexibility determines protein function.20 The APOE3 mutations R136S and V236E exhibit RMSF peaks of 0.44 nm and 0.64 nm, respectively. APOE3–R136S reduces the flexibility of APOE3 (12.72% reduction) and the APOE3–V236E variant shows a pronounced increase in flexibility (12.11% increase) around RES 192–202, reaching levels similar to APOE2 with rs (rp) of 0.910 (0.964). The APOE4 R251G mutation shows a flexibility peak of ∼0.52 nm. APOE4–R251G exhibits disrupted correlation with APOE4 (rp = 0.004, rs = −0.082) and introduces a mean flexibility increase of 0.114 nm (33.0% increase, RMSD = 0.139). The mutation shifts the flexibility of APOE4 toward APOE3 levels within this segment. Limited flexibility in APOE4 aligns with stronger inter-domain coupling. APOE4 coupling restricts motions near the IDR-modulated interface. The interface enables lipid-induced domain opening. Restoring flexibility toward APOE2/APOE3 levels by mutation supports the established domain-opening mechanism for lipid binding.12,20 The second is around RES 240–247: both APOE2 and APOE3 present RMSF values around 0.20–0.28 nm, while the RMSF of APOE4 presents a value around 0.36 nm. APOE4 shows moderate negative correlation with APOE3 (rp = −0.416, rs = −0.31) with a mean flexibility increase of 0.064 nm (24.2% increase, RMSD = 0.071). The APOE4 is more flexible than APOE2 and APOE3 in RES 240–247 with a mean flexibility increase of 0.119 nm (56.66% increase, RMSD = 0.119) and 0.064 nm (24.15% increase, RMSD = 0.071) respectively. The APOE3–R136S and APOE3–V235E curves exhibit distinct RMSF. R136S shows strong negative correlation with APOE3 (rp (rs) = −0.858 (−0.833) with a mean flexibility increase of 0.105 nm (39.7% increase, RMSD = 0.113). R136S shows similarity to APOE4 (0.32–0.38 nm) flexibility, while V236E exhibits a similar flexibility range (0.20–0.28 nm) as APOE2 and APOE3. V236E exhibits moderate positive correlation with APOE3 (rp (rs) = 0.436 (0.619)) with a mean flexibility decrease of 0.021 nm (8.0% reduction, RMSD = 0.03). A similar effect was observed with R251G mutation of APOE4. APOE4–R251G shows weak negative correlation with APOE4 (rp = −0.029, rs = −0.071) with a mean flexibility decrease of 0.077 nm (23.5% reduction, RMSD = 0.079). The RMSF curves exhibit 0.20–0.25 nm values. Thus, the V236E and R251G mutations make these proteins more rigid as APOE2 and APOE3 protein around RES 240–247.
The R251G mutation in APOE4 does not alter protein flexibility in the oligomerization-only region (RES 273–299). The APOE2, APOE3, and APOE4 proteins present three distinct peaks around RES 290: 0.28–0.35 nm for APOE2, 0.60 nm for APOE3, and 0.50 for APOE4. Both mutations (R136S and V236E) slightly reduce the flexibility of APOE3 (9.69% and 9.99% reduction respectively), particularly around RES 282–295. The R251G mutation introduce only minimal changes to APOE4 flexibility in the oligomerization-only region (with a mean flexibility increase of 0.022 nm accounting for only 5.8% increase). Chou et al. showed that the C-terminal domain (CTD) (192–271) promotes APOE self-association without marked isoform preference.58 Though they reported higher aggregation in APOE4, our flexibility plot do not provide sufficient information on distinct aggregation tendencies.
The R251G mutation of the APOE4 protein reduces the differences from APOE2 and APOE3 in the overlapping region. The APOE4 presents a 0.5-nm peak around RES 248–260, while APOE2 and APOE3 do not present such a peak in their RMSF curves (this APOE4 peak account for mean 72.81% and 75.94% increase over APOE2 and APOE3 respectively). This difference in RMSF curves illustrates that the APOE4 protein is more flexible than the APOE2 and APOE3 proteins for RES 248–260. The APOE3–V236E and APOE4–R251G RSMF curves do not present a peak similar to the APOE4 one. Instead, their RMSF curves show patterns similar to those of APOE2 and APOE3 (with rp (rs) of 0.958 (0.956) for APOE3_V236E vs. APOE2, 0.969 (0.918) for APOE3_V236E vs. APOE3 and 0.797 (0.736) for APOE4_R251G vs. APOE2 0.888 (0.824) for APOE4_R251G vs. APOE3. APOE4 peak flexibility for RES 248–260 falls within the canonical lipid-binding segment (244–272) defined by Huang and co-workers.57 Helices in this region mediate lipid binding across apolipoproteins.57 Bu had earlier postulated that a structural change in APOE4 by R251G mutation reduces its harmful effect or enhances physiological functions.11 Insights from our study support this position of structural change in APOE4 by R251G and its consequent impact on the binding tendencies, which is more physiological in function. Secondly, the RMSF curves of the APOE3–R136S and APOE3–V236E exhibit distinct peaks of 0.53 nm and 0.7 nm around RES 265. This variation in the flexibility within the overlapping region could imply a less rigid structure. Whereas the APOE2 protein and R251G mutation maintained a similar flexibility within this segment of the overlapping region. This observation is supported with rp (rs) of 0.784 (0.762) and Δμ of 0.028 for APOE4_R251G vs. APOE2.
Several reports have identified RES 261–272 as critical lipid-binding sites.2,12,59 The overlapping-region also participates in APOE oligomerization.60 IDR or high flexibility within this region could favor self-aggregation. Fig. 2 shows that from RES 240–260, APOE4 maintain a higher flexibility than APOE2/3. Higher APOE4 flexibility supports increased aggregation tendencies. On the reverse, APOE2/3 present a gradual flexibility increases from RES 261–272. RES 261–272 represents a segment critical to lipid-binding.2,12,59 Also, higher flexibility could encourage exposure of hydrophobic surfaces, thus improved solubilization.23,57,61,62 The observed trend (high flexibility at RES 262–272) could explain why V236E mutation of APOE3 reduces self-aggregation as reported by Bu and colleagues.17
APOE3–R136S maintained a similar flexibility with APOE4 from RES 238–260; however, it presents higher flexibility than APOE2/3/V236E/R251G within the critical lipid-binding segment. Only comparable to V236E flexibility. This observation still supports enhanced lipid-binding despite earlier similar flexibility trend to APOE4.
Flexibility differences between APOE2, APOE3 and APOE4 concentrate in the lipidation and overlapping regions. Specific mutations, particularly R251G, can restore APOE4 flexibility toward a protective APOE2/3-like state. The lipidation and overlapping regions play a central role in APOE isoform functional differences.
![]() | ||
| Fig. 3 Residue-wise secondary-structure fractions for lipidation-only region for the six proteins (a) APOE2, (b) APOE3, (c) APOE4, (d) APOE3–R136S, (e) APOE3–V236E (f) APOE4–R251G. | ||
Secondly, RES 232–240 exhibit additional differences. APOE4 maintains a random-coil conformation in this region, whereas R136E, V236E, and R251G mutants adopt helical structures. This pattern is similar to the helices observed in APOE2. The change from a random coil to helices indicates that these mutations partially restore structural features found in the other common isoforms. This coil to helix structural shift observed with the mutations could improve helical integrity and weaken self-aggregation.64 The mutations alter the secondary structure of APOE in the lipidation-only region.
The R136S, V236E, and R251G mutations cause changes in the secondary structure of RES 259–265 in the overlapping region of APOE3 and APOE4. Fig. 4 shows the secondary structure of the overlap region for APOE2, APOE3, APOE4, and the mutants APOE3–R136S, APOE3–V236E, and APOE4–R251G. In wild-type APOE4, residues in this segment predominantly adopt an α-helical structure, similar to APOE2 and APOE3. However, the mutations introduce structural changes in both APOE3 and APOE4 (see Fig. 4(d)–(f)). R136S mutation makes the helices adopt a coil around RES 252. RES 252 lies next to Glu255. Glu255 participates in N/C-terminal domain interaction that alters lipoprotein preference in APOE4.30,65 Structural changes near RES 252 may disrupt long-range domain coupling. Disrupted coupling could shift APOE toward lipidated states. In APOE3–V236E, residues around 264 and 270 transition from helices to turns and bends. Additionally, APOE4–R251G shifts residue 264 from a helix to a random coil. R251G mutation disrupts the stable helical conformation observed in APOE4. It increases local flexible structures in the overlapping region, shifting helices to coils. RES 261–272 has been reported to be critical to APOE lipid-binding sites.2,12,59 The structural changes from helices to coils presented by these mutation within these sites support higher flexibility in Fig. 2. The consequent effect of this higher flexibility is proposed to improve lipid-binding tendencies within this segment.12,20
![]() | ||
| Fig. 4 Residue-wise secondary-structure fractions for overlapping region for the six proteins (a) APOE2, (b) APOE03, (c) APOE4, (d) APOE3–R136S, (e) APOE3–V236E (f) APOE4–R251G. | ||
The R136S, V236E and R251G mutations introduce transitions in structures of the oligomerization-only region in APOE3 and APOE4. Fig. 5 presents the secondary structure of the oligomerization-only region for APOE2, APOE3, and APOE4, and APOE3–R136S, APOE3–V236E, and APOE4–R251G mutants. This segment is central to oligomerization. Truncating 273–299 yields monomeric APOE4.66,67 APOE4 shows a wider random-coil region around RES 288–299 compared to APOE2 and APOE3. This implies more flexibility compared to APOE2 and APOE3.12,26,58 As seen in Fig. 5(d) and (e), the APOE3 mutations (R136S and V236E) introduced more coils, turns and bends to APOE3 (RES 273–280). This suggests that the mutations alter the ordered structures of APOE3 to be more like APOE2 in the oligomerization-only region. In APOE4–R251G [Fig. 5(f)], RES 293–299 shift from a predominantly random-coil conformation to a mix of helices and extended structures. This structural change is particularly noticeable as it stabilizes into more ordered forms. These changes suggest the R251G mutation enhances more ordered structures in the oligomerization-only region of the APOE4 protein. It shifts random-coil to helices and extended forms.
![]() | ||
| Fig. 5 Residue-wise secondary-structure fractions for oligomerization-only region for the six proteins (a) APOE2, (b) APOE3, (c) APOE4, (d) APOE3–R136S, (e) APOE3–V236E (f) APOE4–R251G. | ||
![]() | ||
| Fig. 6 Probability and cumulative distributions of SASA in the lipidation region. (a) Total residues (b) hydrophobic residues (c) hydrophilic residues. | ||
The two protective mutations on APOE3 would not alter the total area of the lipidation domain on APOE3. As shown in Fig. 6(a), APOE2, APOE3, and APOE4 present distinct ranges for the lipidation region SASA: the APOE4 ranges from 52 to 67 nm2, a similar SASA distribution range to APOE2 (52–65 nm2) and APOE3 (55–62 nm2). The values of KS (0.21) and Cohen's d (0.05) indicate that the distributions of SASA of APOE3 and APOE4 are similar with a negligible effect size. APOE2 compared to APOE4 have a KS value of 0.27 and a Cohen's d of 0.43. The KS value shows the distributions are similar. The Cohen's d value indicates a moderate difference between the groups. This implies the distributions have significant overlap and variability within each group. In a surface plasmon resonance (SPR) experiment, Lund-Katz and co-workers demonstrated that APOE4 would need greater interfacial area for lipid binding that APOE3.68 Our study shows that APOE4 presents a similar SASA distribution to APOE3. This could mean, increasing the SASA of APOE4 may improve its lipidation. Furthermore, in Fig. 6(a), the lipidation domain on both APOE3–R136S and APOE3–V236E presents SASA in the range of 53–63 nm2, similar to that of APOE3. The values of KS (0.17 and 0.2) indicate that the lipidation domain on the three proteins also possesses similar distributions. The APOE4–R251G mutation reduce the SASA of APOE4 to the range of 52–62 nm2. The reduction by R251G is moderate as demonstrated by the KS (0.20) and Cohen's d (0.57) values (see S1 of the SI for a complete matrix of these statistical values). The density curves show the leftward shift for APOE4–R251G relative to APOE4, with no widening. This distribution agrees with hydrogen-exchange measurements that report lower helical stability and higher solvent accessibility in lipid-free APOE4 relative to APOE3.13 Furthermore, Stuchell-Brereton et al. reported that APOE4 presents a more disordered structure than initially reported.26 This implies larger, variable exposed surface than a single compact fold would.
The mutation effect on SASA emerges as we divide the functional regions (lipidation and oligomerization) into hydrophobic and hydrophilic segments. The hydrophobic segment is composed of ALA, GLY, ILE, LEU, MET, PHE, PRO, TRP, VAL, and TYR, and the hydrophilic segment contains HIS, TYR, TRP, LYS, ASP, GLU, ARG, CYS, GLN, SER, and THR. Soranno and his team identified isoform dependent solvent accessibility using hydrogen deuterium exchange mass spectrometry (HDX-MS) experiments.26 The APOE3–R136S and APOE3–V236E extends the hydrophobic SASA range of APOE3. Fig. 6(b) presents the exposure of hydrophobic residues in the lipidation region. In the figure, APOE4 presents a SASA range of 17–26 nm2, falling between APOE3 (15.5–25 nm2) and APOE2 (17–30 nm2). The KS test values (0.30 and 0.58) for APOE4 compared to APOE3 and APOE2, respectively, show moderate to greater differences. The Cohen's d values of 0.57 and 1.05 suggest medium to large effect sizes for APOE4 vs. APOE3 and APOE4 vs. APOE2. The APOE3–R136S (15–33 nm2) and APOE3–V236E (17–28 nm2) mutations extend the hydrophobic SASA range of APOE3. For APOE3–V236E, the KS value of 0.48 and Cohen's d of 0.34 indicate moderate difference with medium effect on APOE3. APOE3–R136S, with KS of 0.50 and Cohen's d of 0.73, shows significant difference and a medium-large effect size on APOE3 SASA distribution. These observations imply that APOE3–R136S mutation increased the hydrophobic SASA of APOE3 more compared to V236E mutation.
APOE4–R251G shifts the SASA range of APOE4 minimally, from 17–26 nm2 to 17–29 nm2. As seen in Fig. 6(b), this minimal shift is statistically supported by the KS values (0.21) and small effect size (Cohen's d = 0.28). These results suggest that R251G has minimal impact on the hydrophobic segment of APOE4. The analysis shows that, while APOE3–R136S and APOE3–V236E mutations lead to more hydrophobic exposure of APOE3, APOE–R251G does not for APOE4.
APOE4–R251G mutation reduces the scope of hydrophilic exposure in APOE4. Fig. 6(c) presents the surface exposure of hydrophilic residues in the lipidation region. APOE4 presents a hydrophilic SASA distribution 30–42 nm2, APOE2 32–38 nm2, and APOE3 33–40 nm2. The KS values of APOE4 against APOE2 (0.22) and APOE3 (0.29) indicate only small differences between their distributions. Cohen's d value of 0.15 and 0.57 supports APOE4 presents a small effect size with APOE2 and a moderate effect size with APOE3 on their SASA distributions. The APOE3–R136S and APOE3–V236E mutations do not significantly alter the hydrophilic SASA distribution (KS = 0.17 and 0.25) of APOE3. APOE3–V236E has a greater effect on APOE3 SASA distribution than APOE3–R136S. This is shown by their respective Cohen's d values of 0.57 and 0.19. Meanwhile, the APOE4–R251G mutation narrows APOE4's hydrophilic SASA to 30–35 nm2. The KS value (0.53) and Cohen's d of 1.11 indicate moderate difference and significant large effect size. These indicate that mutations alter common occurring APOE hydrophilic exposure to different degrees.
The evidence suggest that APOE3 mutations moderately increase hydrophobic SASA in the lipidation region. They do not significantly change total or hydrophilic SASA. APOE4–R251G reduces total and hydrophilic SASA. The complete statistic for these KS cumulative distribution plots and the timeseries plot of SASA for lipidation region of the six APOE proteins can be found in S1 and S5 of the SI.
Fig. 7 shows the cumulative distributions for the SASA of the oligomerization region in APOE2, APOE3, APOE4, APOE3–R136S, APOE3–V236E, and APOE4–R251G. The total cumulative SASA distribution is presented in Fig. 7(a), the hydrophobic in Fig. 7(b), and hydrophilic in Fig. 7(c).
![]() | ||
| Fig. 7 Probability and cumulative distributions of SASA in the oligomerization region. (a) Total residues (b) hydrophobic residues (c) hydrophilic residues. | ||
The two mutations of APOE3 introduce distinct changes in exposure area of APOE3 oligomerization region. As presented in Fig. 7(a), the SASA distributions for APOE4 and APOE3 largely overlap within the 32–42 nm2 range. The KS test shows a value of 0.10. This indicates a minimal statistical difference. Cohen's d (0.13) confirms the effect size is negligible. APOE3–R136S mutation introduces moderate changes to the SASA of APOE3. The KS test shows a value of 0.29 and Cohen's d is 0.46. This indicates a small-to-medium effect for APOE3–R136S on APOE3. APOE3–V236E does not change the SASA of APOE3. The KS test shows a value of 0.10 with Cohen's d of 0.10. This supports the null hypothesis that the distributions are identical.
The APOE4–R251G mutation reduce the exposure area of the APOE4 oligomerization region. APOE4–R251G shifts the APOE4 SASA exposure range for the oligomerization region to 27–40 nm2. The KS test (0.22) and Cohen's d of 0.71 suggest some similarity with a medium-to-large effect size. This effect size confirms the observed shift. The APOE4–R251G exposure range of 27–40 nm2 partially overlaps with the APOE2 SASA distribution of 27–44 nm2. The KS test shows a value of 0.22 and Cohen's d is 0.26. These statistics support the distributions overlaps and show statistical similarities. Generally, APOE4–R251G notably alters the APOE4 exposure with an approximate 5–16% reduction in total SASA distribution.
The APOE mutations demonstrate distinct SASA distribution effects on the hydrophobic segment of APOE's oligomerization region. Fig. 7(b) shows the SASA distributions for APOE2 (9–23 nm2), APOE3 (12–18 nm2), APOE4 (13–22 nm2), and the three mutations. Stuchell-Brereton et al. proposed that CTD of APOE contain α-helices that determines its aggregation in monomer-tetramer equilibrium, lipidation, and interaction with amyloid-β.26 APOE4 shows a different hydrophobic SASA distribution compared to APOE2 and APOE3.26 APOE3–R136S shows a range of 12–21 nm2. APOE3–V236E shows a range of 13–20 nm2. Both mutations maintain the hydrophobic SASA scope of APOE3 but present distinct distributions. The KS test shows values of 0.45 for R136S and 0.39 for V236E. Cohen's d is 0.63 for R136S and 0.57 for V236E. These values confirm medium distributional differences between the mutated variants and APOE3.
APOE4–R251G shows a hydrophobic SASA range similar to APOE4. The KS test shows a value of 0.28. Cohen's d is 0.41. These statistics confirm that APOE4–R251G has a distinct distribution with a marginal difference and medium effect size. These observations show that the mutations increase the hydrophobic exposure of APOE surface in the oligomerization region. The degree of increase differs between the mutations.
The APOE3–R136S, APOE3–V236E, and APOE4–R251G mutations do not alter the hydrophilic segment exposure of APOE protein. Fig. 7(c) presents the hydrophilic SASA distributions of the APOE isoforms APOE2, APOE3, and APOE4, and the mutants APOE3–R136S, APOE3–V236E and APOE4–R251G within the oligomerization region. APOE4 presents a similar SASA scope of 16–27 nm2. This range is similar to APOE3 but wider than APOE2, which shows 17.5–23 nm2. The APOE3–R136S and APOE3–V236E mutations maintain a similar hydrophilic SASA range to APOE3. The KS test shows values of 0.16 for APOE3–R136S and 0.28 for APOE3–V236E. These values support the null hypothesis of similar distributions. Cohen's d is 0.14 for APOE3–R136S and 0.56 for APOE3–V236E. These values indicate a negligible effect size for APOE3–R136S and a moderate effect for APOE3–V236E. These findings show that APOE3–R136S and APOE3–V236E do not significantly alter the distribution of the hydrophilic segment SASA in the oligomerization region. APOE4–R251G mutation introduces a small leftward shift in APOE4's SASA distribution range. APOE4–R251G maintains a distribution similar to APOE4. The KS test shows a value of 0.18. This confirms no statistically significant difference in hydrophilic SASA. Cohen's d test of 0.48 indicates a medium effect size on the distribution. The test suggests that the two distributions are somewhat separated, still there exists significant overlap and variability within each group. These results confirm that the mutation does not significantly alter the hydrophilic exposure of APOE4 in the oligomerization region. The complete statistic for these KS cumulative distribution plots and the timeseries plot of SASA for oligomerization region of the six APOE proteins can be found in S2 and S6 of the SI.
We analyze the solvation of the lipidation region of APOE protein based on the hydrogen bonds. First, we analyze hydrogen bonding of lipidation region residues and other residues. Then, between residues within the region, and between lipidation residues and water molecules. Previous reports indicate that the C-terminal domain contains three α-helices which interact with those in the N-terminal helix bundle domain through hydrogen bonds and salt-bridges.13,20,23,59,60,69,70 These interactions need be broken before helix opening occurs. Helix opening exposes hydrophobic surfaces necessary for binding to lipids.13,20,60–62 We define hydrogen bonds using geometric criteria from Chandler et al.71 The definition requires two specific parameters. The first parameter measures the distance between donor and acceptor heavy atoms. This distance must be less than or equal to 0.35 nm. The second parameter measures the donor–hydrogen–acceptor angle. This angle must be less than or equal to 30° from linearity. The absolute angle value must be greater than or equal to 150°. Both criteria must be satisfied simultaneously. We assume hydrogen bond exists only when both conditions are met.71
The APOE3–V236E mutation increases the number of APOE3's hydrogen bonds between the lipidation region and other residues by ∼42%. Fig. 8(a) presents the number of hydrogen bonds between the lipidation region and the rest of each protein. APOE4 hydrogen bonding ranges from 15 to 35. APOE2 and APOE3 show bonding range of 15–38 and 15–28, respectively. For the HB between APOE4 and APOE2, KS of 0.3 suggest marginal difference between their distribution and the Cohen's d of 0.72 reveals a moderate-to-large effect size. A smaller but still moderate effect is observed between APOE4 and APOE3, with Cohen's d of 0.33. The Cohen's value suggest moderate standardized deviations in the HB means of the two proteins.
APOE3–R136S mutation do not introduce significant changes to the HB of APOE3. In Fig. 8(a), APOE3–R136S presents HB range 15–28 similar to APOE3. The KS statistic of 0.0075 and Cohen's d of 0.12 indicate a negligible effect, and a minimal deviation from APOE3. The APOE3–V236E mutation increases HB within the lipidation region from approximately 33 in APOE3 to 44. The KS test (0.22) indicates this increase is not statistically significant. This is supported by the Cohen's d test (0.35) which shows only a moderate deviation and overlap in the standardized mean HB distribution of APOE3–V236E from APOE3. In addition, the APOE4–R251G mutation shifts the HB range from 15–35 in APOE4 to 15–38. The R251G mutation induces a moderate-to-large effect in HB of APOE4 within the lipidation region. The Cohen's d value is 0.73. The KS statistic of 0.3 indicates a marginal difference.
APOE3–R136S and APOE3–V236E mutations would not cause significant change in APOE3's hydrogen bonding within the lipidation region. Fig. 8(b) presents the hydrogen bonding number within the lipidation region for the APOE isoforms and the mutations. APOE4 presents range of 52–82 hydrogen bonding number within lipidation region. Hydrogen bond number range distribution presented by APOE2 (53–80) is similar to but wider than APOE3. The statistic between APOE4 vs. APOE2 (KS = 0.15, and Cohen's d is 0.35) and APOE4 vs. APOE3 (KS = 0.054, and Cohen's d is 0.16), indicates similarity between the distributions with a small effect for APOE2 and negligible effect for APOE3. The APOE3–R136S and APOE3–V236E mutations cause minimal shifts in APOE3's HB range to 55–82. As presented in Fig. 8(b), APOE3–R136S and APOE3–V236E do not alter the distributions of HB number (KS 0.094 and 0.29 with Cohen's d of 0.2 and 0.77). While APOE3–R136S only presents small effect in APOE3, APOE3–V236E shows a medium to large effect size on the number of hydrogen bonding within the lipidation region.
The APOE4–R251G alters the hydrogen bonding within the lipidation region of APOE4. Furthermore, in Fig. 8(b), APOE4–R251G presents the KS statistic 0.4. This indicates a moderate change in HB distribution of APOE4. The Cohen's d value of 1.1 suggests this mutation-induced change has large effect size on hydrogen bond numbers in the lipidation region of APOE4. To sum up, we see that APOE isoforms exhibit distinct hydrogen bonding patterns in the lipidation region.
APOE3 mutations do not introduce significant change in the hydrogen bonding between lipidation residues and water molecules in the lipidation region. Fig. 8(c) presents hydrogen bonding number of lipidation water molecules. From the figure, APOE4 (160–210) shows hydrogen bonding within the range of APOE2 (140–210) and APOE3 (170–220). The two protective mutations of APOE3 only slightly widens the scope of hydrogen bonding in APOE3 to 145–220. The KS statistics (0.11 and 0.25) supports accepting the null hypothesis that these distributions are from similar populations. The standardized mean deviations are however distinct as seen in the Cohen's d (0.1 and 0.65) statistic. These suggest negligible effect by APOE3–R136S and a small to medium effect by APOE3–V236E.
APOE4–R251G changes APOE4 lipidation water molecules hydrogen bonding to resemble APOE2. The APOE4–R251G mutation reduces the number of hydrogen bonds between the lipidation region and water molecules to 155–195. The observed marginal reduction is supported by the KS statistic of 0.31 and a large Cohen's d effect size of 0.85. APOE4–R251G shows similarity in bonding with lipidation water molecules to APOE2. The KS test shows a value of 0.19 and Cohen's d is 0.47. This indicates a medium effect size on their bonding distributions. These results confirm that the APOE4–R251G causes notable changes in HB between APOE4 and lipidation water molecules.
Overall, APOE3–V236E increases intra-lipidation hydrogen bonding with a moderate effect size. APOE4–R251G induces large shifts in APOE4 lipidation region bonding. It also alters bonding with water molecules. The effect sizes are moderate to large, and the distribution changes are marginal. The complete statistic for these KS cumulative distribution and the timeseries plot of number of hydrogen bonds for the lipidation region can be found in S3 and S7 of the SI.
Fig. 9 shows the number of hydrogen bonds of the oligomerization region for the APOE2, APOE3 and APOE4, and APOE3–R136S, APOE3–V236E, and APOE4–R251G. Fig. 9(a) presents hydrogen bonding between the oligomerization region and other resides. Fig. 9(b) shows the hydrogen bonds within the oligomerization region. Fig. 9(c) shows the hydrogen bonds between oligomerization region and oligomers-water molecules.
APOE4–R251G mutation do not change the scope of the hydrogen bonds between the oligomerization region and other residues in APOE4. In Fig. 9(a), APOE4's hydrogen bond range (10–25) falls between that of APOE2 (10–30) and APOE3 (8–23). Statistical comparisons show no significant differences, with negligible effect sizes for APOE2 vs. APOE4 (0.059, Cohen's d 0.023) and APOE3 vs. APOE4 (0.10, Cohen's d 0.17). The APOE3–R136S and APOE3–V236E mutations shift APOE3's hydrogen bond range with other residues from 7–23 to 5–18. APOE3–R136S shows a marginal difference compared to APOE3 (KS 0.4) with a large effect size (Cohen's d 0.97). APOE3–V236E shows some statistical similarity (KS 0.27) with a medium to large effect size (Cohen's d 0.71). The APOE4–R251G mutation does not cause significant changes to APOE4 oligomerization region hydrogen bonding with other residues. The KS statistic of 0.053 and Cohen's d of 0.064 confirm that APOE4–R251G presents no significant shift and a negligible effect size.
APOE3 mutations do not cause significant shift in the hydrogen bonding distribution of APOE3. The distribution of hydrogen bonding within oligomerization region is presented in Fig. 9(b). APOE4 presents a hydrogen bonding range (25–42) similar to APOE2 and APOE3. The KS statistic for APOE2 vs. APOE4 (KS of 0.16, and Cohen's d of 0.34) indicates a no significant difference with a small effect size. In contrast, APOE3 vs. APOE4 show good statistical similarity with a KS of 0.073, and a negligible effect size (Cohen's d 0.094). The APOE3–R136S (25–44) and APOE3–V236E (25–38) mutations do not cause a significant shift in the hydrogen bonding distribution of APOE3. The statistics of APOE3–R136S vs. APOE3 (KS of 0.11 and 0.12, and Cohen's d of 0.22 and 0.028) indicate a no significant difference and a small to negligible effect size.
APOE4–R251G induce moderate reduction of hydrogen bonding within the oligomerization region of APOE4. As presented in Fig. 9(c), APOE4–R251G mutation (25–38) introduces a shift in APOE4's hydrogen bonding distribution. The statistics show KS value of 0.35 and Cohen's d of 0.77. These statistics indicate a marginal shift with a medium to large effect size on APOE4–R251G hydrogen bonding distribution. These statistics confirm that APOE4–R251G alters APOE4's hydrogen bonding within the oligomerization region.
APOE mutations do not alter the hydrogen bonding of oligomers water molecules. In Fig. 9(c), APOE4 presents a HB range 100–135, APOE2 95–130, and APOE3 105–140. The KS statistic for APOE4 vs. APOE2 is 0.079. For APOE4 vs. APOE3, it is 0.13. These values support the similarity between these pairs. They confirm no significant difference in APOE4's hydrogen bonding compared to APOE2. The APOE3–R136S mutation (105–140) shows a KS statistic of 0.032. This supports the null hypothesis. There is no significant difference from APOE3's distribution. The effect size is negligible (Cohen's d 0.038). Similarly, the V236E mutation (105–140) shows no significant change in APOE3 hydrogen bonding with oligomer-water molecules (KS 0.24). However, the effect size is medium. In addition, the APOE4–R251G mutation (100–135) induces a moderate leftward shift in APOE4's hydrogen bonding. However, the distribution remains similar. The KS statistic is 0.24, supporting this similarity. The effect size is medium (Cohen's d 0.56). The complete statistic for these KS cumulative distribution and the timeseries plot of number of hydrogen bonds for the oligomerization region can be found in S4 and S8 of the SI.
Collectively, APOE3 mutations show distinct impacts on APOE3's hydrogen bonding within the oligomerization region. APOE4–R251G marginally alters the hydrogen bonding in APOE4's oligomerization region but does not significantly affect oligomer-water molecule interactions.
The simulation data will be made availibale upon request. Statistical result data has been made available in the supplementary information (SI). Supplementary information: Fig. S1.1: statistics for total SASA of lipidation region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S1.2: statistics for hydrophobic SASA of lipidation region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S1.3: statistics for hydrophilic SASA of lipidation region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S2.1: statistics for total SASA of oligomerization region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S2.2: statistics for hydrophobic SASA of oligomerization region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S2.3: statistics for hydrophilic SASA of oligomerization region – (i) KS test matrix (ii) Cohen’s d matrix. Fig. S3.1: statistics for hydrogen bonding of lipidation with other residues – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S3.2: statistics for hydrogen bonding within lipidation region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S3.3: statistics for hydrogen bonding of lipidation region with water molecules – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S4.1: statistics for hydrogen bonding of oligomers with other residues – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S4.2: statistics for hydrogen bonding of residues within oligomerization region – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S4.3: statistics for hydrogen bonding between oligomerization region with water molecules – (i) KS test matrix and (ii) Cohen’s d matrix. Fig. S5: time-series plot of SASA for lipidation region of the six APOE proteins – (a) total residues, (b) hydrophobic residues and (c) hydrophilic residues. Fig. S6: time-series plot of SASA for oligomerization region of the six APOE proteins – (a) total residues, (b) hydrophobic residues and (c) hydrophilic residues. Fig. S7: time-series plot of number of hydrogen bonds for the lipidation region for (a) lipidation with other residues, (b) within lipidation region, and (c) lipidation region with water molecules. Fig. S8: time-series plot of number of hydrogen bonds for the oligomerization region for (a) oligomers with other residues, (b) within oligomerization region, and (c) oligomers with water molecules. Table S1: correlation analysis and flexibility metrics for APOE isoform and variant comparisons by structural region. See DOI: https://doi.org/10.1039/d5cp03829d.
| This journal is © the Owner Societies 2026 |