A molecular dynamics model for glycosylphosphatidyl-inositol anchors: flop down or lollipop?

We present a computational model of glycosylphosphatidyl-inositol (GPI) anchors for molecular dynamics studies. The model is based on state-of-the-art biomolecular force fields from the AMBER family, employing GLYCAM06 for carbohydrates and Lipid14 to represent fatty acid tails. We construct an adapted glycero-phosphatidyl-inositol unit to establish a seamless transition between the two domains of atom types. This link can readily be extended into a broad variety of GPI variants by applying either domain’s building block scheme. As test cases, selected GPI fragments inserted into DMPC and POPC bilayer patches are considered. Our results suggest that the glycan part of the GPI anchor interacts strongly with the lipid head groups, partially embedding the carbohydrate moieties. This behaviour is supported by the conformational preferences of the GPI anchor, which in particular are conveyed by the strong interactions between the proximal amine and phosphate groups. In a similar way we can conclude that the extension of the anchor away from the lipid bilayer surface that could prevent the contact of the membrane with an attached protein (‘‘lollipop picture’’) is quite unfavorable. Indeed, when attaching green fluorescent protein to the GPI anchor, it is found to reside close to bilayer surface all the time, and the rather flexible phosphoethanolamine linker governs the extent to which the protein directly interacts not only with the head groups, but also with its own GPI core.


Introduction
Glycosylphosphatidylinositols (GPIs) are complex glycolipids present in eukaryotic cells, typically covalently bound to the C-terminus of proteins via a phospho-ethanolamine unit. 1 They primarily serve to anchor proteins to the outer leaflet of the cell membrane, where a Mana(1-2)-Man-a(1-4)-GlcNa(1-6)-myo-inositol pseudopentasaccharide core bridges towards a lipid tail, which in turn is inserted into the plasma membrane. Fig. 1 shows a schematic of a minimal complete structure of a GPI-anchored protein. The sequence of four monosaccharides consisting of three mannoses and one glucosamine plus the trailing myo-inositol (blue) is referred to as the GPI core, the sequence without inositol is termed the GPI backbone. The core is a conserved part in almost all naturally occurring types of GPI-anchors, and it can be modified by various types of residues or side chains, the molecular weight of which may exceed that of the core itself.
Indeed, the vast number of possible side chain modifications including branched oligosaccharides that occur in nature suggests a rather broad functionality beyond the role of a mere anchoring device. GPIs are thought to be involved in localization of their proteins in membrane microdomains, commonly referred to as lipid rafts, 2,3 and bring proteins into close proximity with other raft-associated species to enable their interactions, which underline and determine diverse processes such as signal transduction, 4,5 cell adhesion protein trafficking and sorting, 6,7 and antigen presentation. 8 For all of these possible functions, a key aspect is how the GPI-anchored proteins distribute within the plasma membrane. In nerve cells, GPI-anchored Thy-1 and prion proteins have been suggested to form clusters in membrane domains with distinct lipid composition. 9 In general, the usually saturated alkyl chains of the anchor should favor association with membrane domains rich in cholesterol or other saturated phospho-and glycolipids, glycosphingolipids and sphingomyelin, 7 although there are indications that GPIanchored proteins may also be distributed by non-equilibrium driving forces. 10 Although intuitively appealing, pictures such as raft association or trafficking are still intensely debated, 11,12 and in fact this conception has seriously been challenged by recent experiments of Schütz and coworkers. 13 A major reason for the persisting controversies is that there is no clear molecular scale picture of how the complex of attached protein and the glycolipid anchor is embedded within and interacts with the local membrane environment. Experimental evidence for naturally occurring GPI-anchored proteins is scarce or ambiguous. From cryo-TEM studies with purified GPI-alkaline phosphatase (GPI-AP) inserted into liposomes 14 it was concluded that the glycan part should fill the space between protein and lipid, reminiscent of the protective coat of Trypanosoma brucei. 15 At ambient conditions in solution, a rather close proximity of AP to the lipid head group region has been inferred, 16 suggesting a mutual interaction of glycan, lipid and protein. [17][18][19] The seemingly opposing pictures are not necessarily a contradiction: in liposomes, the content of reconstituted protein or membrane composition and morphology are difficult to control. In addition, the glycan content of a GPI anchor is not exactly defined, for its composition may repetitively be remodeled during a protein's life cycle. 20 Ideally a comprehensive characterization of GPI membrane embedding should proceed in a quasi-synthetic, bottom-up fashion starting from sufficiently simple and defined model systems. For instance, by creating a series of GPI-analogues attached to green fluorescent protein (GFP) and studying GPI-GFP diffusivities in supported lipid bilayers 21 and cells in vivo, 22 Bertozzi and coworkers conclude that high lateral mobility should be attributed to a rather stiff anchor preventing intermittent interactions with the membrane and increased protein-membrane contacts to more flexible anchor structures. In the former case GPI-GFP would roughly resemble a ''lollipop''; in the latter, it can be thought to ''flop down'' onto the membrane, a picture introduced by Sharom and Lehto, 16 see also Fig. 2. Exploring molecular scale details of GPI placement, however, even with elaborate NMR approaches inevitably requires computational methodology. [23][24][25] With the present work we want to study the embedding of GPI anchors in lipid bilayers with atomistic, molecular dynamics (MD) simulations. To facilitate modular buildup and accurate representation of either molecular species, we shall construct a hybrid computational model by combining two state-ofthe-art force fields from the AMBER family, GLYCAM06 26 (glycans) and Lipid14 27 (lipids). However, unlike simply ''stitching together'' a carbohydrate and a lipid via some ad hoc procedure, we shall specifically parametrise a molecular hub (highlighted in Fig. 1) that may readily be extended to various different GPI topologies simply by applying either force field's building block scheme. In this way, a coherent source of atomistic models is provided allowing to walk the ''synthetic route'' computationally. In order to keep the investigation at a manageable level in the spirit of a bottom-up approach, we shall restrict most of the exposition to anchor fragments up to and including Man3, compare Fig. 1. In the last section, we shall give a preliminary account of a fully fledged model of GPI-anchored green fluorescent protein (GFP) including the phosphoethanolamine (PE-) linker; the results largely support the major conclusions: the basic appearance of the embedded GPI-anchor can be understood in terms of rather simple molecular-mechanical arguments suggesting that a flop-down picture should dominate; in DMPC and POPC bilayers configurations with all glycan moieties stretching away from the solvent-headgroup interface are rare. Fig. 2 illustrates the ''flop down'' vs. the ''lollipop'' picture with specifically selected simulation snapshots of the GPI anchor model developed in this work. The flop down, however, can be realized in a number of ways, depending on the extent to which the glycan part intercalates with the head groups. Before analysing this behaviour in detail, we shall first discuss the computational model and how GPI conformations may be characterised before bilayer insertion. We will also comment on how the hybrid model may be validated and/or calibrated further especially along with our results on GPI-GFP, which gives some clues to experimental validation.

Computational approach and model validation
To our knowledge there are only very few computational studies involving GPI anchors or GPI anchored proteins. Homans et al. gave an early structural account of the GPI anchor covering the surface of Trypanosoma brucei, 15 Zuegg and Gready 28 investigated the flexibility of an isolated prion protein attached to a simplified model of a lipid monolayer. Lomás and coworkers have studied the conformational characteristics of the GPI core, 23 and that of isolated GPI anchors inserted in small micelles, 24 with short simulations in support of NMR data.
In fact, providing a faithful model of a GPI anchored protein immersed in a cell membrane is a challenging endeavor, implying on the one hand that the characteristics of each type of biomolecule (protein, carbohydrate and lipid) must be represented with sufficient accuracy (with respect to stability, conformational preferences and phase behaviour, for instance). 29 On the other, the mutual nonbonded interactions of the three species in close proximity must be calibrated, and the experimental evidence for such situations is scarce. [30][31][32][33] A mere computational validation is rather difficult and subtle; in the scope of the present work we can only briefly indicate the state of the art of this subject (see, e.g., the discussion preceding the Conclusion section).
The former issue mainly requires specialization. In the Amber family of force fields, Lipid14 27,34 has been developed to describe lipid bilayers in a stable and robust way; the focus of GLYCAM06 is devoted to modeling complex carbohydrates of almost arbitrary composition 26,35 (compare the corresponding developments in the CHARMM force field 36,37 ). The transferrability of GLYCAM06 (particularly important to the case of GPI anchors) required the adaption of bonded interactions and how they are encoded by a suitable (re-)definition of atom types. For creating a hybrid link that may be extended with both, GLYCAM06 and Lipid14 building blocks, we need to establish a smooth transition between the two domains of atom types that define torsion, angle bending and stretching potential terms. This is accomplished by selecting a suitable bridging fragment (highlighted in Fig. 1) and turning it into a molecule with appropriate methyl cappings, see Fig. 3(a). The transition between atom types occurs across the bond from the C1 carbon (type Cp, GLYCAM) to the phosphate oxygen (oT, Lipid14). w 0 denotes the corresponding torsion angle. Link (a) will be provided in two flavors: in addition to 6OMe-Ino-PGL(0) we define Ino-PGL(0) with bare inositol if the latter is the terminating head group moiety. This dual character is indicated in (b) with 6OMe-Ino-OMe and Ino-OMe, which play the key role for obtaining parameters of bonded interactions (e.g., w 0 and w 1 ) involving a mixed set of atom types (Cg, Cp pertaining to GLYCAM06, oT, pA and oP to Lipid14), and for defining partial charges as to comply with either domains building block principle. The path from (b) to (a) involves a number of somewhat lengthy and tedious steps and is outlined in detail in the ESI, † Section S1. To check for consistency, we compare the conformational preferences of w 0 -w 2 obtained for Ino-POMe to either an all-GLYCAM06 or -Lipid14 parametrization presented in Section S2 (ESI †). These results prove to be rather valuable in order to categorize the plausible conformational preferences of the hybrid link, and then the full GPI anchor. We shall briefly summarize the essential findings: (i) w 0 -w 2 in the hybrid model of Ino-POMe are not affected by the type of partial charges assigned (ensemble averaged GLYCAM06 charges vs. AM1-BCC), see Fig. S3 (ESI †) for comparison. We use the AM1-BCC scheme as a convenient method to produce Lipid14-like partial charges, which rest on the standard two-stage RESP protocol in Amber. 38 This observation is useful as it simplifies the compilation of reference compounds to compare effects of GLYCAM06 vs. Lipid14 parameters. In addition it facilitates further validation of fully fledged models for GPI anchors (see concluding discussion of the following section).
(ii) The behaviour of w 0 of the hybrid-, (obtained from MD in TIP3P solvent at ambient conditions), the all-GLYCAM06and Lipid14 parametrisation is rather similar, major weights are given to intervals B[+601,+901] (within +synclinical or +sc orientation), and B[+1501,+1801] (within the antiperiplanar (ap) or trans-like domain of torsion angles). The distribution in w 0 exhibits two neighboring peaks, compare Fig. S4 and S5 in the ESI † (or S6, for direct comparison).
(iii) The distribution of w 1 in the hybrid model of Ino-POMe is very similar to that in the all-GLYCAM06 version exhibiting two peaks within Àsc, +sc (close to BAE901), with slight preference for +sc (Fig. S4, ESI †). The all-Lipid14 version, by contrast, is different in that broad access is also given to torsions within ap. This can uniquely be attributed to the atom type assigned to the C1 carbon of the inositol ring: cA in Lipid14 (generic sp 3 bonded carbon) and Cp in GLYCAM06 (sp 3 carbon bonded to phosphate oxygen). The ''cA-Cp exchange'' does not impact w 0 see Fig. S6 and the corresponding discussion in the ESI. † 39 (iv) The torsions w 0 -w 2 of the full link 6OMe-Ino-PGL(0) behave quite similarly to those of Ino-POMe; however, the substitution of glucosamine at the O6 oxygen (Fig. 3(c)) has a pronounced impact. The two peaks for w 0 merge and shift into ap; for w 1 , major preference is now given +sc; for w 2 , an asymmetry is established with a preference for +sc/Àac. This is solely due to the mutual intramolecular interaction of the proximal amine and phosphate groups, elucidated by a series of auxiliary simulations reported in Fig. S7 and the accompanying text in the ESI, † Section 2; see also the discussion related to Observation (iv) is very useful as it allows us to single out a few particular GPI conformations that can be expected to occur with high probability. Fig. 4 illustrates the corresponding geometries, which have been obtained in a ''molecular mechanics''-like fashion as follows: from an equilibrated DMPC bilayer a representative phospholipid tail was selected with both alkyl chains pointing along the z-direction, and the O32-P31 bond tilted by 451 with respect to the bilayer normal. This tail is common to structures (a)-(d). Structure (a) was then obtained by adding inositol and GlcN with (w 0 ,w 1 ,w 2 ) set to (1751,901,901) according to observation (iv). The (C,F) glycosidic angles from GlcN to inositol exhibit only one narrow minimum around (701,À1501). The distance distribution between amine and phosphate group (distance between nitrogen and phosphorous atom) has a maximum at 3.5 Å and a secondary one at 5.0 Å. The remaining three mannoses were added with glycosidic angles set according to their global free energy minima. 41 Selecting the lowest energy values for all torsions furnishes the anchor with a hook-like appearance shown in (a) or (b), where the latter is obtained simply by setting the O-glycosidic angle in the Mana(1-6)Man linkage (captured by the orange ellipse in Fig. 4(a and b)) to the gt configuration, which should be equally or a little less populated than gg. The transition gg-gt twists the hydroxy-methyl group of Man3 approximately from Àz to +z-direction.
(c) is derived from (a) by setting (w 0 ,w 1 ,w 2 ) to (1751,À901,À901) that is, choosing for both, w 1 and w 2 the values that are less likely (compare Fig. S7, ESI †). In (d) we have stretched out the GPI core as much as possible (O in Mana(1-6)Man linkage set to the unfavorable trans gauche and Mana(1-2)Man tweaked to a high energy conformation) and setting (w 0 ,w 1 ,w 2 ) to (901,901,901) that is, unfavorable for w 0 . In this conformation the amine and phosphate group are rather close (nitrogen-phosphorous distance about 3.1 Å) such that steric conflicts can be expected. Twisting the two charged groups away from each other as shown in (e) requires adjustment of the GlcN-Ino glycosidic angles, elevating the internal molecular energy as well. From these considerations one may predict that a lollipop-like conformation (d and e) should be rare or rather overwhelmed by variations of the hook-like appearance (a and b). In the following section we will compare the corresponding behavior with molecular dynamics and lipid bilayer systems.

Simulation setup
For MD simulations in small DMPC-and POPC bilayer patches we consider the GPI-anchor fragments displayed in Fig. 5. (a) has been modeled from the link Ino-PGL(0), (b) and (c) from 6OMe-Ino-PGL(0). As simulation engines Amber14 42 as well as GROMACS v. 4.6.4 were employed 43,44 the latter in the majority of cases for running long (microsecond) bilayer simulations. All simulations pertaining to GROMACS were run using the CPU as well as the GPU version. To translate Amber/GLYCAM-to GROMACS molecule topologies and input files we utilized a script originally devised by Sorin and Pande 45 and adapted by us 41 to cope with the specificities of GLYCAM06. All simulations were carried out under standard conditions at 303 K with the TIP3P water model with pressure maintained at 1 bar using semi-isotropic Berendsen rescaling 46 in an orthorhombic simulation box. Removal of center of mass motion was applied to the bilayer system as a whole. The species (a-c) were inserted through a number of separate steps. First a quadratic bilayer patch of 2 Â 64 (8 by 8) phospholipid molecules was prepared with surface area of 100 Å 2 per lipid. At this stage one lipid per leaflet was deleted and replaced with a corresponding GPI molecule, keeping the symmetry of the simulation setup. The latter was pulled out a few Å in z-direction as to avoid any initial steric clashes of the glycan part with the PC head groups. In addition, the glycan portion was twisted away from the bilayer surface as much as possible by manually adjusting the torsion angles w 0 -w 2 and all subsequent glycosidic linkages, obtaining a structure close to that shown in Fig. 4(e). In this way some excess internal energy is provided at the start of the simulation procedure. The whole system is then solvated with TIP3P waters taken from pre-equilibrated solvent boxes. Waters placed within the region of alkyl chains were subsequently removed. This fact and the different sizes of (a) to (c) lead to slight variations in the final system size with average extensions of 6.5 nm, 6.5 nm and 13 nm in x, y and z-direction, respectively, B66 500 atoms in total, and B17 500 water molecules. In the case of Ino-PGL (a) one Na + counterion was added to the system. We used a plain cutoff of 1 nm for vdW interactions and the same value was used as the real space cutoff in the particle mesh ewald (PME) electrostatics, which was used througout.
Initially, all lipids were subject to soft harmonic restraints with respect to their center-of-mass motion to allow the bilayer to soak with solvent during an initial period of 100 ps at 100 K during which bad contacts were resolved, using a Langevintype stochastic integrator 47 with a collision time of 1 ps. Subsequently, the temperature was ramped up to 300 K during another 100 ps, again removing water molecules that penetrated into the domain of alkyl chains. Then the system was allowed to relax with pressure control on (pressure relaxation time of 2 ps) at 1 bar. The area per lipid quickly (well within 10 ns) relaxed to its equilibrium value, and remained there for the total simulation time in each case. Reference simulations with temperature controlled by a Nosé-Hoover chain thermostat 48 resulted in the same behavior. Snapshots were written out at an interval of 0.25 ns resulting in a total of 4000 frames per ms, of which we omit the first 50 ns (equilibration period) leaving 3801 for data analysis.
In Fig. 6 we show the time-dependent area per lipid for systems (a-c) in DMPC and POPC bilayers, respectively. The panel for DMPC indicates the evolution of GPI-core-PGL from the initial erected-(snapshot taken at 0.3 ns) to a flop-down conformation where the GPI backbone appears to point roughly parallel to the bilayer plane.

General appearance of GPI conformations within bilayers
In the previous section it was argued that for GPI-core-PGL hook-like conformations arise through the special preferences of torsion angles w 0 -w 2 , induced by intramolecular interactions between amine and phosphate group. We can essentially make the same observation as described in (iv) with GPI fragments (a-c) in Fig. 5, but now embedded within the bilayers, see Fig. 7. As soon as GlcN is included, the bimodal distribution for w 0 merges and shifts into +ap, almost complete emphasis for w 1 on +sc, and an asymmetry of w 2 with emphasis on +ac/+ap configurations, the orientation of the O31-P31 bond shows its maximum around 451 (see insets). Indeed, in the simulation runs we observe (with the exception of the ''submarine'', see below) a pronounced occurrence of hook-like conformations for GPI-core-PGL in each leaflet, that is, the GPI core mostly tends to ''flop down'' onto the bilayer surface. Fig. 8 displays how similar conformations to the ones shown in Fig. 4 are assumed within the lipid environment.
(a) reproduces the snapshot shown in Fig. 2, the conformation belongs to a class that we shall coin ''swimmer'', because all carbohydrate moieties are well exposed to the solvent, situated at the interface between PC head groups and aqueous subphase, see also (b) for the corresponding top view. The swimmer can be realized by a number of variations of the  ). (f) shows an extreme situation (with a conformation close to Fig. 4(c)) where the GPI core is completely buried within the head groups, contacting the hydrophobic core of the bilayer with minimum exposure to water. This ''mole'' actually persists over the full simulation time of the lower leaflet of the POPC bilayer, and appears to be stabilized or kinetically trapped. A similar situation is observed with extended conformations. Fig. 8(c) shows the ''submarine'': the stretched out GPI core persists for more than 500 ns, but only Man3 is well exposed to the solvent subphase. The submarine eventually turns into the ''diver'' at the end of the simulation run. The ''phoenix'' (e), which is close to the conformation (d) in Fig. 4, lasts as expected only for a few nanoseconds. These (and in particular the last) examples show that it is not enough to distinguish between different conformations, but also between different levels of hydration. To fully capture the dynamics of the embedding the following observation is useful: the vertical positioning of all GPI species seems to be quite independent of conformation or carbohydrate content, see Fig. S8 and the corresponding discussion in the ESI, † Section S3. That is, in z-direction, the GPI phosphate resides at the level defined by all other phosphates in the lipid layers. We may thus picture the GPI core to move and reorient with respect to this reference level and to produce some characteristic time-dependence of the hydration pattern, complementary to conformational dynamics. This is analysed in the following.

Aspects of conformational dynamics
In addition to tracking the hydration level of all carbohydrate moieties we also consider the hydrogen bridges formed with the PC heads in each frame, see Fig. 9(a) and (b), respectively, where the time points of the snapshots in Fig. 8 have been annotated. The swimmer is characterized by an overall even exposure of all monosaccharide entities to the solvent, or very few h-bonds towards the PC heads. This situation is seen to prevail for the upper leaflets of the DMPC and POPC bilayer. The ''phoenix'' Fig. 8(e) observed in the upper DMPC leaflet (reproduced from Fig. 2) lasts for a few ns only; its hydration  level is similar to that of the swimmer, yet its placement is unfavorable not only because of the molecular conformation, but also because the alkyl chains have to enter the hydrophilic region to some extent.
The transition from submarine to diver is clearly visible in the lower left panels of (a) and (b), respectively: hydration is traded for h-bonds towards PC heads. The mole exhibits an even stronger interaction with the surrounding lipids and in both cases we expect that the conformational freedom of the GPI backbone itself is affected. This can be verified with the evolution of the end-to-end distance d(C4,C6) and the tilt angle y z of the core with respect to a vector defined by the C4 ring carbon of Man3, and C6 of inositol, see Fig. 10. For the submarine the tilt towards the z-axis persists for more than 0.6 ms (see inset) corresponding to the course of the hydration level in Fig. 9(a), until it gradually relaxes towards a lateral orientation. The mole shows little variation in tilt angle. Both configurations are additionally restrained with respect to end-to-end distance, which spans an interval of only 10-15 Å (see lower panel in Fig. 10) whereas the ''swimmers'' (green and blue graphs) can explore values of d(C4,C6) as low as 6 Å.
How to validate the force field for embedded GPIs?
In the previous assessment it became clear that a flop-down appearance of the GPI core prevails. In terms of steric/ geometric accessibility by a protein the swimmer as a specific example naturally appears as a more plausible placement than the mole. But to determine the relative stability between a buried and a more flexible placement certainly requires advanced sampling strategies 49 beyond the scope of the present account. To judge whether the ''mole'' could eventually be ruled out requires yet additional work. As has already been pointed out in the computational approach, the mutual interaction of Fig. 9 Hydration level of GPI core moieties Man3, Man2, Man1 and GlcN, color coding as in Fig. 8. The upper two panels of each block refer to the upper leaflet (z 4 0) of the respective bilayer system. The hydration level for a specific frame at time t is defined as the number of water molecules within a shell of 3 Å thickness around the selected residue. The distance criterion is applied to water oxygen only. (b) Number of hydrogen bonds of each monosaccharide towards the PC head group atoms per frame. Both quantities in (a) and (b) have been subjected to a running average with a 20-frame window in order to remove excessive noise. aminoacids/protein, carbohydrates and lipids is extraordinarily difficult to characterise experimentally and thus poorly explored computationally. Consequently, one cannot expect every aspect to be calibrated with highest accuracy. The general problem of adequately tuning nonbonded interactions in crowded environments has been identified as one of the major challenges of current force field development. [50][51][52] Variation of force field parameters within admissible limits is a one viable strategy to test the sensitivity (and thereby the plausibility) of a certain phenomenology. 53,54 For instance, in discussing how the conformational preferences of the hybrid link Ino-POMe emerge, extensive use has been made of the fact that use of either GLYCAM06-type or AM1-BCC charges led to indistinguishable results, see e.g. Fig. S3 in the ESI. † That is, without seriously affecting aspects of molecular mechanics, the hybrid link to bridge GLYCAM06 and Lipid14 domains of atom types may also be viewed as a vehicle to test nonbonded interactions systematically for instance, by altering the charge scheme on the glycan moieties, exploring different vdW combination rules between carbohydrates and lipids, 55 etc.
Another possibility is to immediately go on with the synthetic route. In what follows, we shall explore what we can learn from suitably extending GPI-core-PGL and provide a first account of a fully fledged model of GPI-anchored green fluorescent protein (GFP). GFP is not a natural candidate as it lacks the typical motifs of GPI anchored proteins around their C-termini, 56 but it is rather useful for several reasons. It is known to interact very little with other lipid components, and is frequently used as a marker for in vivo studies of cell membranes such as in the investigation of GPI analogues in supported lipid bilayers 21,22 or different re-assembled GPI-GFP into live cells. 57 Thus, GPI-GFP is a convenient starting point to study how the protein would act back on the GPI anchor and how the latter would influence orientation and proximity of the protein with respect to a membrane.

GPI-anchored green fluorescent protein
To construct a GPI anchored GFP we proceeded through the following steps. Coordinates for GFP were taken from the X-ray structure with PDB code 1EMA, and rendered a complete molecular topology using the tLeaP facility from the Amber suite. The phosphoethanolamine (PE) linker was parametrized with the GLYCAM06 force field defining a transition region between linker and GFP's C-terminus (Threonine) similar to the way the hybrid building block between GLYCAM06 and Lipid14 was derived. This is fully outlined in the ESI, † Section 4. The creation of the simulation setup followed along the same lines as described for the free GPI-anchor, except that now the bilayer patches are 16 by 16 phospholipids in size with only one phospholipid per bilayer (upper leaflet) replaced by a GPI-anchored GFP. The total number of atoms was 309 634 and 311 789 for the setup with DMPC and POPC lipids, respectively, and additional 7 Na + ions were introduced to compensate GFP's charged residues. All other simulation conditions and parameters are as in the case of free GPIs. Initially, the GPI-GFP points away from the bilayer surface with its long axis parallel to the bilayer normal (z-direction). The long axis is relatively well defined by a vector running through the center of masses of residues 76 (glutamine) and 135 (histidine), see Fig. 11. Since a twisted configuration of GPI-core-PGL was chosen as initial configuration (compare Fig. 6), GFP and phosphoethanolamine linker initially had no contact to the bilayer surfaces. In the course of 1 ms long simulations for each bilayer system, the GPI core within 200-300 ns flops down onto the bilayer assuming a hook-like conformation reminiscent of the ''diver'', with the GFP barrel resting on the head group region with the long axis pointing along the bilayer surface, compare Fig. 11(a); the ''flop down'', remarkably, also allows for a completely upright orientation conveyed by the PE linker (b). Panel (c) shows the evolution of GFP orientation for DMPC and POPC. Both simulations indicate that the parallel orientation (the ''sleeping GFP'') Fig. 11 Snapshots from a simulation with a full GPI-GFP inserted into a POPC bilayer as described in the text where the GFP ''sleeps'' on the head groups with its long axis pointing laterally (a) or stands upright (b), respectively. Panel (c) shows the corresponding evolution of long axis orientation for the GPI-GFP in POPC and DMPC, respectively. Panel (d) shows the dynamics of the long axis for trajectories restarted from the time points indicated in (c) (same color code). The times are 0 ns (initial, tweaked GPI configuration), 200 ns, 500 ns, 700 ns (snapshot (b)), 850 ns and 980 ns.
should dominate, but the re-orientational dynamics is quite slow, occurring roughly over 200 ns. We therefore restarted the simulation of GPI-GFP embedded in POPC from 6 different time points along the trajectory indicated by the dashed lines in (c), with new initial velocities (simulation time 200 ns each). The corresponding trajectories are shown in (d). Four of these show a tilt angle of smaller than 601 initially. The trajectory in orange starts from the same initial configuration as the one in panel (c), but now relaxing somewhat faster to parallel orientation. After 200 ns, only in one run the protein still stands upright, in all other cases the sleeping GFP appears to evolve. In this way, our current model makes a rather clear prediction about the orientation of the GFP, and this could be verified experimentally by employing anisotropic fluorescence spectroscopy 58 because the GFP determines the orientation of the chromophore inside.
We note that we verified that interaction of GFP with lipidwater interface is weak. When brought into initial contact with the head group region (sleeping or upright orientation) the GFP detaches from the bilayer surface within 100-200 ns driven by diffusion and bilayer undulations.
By analysing all trajectories of Fig. 11(d), we can make further observations: (i) Fig. S9(a) in the ESI † shows that taken together, the orientation distribution of the end-to-end vector from C6 (inositol) to C4 (Man3) is in accord with those of the swimmerlike configurations seen to dominate in the POPC and DMPC top leaflet of the free GPI anchor GPI-core-PGL, compare Fig. 10; the attachment to the protein appears to stabilizes the end-to-end distance d(C4,C6) at B1.4 nm, see Fig. S9(b) (ESI †). However, unlike the swimmer of GPI-core-PGL where the O glycosidic angle of the aMan1-6Mana linkage between Man2 and Man1 exhibits mostly the gg rotamer, in GPI-GFP it is exclusively gt, compare also Fig. 4. This actually causes Man1 to stick out of the headgroups (compare red-colored Man1 in Fig. 11(a)) more than Man2 or Man3.
(ii) The GPI core does not notably influence GFP's orientation; it is by contrast the PE linker and its flexibility that allows the GFP to explore a range of different orientations. It may even ''twist back'' to interact with in principle all glycan moieties. In Fig. S10 (ESI †) we characterise these interactions by displaying the dynamic formation of hydrogen bonds. The fact that an attached protein may quite extensively interact with its own GPI glycan could be a very important hint for possible NMR experiments where, for instance, a GPI-anchored protein is inserted into a well defined and suitable environment such as a small micelle 24 or a phospholipid nanodisc. 59

Conclusions
We can state that the hybrid GPI anchor models developed in this work turn out to be quite useful. They exhibit a broad variety of conformational modes by which the GPI glycan can interact with the headgroup-solvent interface facilitated by the dominating hook-like appearance of GPI-core-PGL, which causes the GPI core glycan to flop down onto the lipid bilayer but nevertheless retains a high degree of internal flexibility. This may be contrasted to other glycolipid species such as the ganglioside GM3, which is inherently stiffer and rich in b-type glycosidic linkages, making the glycan headgroup point away from the bilayer surface. 25 In contrast we can conclude that a ''lollipop'' picture of a GPI anchor (represented, e.g., by the ''phoenix'') is rather implausible. The preference of GPI-core-PGL for hook-like conformations can be expected to be a robust and generic phenomenon, as it is in particular conveyed by the proximity and mutual intramolecular interaction of amine-and phosphate groups, a mechanism retained after attaching GFP. The proximity to the bilayer surface and the flexibility of orientation conveyed by the phospho-ethanolamine linker permit the GFP to interact quite extensively with its own GPI-glycan. Naturally the question arises about the consequences of adding various side chains to Man1 such as in the case of, e.g., Toxoplasma gondii. 60 Whether a specific side chain would impose a certain preference of orientation or stabilize a favorable protein fold are then indeed aspects that could be studied numerically and experimentally giving valuable insights into the general purpose of GPI anchoring.

Conflicts of interest
There are no conflicts to declare.