Shaheen A. Farhadi, Antonietta Restuccia, Anthony Sorrentino, Andrés Cruz-Sánchez and Gregory A. Hudalla*
J. Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Biomedical Sciences J293, PO BOX 116131, 1275 Center Drive, Gainesville, FL 32611, USA. E-mail: ghudalla@bme.ufl.edu;   Tel: +1 (352) 273 9326
    
First published on 24th September 2021
In nature, the precise heterogeneous co-assembly of different protein domains gives rise to supramolecular machines that perform complex functions through the co-integrated activity of the individual protein subunits. A synthetic approach capable of mimicking this process would afford access to supramolecular machines with new or improved functional capabilities. Here we show that the distinct peptide strands of a heterotrimeric α-helical coiled-coil (i.e., peptides “A”, “B”, and “C”) can be used as fusion tags for heterogeneous co-assembly of proteins into supramolecular structures with tunable subunit stoichiometry. In particular, we demonstrate that recombinant fusion of A with NanoLuc luciferase (NL-A), B with superfolder green fluorescent protein (sfGFP-B), and C with mRuby (mRuby-C) enables formation of ternary complexes capable of simultaneously emitting blue, green, and red light via sequential bioluminescence and fluorescence resonance energy transfer (BRET/FRET). Fusion of galectin-3 onto the C-terminus of NL-A, sfGFP-B, and mRuby-C endows the ternary complexes with lactose-binding affinity that can be tuned by varying the number of galectin-3 domains integrated into the complex from one to three, while maintaining BRET/FRET function. The modular nature of the fusion protein design, the precise control of domain stoichiometry, and the multiplicity afforded by the three-stranded coiled-coil scaffold provide access to a greater range of subunit combinations than is possible with the heterodimeric coiled-coils used previously. We envision that access to this expanded range of co-integrated protein domain diversity will be advantageous for future development of designer supramolecular machines for therapeutic, diagnostic, and biotechnology applications.
Design, System, Application

Non-covalent co-association of different proteins can produce supramolecular constructs capable of performing complex functions through the concerted activity of the individual components. Fusing proteins to peptides that form specific supramolecular structures, such as an α-helical coiled-coil, is a widely used synthetic approach for non-covalent protein assembly. However, typical systems only enable incorporation of one or two different proteins. Increasing the number of co-integrated protein types, while maintaining control of their proportions, would enable design of constructs with greater diversity of functional complexity. Here, we demonstrate an approach to create constructs with emergent and synergistic functionality via fusion of proteins to each of three distinct peptide strands that form a heterotrimeric α-helical coiled-coil. Constructs that simultaneously emit blue, green, and red light in response to a chemical stimulus were created via co-assembly of a luciferase enzyme and a pair of fluorescent proteins, highlighting the potential to develop modular diagnostics with this approach. These constructs were also endowed with tunable carbohydrate-binding affinity by co-integrating different numbers of galectin-3 domains, demonstrating the potential to develop targeted drug-delivery vehicles with this approach. This modular protein co-assembly approach is expected to provide access to new constructs with sophisticated functional capabilities for medical and biotechnology applications.
Inspired by these and other examples, synthetic protein co-assemblies are attractive as both tools to interrogate natural heterogeneous protein assemblies and as the basis for creating entirely new molecular machines. Central to these efforts is the development of co-assembly motifs that provide precise control of the stoichiometry of the co-integrated protein domains. One approach relies on sophisticated computational methods that enable the design of protein partners with complementary association interfaces.8,9 Another approach relies on fusing the protein of interest onto a peptide “handle” that directs its assembly into a prescribed supramolecular architecture.10,11 Handles based on peptides that form β-sheets have been shown to provide control of protein stoichiometry within an entire system of molecules;12 however, β-sheet nanofiber chain length is difficult to control, while co-assembly of β-strands is often heterogeneous and stochastic.13 Thus, although system-level control of protein ratio can be achieved, molecular level precision is lacking. In contrast, α-helical coiled-coils are self-limiting, deterministic structures formed from a discrete number of peptide strands, where this number can be tuned through rational design of peptide–peptide interfaces.14,15
Due to their predictable and reproducible behavior, α-helical coiled-coils find extensive use as scaffolding for supramolecular protein assembly. For example, homodimeric and trimeric coiled-coils are used for multivalent display of antibody fragments.16,17 Likewise, a leucine zipper peptide that forms a homodimeric coiled-coil could stabilize the homodimeric quaternary structure of galectin-1.18 Fusing galectin-1 and galectin-3 onto opposing ends of this dimeric leucine zipper peptide provided a heterotetrameric (i.e., ‘dimer of dimers’) construct with increased immunomodulatory activity when compared to either galectin alone.19,20 Heterodimeric protein constructs can also be formed by inserting charged amino acids along the hydrophilic interface residing between two coiled-coil forming peptides (i.e., hydrophobic residues at a and d positions, whereas charged residues at e and g positions of the canonical abcdefg heptad sequence repeat).21,22 In this design, charge complementarity favors the association of two different coiled-coil strands (i.e., A + B peptide pairing), while electrostatic repulsion prevents self-association (i.e., A + A or B + B peptide pairing). 
This approach has been used to create a soluble T cell receptor analog,23 CD8 heterodimers,24 HLA-DR1:HLA-DM heterodimers,25 multivalent antibody fragments,26–28 probes for high resolution molecular imaging,29 molecular recognition screens and sensors,30,31 mediators of membrane fusion,32 protein purification tags,33 controlled drug release vehicles,34 probes of multivalent cell adhesion,35 stabilized growth factors,36 growth factor immobilization anchors,37–39 and transcription factor immobilization anchors.40 However, although higher-ordered homomeric assemblies have been reported,41–43 examples of higher-ordered heterogeneous assemblies are limited.44 Further, there is presently little understanding of the effect of the number and type of co-integrated protein domains on the co-assembly process or functional capabilities of the resulting heterogeneous constructs.
Here we tested the concept that constructs with modular and tunable protein domain stoichiometry could be created through the use of a trimeric coiled-coil peptide scaffold in which each strand was unique. To test this, we employed an ABC heterotrimeric coiled-coil developed by Alber and colleagues.45 Although each peptide A, B, or C demonstrated some capacity for self-association when alone, the heterotrimer was the preferred state when all three molecules were present in the system. To test the potential of the A, B, and C peptides as handles for protein co-assembly, we created a library of fusion proteins in which strand A was fused to NanoLuc luciferase (NL), B was fused to superfolder green fluorescent protein (sfGFP), and C was fused to mRuby (Fig. 1A). In some instances, the β-galactoside-binding protein galectin-3 (Gal3) was fused onto the opposing terminus of A, B, or C to generate NL-A-Gal3, sfGFP-B-Gal3, and mRuby-C-Gal3 (Fig. 1B). We chose NL, sfGFP, and mRuby because their co-assembly was expected to yield constructs capable of sequential bioluminescence and fluorescence resonance energy transfer (BRET/FRET),46 which provides both an analytical measure of co-assembly and a demonstration of the potential of this approach to create modular diagnostics with emergent activity arising from the co-integrated activity of the co-assembled protein domains. We chose galectin-3 based on a recent report showing that the carbohydrate-binding affinity of homogeneous synthetic galectin-3 constructs can be tuned by varying the number of galectin-3 domains,43 which provides an additional analytical measure of co-assembly and a demonstration of the potential of this approach to create targeted drug delivery vehicles.42 Fig. 1C outlines the assortment of heterotrimeric trios that are possible with this protein library, which were evaluated in this report using native PAGE, fluorimetry and luminometry, and lactose affinity chromatography. 
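The combinatorial space of this library can be sketched in a few lines. The example below is a minimal illustration, assuming that each strand contributes exactly one fusion (with or without Gal3) to a trio; the names `STRAND_VARIANTS` and `enumerate_trios` are hypothetical and are not taken from the original work.

```python
from itertools import product

# Fusion variants available for each coiled-coil strand, as named in the text.
# Assumption: every trio contains exactly one A, one B, and one C fusion.
STRAND_VARIANTS = {
    "A": ("NL-A", "NL-A-Gal3"),
    "B": ("sfGFP-B", "sfGFP-B-Gal3"),
    "C": ("mRuby-C", "mRuby-C-Gal3"),
}


def enumerate_trios():
    """Return every A + B + C trio together with its number of Gal3 domains."""
    trios = []
    for combo in product(*STRAND_VARIANTS.values()):
        gal3_count = sum(name.endswith("-Gal3") for name in combo)
        trios.append((combo, gal3_count))
    return trios


trios = enumerate_trios()
for combo, gal3_count in trios:
    print(" + ".join(combo), "| Gal3 domains:", gal3_count)
```

Under this assumption the library gives eight trios carrying zero to three Gal3 domains, which is the stoichiometric range explored in the experiments that follow.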
These studies demonstrate that distinct peptide strands of a heterotrimeric coiled-coil can be used as handles to create heterogeneous protein co-assemblies with tunable functional domain stoichiometry, thereby enabling opportunities to develop modular multi-protein machines with new functional capabilities.
… 300 × g at 4 °C for 10 min) in a Sorvall™ RC 6 Plus Superspeed Centrifuge (ThermoFisher). Pelleted bacteria were resuspended and lysed in B-PER bacterial protein extraction reagent (ThermoFisher) supplemented with a Pierce protease inhibitor tablet (ThermoFisher), 2400 units mL−1 DNase I (ThermoFisher), and 50 mg mL−1 lysozyme (ThermoFisher) for 20 min at room temperature. Bacterial lysates were centrifuged (42 600 × g at 4 °C for 15 min) to separate the soluble protein fraction into the supernatant. Supernatant was applied to HisTrap™ FF crude prepacked columns (GE Healthcare) connected to an ÄKTA™ Pure FPLC system (GE Healthcare) where His-tagged proteins were purified via immobilized metal affinity chromatography (0–250 mM imidazole gradient for protein elution). Amicon® Ultra centrifugal filters (MilliporeSigma) with a 10 kDa cut-off were used to concentrate purified proteins to 5 mL for further purification and removal of imidazole via size-exclusion chromatography on a HiLoad™ 26/600 Superdex™ 200 column (GE Healthcare) connected to an ÄKTA™ Pure FPLC system. The molar concentration of purified proteins was determined by the Beer–Lambert law. Absorbance (λ = 280 nm) was measured on a NanoDrop spectrophotometer (ThermoFisher) and the extinction coefficient of each protein was calculated based on amino acid content using the ExPASy ProtParam tool (available at https://web.expasy.org/protparam/). Extinction coefficients are as follows: 2842.0 M−1 mm−1 for NL-A; 2201.5 M−1 mm−1 for sfGFP-B; 2602.5 M−1 mm−1 for mRuby-C; 6441.5 M−1 mm−1 for NL-A-Gal3; 5788.5 M−1 mm−1 for sfGFP-B-Gal3; 6053.0 M−1 mm−1 for mRuby-C-Gal3.
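The concentration calculation can be illustrated with a short sketch of the Beer–Lambert relation, c = A/(ε·l), using the extinction coefficients listed above. The 1 mm path length and the absorbance reading are assumptions chosen for illustration (the path length is suggested by the mm−1 units but is not stated in the text), and the function name is hypothetical.

```python
# Extinction coefficients at 280 nm as listed in the text, in M^-1 mm^-1.
EXTINCTION_M_MM = {
    "NL-A": 2842.0,
    "sfGFP-B": 2201.5,
    "mRuby-C": 2602.5,
    "NL-A-Gal3": 6441.5,
    "sfGFP-B-Gal3": 5788.5,
    "mRuby-C-Gal3": 6053.0,
}


def molar_concentration(a280, protein, path_mm=1.0):
    """Beer-Lambert law, c = A / (epsilon * l); returns mol per litre.

    path_mm defaults to 1 mm, which is an assumption, not a reported value.
    """
    if path_mm <= 0:
        raise ValueError("path length must be positive")
    return a280 / (EXTINCTION_M_MM[protein] * path_mm)


# Hypothetical A280 reading, chosen only to make the arithmetic easy to follow.
conc = molar_concentration(0.2842, "NL-A")
print(f"NL-A: {conc * 1e6:.1f} uM")
```

With these illustrative inputs the result is about 100 µM; real readings would also need blank subtraction and a check that the absorbance lies in the linear range of the instrument.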
      
      
To characterize the solution behavior of the fusions and their co-assembly into a heterotrimer, we first measured the hydrodynamic size of each alone and in equimolar combination using size-exclusion chromatography (ESI† S10). When alone, all of the proteins eluted from the SEC column at a volume corresponding to a molecular weight that was significantly greater than the theoretical value. In particular, NL-A eluted with a sharp peak corresponding to an empirical molecular weight (MW) of 99.5 kDa, as well as smaller peaks at higher and lower MW (theoretical MW = 25.1 kDa) (ESI† S10A). sfGFP-B eluted with a sharp peak corresponding to an empirical MW of ∼164 kDa, as well as a broad peak centered at an empirical MW of ∼89 kDa (theoretical MW = 34.5 kDa) (ESI† S10B). mRuby-C eluted with a sharp peak corresponding to an empirical MW of 98 kDa, as well as a smaller peak at a lower MW (theoretical MW = 32.5 kDa) (ESI† S10C). The observed discrepancies between the empirical and theoretical MWs of NL-A, sfGFP-B, and mRuby-C may suggest a tendency for the fusion proteins to self-associate. Indeed, in the paper describing the development and characterization of the A, B, and C peptides, Alber and colleagues reported that the peptides had some tendency to self-assemble when alone, but favored heterotrimer formation when present in combination.45 Here, though, it is also important to note that the estimation of molecular weight from elution volume via SEC assumes that the proteins can be approximated as globular hard spheres and have hydrodynamic properties similar to those of the standards used for calibration. It is reasonable to expect that the linker domain and the A, B, or C peptide domain, which together are roughly 50 amino acids long and correspond to 10% or more of the protein molecular weight, are likely to be unstructured. 
Intrinsically disordered proteins, as well as other non-globular fusions, are known to elute at volumes that correspond to much higher MWs than expected due to their larger Stokes radii.47,48
When combined at an equimolar ratio, the heterotrimeric mixture of NL-A, sfGFP-B, and mRuby-C eluted as a relatively narrow peak at an elution volume that corresponded to an empirical MW of 108 kDa, which was slightly higher than the theoretical value of 92.1 kDa (ESI† S10D). As was noted above for the analysis of NL-A, sfGFP-B, and mRuby-C elution, discrepancy in the empirical and theoretical MW likely reflects the non-globular structural features of the assembly. However, the observation that the proteins eluted at unique volume when combined as compared to their elution when alone suggested that they were in a different physical state (e.g., co-assembled into a heterotrimer) when combined at an equimolar ratio. When compared to SEC traces reported previously for homotrimeric protein complexes,42,43 the empirical molecular weight estimated from elution volume suggested that NL-A, sfGFP-B, and mRuby-C formed a complex with a size comparable to the predicted heterotrimer.
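The size of these discrepancies is easier to compare as a ratio of empirical to theoretical MW. The sketch below uses only the values quoted above (taking the sharp main peak for each individual protein); the dictionary and variable names are hypothetical.

```python
# (empirical MW from the main SEC peak, theoretical MW), both in kDa, from the text.
SEC_MW_KDA = {
    "NL-A": (99.5, 25.1),
    "sfGFP-B": (164.0, 34.5),
    "mRuby-C": (98.0, 32.5),
    "NL-A + sfGFP-B + mRuby-C": (108.0, 92.1),
}

# Apparent-to-theoretical MW ratio for each sample.
ratios = {name: emp / theo for name, (emp, theo) in SEC_MW_KDA.items()}

# Sum of the three monomer masses, for comparison with the quoted heterotrimer mass.
monomer_sum = sum(SEC_MW_KDA[name][1] for name in ("NL-A", "sfGFP-B", "mRuby-C"))

for name, ratio in ratios.items():
    print(f"{name}: empirical/theoretical = {ratio:.2f}")
print(f"Sum of monomer theoretical MWs: {monomer_sum:.1f} kDa")
```

The individual fusions run at roughly three to five times their theoretical mass, whereas the equimolar mixture runs within about 20% of the heterotrimer mass, which is the contrast the argument above relies on.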
Due to the limitations of SEC, we evaluated the migration of NL-A, sfGFP-B, and mRuby-C alone and in an equimolar mixture using native polyacrylamide gel electrophoresis (“native PAGE”) to assess their co-assembly into supramolecular complexes. In native PAGE, protein samples are loaded into a gel and then subjected to an electric field that will induce their mobility through the gel pores. Under native conditions, the distance that a protein will migrate through the gel depends on its size, shape, net charge, and the properties of the gel used. Charge-to-mass ratio is a useful predictor for migration distance, where molecules (or assemblies) with a greater negative charge-to-mass ratio are expected to migrate further (i.e., farther away from the negative pole). This contrasts with sodium dodecylsulfate PAGE (SDS-PAGE) wherein migration distance correlates directly with molecular weight, irrespective of charge, because the proteins are denatured and charge is equilibrated by way of the anionic detergent. Based on the properties of the fusion proteins, we would expect sfGFP-B to migrate a moderate distance (charge-to-mass = −0.377); NL-A to migrate the longest distance because it is smaller than sfGFP-B and has a slightly more negative charge-to-mass ratio (−0.398); and mRuby-C to migrate the shortest distance because it has the smallest charge-to-mass ratio of all of the proteins (−0.09). Consistent with this, we observed that NL-A migrated a slightly longer distance than sfGFP-B, whereas mRuby-C migrated a shorter distance than either of the other proteins (Fig. 2A). When the NL-A, sfGFP-B, and mRuby-C fusion proteins were combined at an equimolar ratio prior to electrophoresis, only one band was identified on the native PAGE gel following Coomassie staining (Fig. 2A). 
This band migrated a unique distance when compared to the migration of NL-A, sfGFP-B, or mRuby-C alone, and this intermediate distance was consistent with the relative migration distance expected based on its charge, size, and charge-to-mass ratio (∼92 kDa, −26 net charge, −0.282). We also note that these native PAGE experiments employed gradient gels, which will retard the mobility of a larger-sized protein complex that carries a high net negative charge as it progresses further along the length of the gel toward the positive pole of the electric field where the pore size continues to decrease. Thus, some variability in migration distance as a function of charge-to-mass ratio is expected. Collectively, the presence of a single Coomassie band at a unique migration distance relative to the individual proteins suggested that NL-A, sfGFP-B, and mRuby-C co-assembled into a supramolecular complex. Further, the location of the bands of the individual proteins relative to the band for the mixture suggested that the proteins likely did not undergo significant self-association, and thus the discrepancies in empirical and theoretical MW as predicted by SEC were likely due to improper assumptions of protein hydrodynamic shape.
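The charge-to-mass reasoning can be checked with simple arithmetic. The sketch below back-calculates the net charge of each fusion from the quoted ratios and theoretical MWs, assuming the ratios are expressed as net charge per kDa (an interpretation that is consistent with −26/92 ≈ −0.282 but is not stated explicitly), and then recomputes the value for the heterotrimer. The per-protein net charges are therefore inferred, not reported.

```python
# (theoretical MW in kDa, charge-to-mass ratio) as quoted in the text.
PROTEINS = {
    "NL-A": (25.1, -0.398),
    "sfGFP-B": (34.5, -0.377),
    "mRuby-C": (32.5, -0.09),
}


def net_charge(mw_kda, charge_to_mass):
    """Back-calculate an integer net charge, assuming the ratio is charge per kDa."""
    return round(mw_kda * charge_to_mass)


charges = {name: net_charge(mw, ratio) for name, (mw, ratio) in PROTEINS.items()}
total_charge = sum(charges.values())
total_mw = sum(mw for mw, _ in PROTEINS.values())
trimer_ratio = total_charge / total_mw

# Rank from most to least negative ratio, i.e. longest to shortest expected migration.
ranking = dict(PROTEINS)
ranking["heterotrimer"] = (total_mw, trimer_ratio)
order = sorted(ranking, key=lambda name: ranking[name][1])

print(charges, total_charge, round(trimer_ratio, 3))
print(" > ".join(order))
```

On this basis the heterotrimer falls between sfGFP-B and mRuby-C, matching the intermediate migration described above; the ranking ignores the size and shape effects of the gradient gel.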
Under native PAGE conditions, NL-A, sfGFP-B, and mRuby-C are expected to maintain their folded-state functions (i.e., luminescence, green fluorescence, and red fluorescence, respectively). To determine if all three proteins were co-localized at the site of the single Coomassie band identified after electrophoresis of the ternary mixture, we subjected the gel to blue light transillumination to induce green and red fluorescence emission, and then treated it with the NL substrate furimazine to evaluate luminescence. When subjected to blue light transillumination, we observed a yellow band at the same location as the Coomassie band, which is the expected output for overlapping green and red fluorescence emission (Fig. 2B). Likewise, we observed luminescence from the gel treated with furimazine at the same location as the Coomassie band (Fig. 2C). Collectively, these observations indicated that NL-A, sfGFP-B, and mRuby-C were co-localized at the site of the single band observed with Coomassie stain after subjecting the ternary mixture to electrophoresis. However, we note that a weak red fluorescent band was also observed at the same location as mRuby-C when alone, while stronger green and blue bands were observed at the same locations as sfGFP-B and NL-A, respectively. These bands indicate that some fraction of the proteins were unassembled in the ternary mixture. We note that the weakened signal from mRuby-C is likely due to its weak excitation upon blue light transillumination, while the absence of the NL-A, sfGFP-B, and mRuby-C bands in the Coomassie stained gel is likely due to the detection limit of the dye. Observing some fraction of proteins in the unassembled state would be expected here because this is an equilibrium system, where the ratio of assembled to unassembled proteins would be governed by the dissociation constant of the coiled-coil. 
We note that for a binary system, which is considerably easier to model quantitatively, 99% assembly on a 1 micromolar protein basis would require a dissociation constant in the low nanomolar range. In a ternary system, which is expected to be thermodynamically less favorable than a binary system due to the lower statistical likelihood of A + B + C collision, the dissociation constant required for 99% assembly on a 1 micromolar protein basis would also be on the order of nanomolar or lower. Most coiled-coil complexes reported to date have dissociation constants that are in the nano- to micromolar range. Thus, quantitative or near-quantitative conversion should not be expected in this system. However, because this method is not quantitative as performed, the excess green fluorescence and blue luminescence relative to red fluorescence in the unassembled fraction could also be due to errors arising from practical limitations of measuring protein concentration.
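The binary case can be worked through explicitly. The sketch below assumes a simple 1:1 equilibrium, A + B ⇌ AB, with equal total concentrations of both partners and no competing self-association; the function names are hypothetical.

```python
import math


def kd_for_fraction_bound(total_molar, fraction):
    """Kd (in M) giving the target assembled fraction for A + B <=> AB, equal totals."""
    if not 0 < fraction < 1:
        raise ValueError("fraction must lie strictly between 0 and 1")
    free = total_molar * (1 - fraction)   # free [A] = free [B]
    bound = total_molar * fraction        # [AB]
    return free * free / bound


def fraction_bound(total_molar, kd):
    """Assembled fraction at equal totals, from [AB]^2 - (2T + Kd)[AB] + T^2 = 0."""
    b = 2 * total_molar + kd
    ab = (b - math.sqrt(b * b - 4 * total_molar ** 2)) / 2
    return ab / total_molar


kd_99 = kd_for_fraction_bound(1e-6, 0.99)
print(f"Kd for 99% assembly at 1 uM: {kd_99 * 1e9:.2f} nM")
print(f"Fraction assembled at Kd = 1 uM: {fraction_bound(1e-6, 1e-6):.2f}")
```

Under these assumptions the required Kd comes out near 0.1 nM, so "low nanomolar" is best read as an order-of-magnitude statement, and a micromolar Kd would leave well over half of the protein unassembled at 1 µM.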
Due to overlap of their emission and excitation spectra, respectively, proximal NL and sfGFP domains can demonstrate bioluminescence resonance energy transfer, or BRET, wherein photons emitted from NL can mediate excitation and, in turn, emission of green light from sfGFP in the absence of discrete sfGFP excitation.49 Likewise, proximal sfGFP and mRuby domains can demonstrate fluorescence resonance energy transfer, or FRET, wherein photons emitted from sfGFP can mediate excitation and, in turn, emission of red light from mRuby.50 If NL, sfGFP, and mRuby are all co-localized, NL emission can induce emission of green light from sfGFP, which in turn induces emission of red light from mRuby via sequential BRET/FRET.46 Heterodimeric α-helical coiled-coil tags have previously been shown to enable FRET because they place fused fluorescent protein domains sufficiently close to enable efficient photon transfer.33 Here, we used sequential BRET/FRET to evaluate the co-assembly of NL-A, sfGFP-B, and mRuby-C into a heterotrimer (Fig. 3A). When furimazine was added to a solution containing an equimolar ratio of NL-A, sfGFP-B, and mRuby-C, a peak associated with NL emission was observed along with two additional emission peaks at λ = 515 nm and λ = 585 nm (Fig. 3B, solid and dashed black traces), which correspond to the emission maxima of sfGFP and mRuby, respectively. In contrast, when furimazine was added to a control solution containing an equimolar ratio of NL lacking A (“WT-NL”), sfGFP lacking B (“WT-sfGFP”), and mRuby-C, only blue light emission was detected (Fig. 3B, blue trace). Comparison of the raw emission spectra for the heterotrimer and control groups indicated that the NL signal intensity in the heterotrimer group was diminished by ∼20% relative to the control group (ESI† S3). Some decrease in NL “donor” emission should be expected in the heterotrimer group when compared to the control because photons from NL would be absorbed by the acceptor sfGFP during BRET. 
Although not performed here, donor decay can be used to measure resonance energy transfer efficiency and provide some sense of intermolecular distance.
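For orientation, the standard donor-quenching relations can be sketched as follows. This is an illustration only: it treats the ∼20% drop in NL intensity as if it arose entirely from energy transfer to a single acceptor, which the text does not claim, and it reports distance in units of the Förster radius R0 because no R0 value is given for this donor–acceptor pair.

```python
def transfer_efficiency(donor_with_acceptor, donor_alone):
    """Apparent efficiency from donor quenching: E = 1 - I_DA / I_D."""
    if donor_alone <= 0:
        raise ValueError("donor-alone intensity must be positive")
    return 1.0 - donor_with_acceptor / donor_alone


def distance_in_r0(efficiency):
    """Donor-acceptor distance in units of R0, from E = 1 / (1 + (r / R0)**6)."""
    if not 0 < efficiency < 1:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return ((1.0 - efficiency) / efficiency) ** (1.0 / 6.0)


# Illustrative inputs: donor emission reduced by ~20% relative to the control.
efficiency = transfer_efficiency(donor_with_acceptor=0.8, donor_alone=1.0)
print(f"Apparent efficiency: {efficiency:.2f}")
print(f"Distance: {distance_in_r0(efficiency):.2f} x R0")
```

An apparent efficiency near 0.2 would place the pair somewhat beyond R0, although unassembled protein and concentration differences between samples would both bias an intensity-based estimate of this kind.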
As an additional control for FRET within the heterotrimer, we also excited the ternary mixture of NL-A, sfGFP-B, and mRuby-C with light at 485 nm (i.e., sfGFP excitation maximum) and 558 nm (i.e., mRuby excitation maximum). When the heterotrimer mixture was excited with 558 nm light, we observed a strong peak corresponding to the emission of mRuby, without any significant emission at the wavelengths corresponding to sfGFP or NL emission (Fig. 3C, red trace). When the heterotrimer mixture was excited with 485 nm light, we observed a strong peak corresponding to the emission of sfGFP-B as well as a small shoulder in the wavelength region that overlaps with mRuby emission (Fig. 3C, green trace), suggesting weak FRET between sfGFP-B and mRuby-C in this sample. When a control mixture was excited with 485 nm light, we observed a strong peak corresponding to the emission of sfGFP but the shoulder overlapping with mRuby emission was absent (ESI† S2), indicating that the weak FRET observed in the heterotrimer sample was enabled by fusion of sfGFP and mRuby to the B and C peptides, respectively. We note that the weaker mRuby emission observed when the ternary mixture of NL-A + sfGFP-B + mRuby-C was excited with 485 nm light (Fig. 3C), as compared to the stronger emission observed when furimazine was added to the mixture (Fig. 3B), could be due to direct BRET between NL-A and mRuby-C in the latter case. Red-shifted BRET has been reported before,51 and the NL-A control spectrum (Fig. 3B, blue trace) indicates that photons are emitted with a wavelength that falls within the excitation spectrum of mRuby (558 nm). Taken together with the native PAGE data in Fig. 2, these observations demonstrated that A, B, and C mediate co-assembly of NL, sfGFP, and mRuby into a ternary construct with emergent function (i.e., sequential BRET/FRET) attributed to the co-integrated activity of each protein domain.
An inherent benefit of coiled-coil scaffolds is that the N- and C-termini could, in principle, each be fused to a different protein domain to create assemblies with an even greater range of functional capabilities. For example, homomeric coiled-coils have been used as scaffolding to create fluorescent probes to study the emergent signaling function of galectin-3 or constructs that combine the signaling activity of galectin-1 and galectin-3.19,20,42,43 Here, we tested whether an additional protein domain could be co-integrated into assemblies of NL-A, sfGFP-B, and mRuby-C by replacing NL-A with a new fusion protein wherein galectin-3 was fused onto the C-terminus of the A domain (“NL-A-Gal3”) (Fig. 4A). We first subjected each protein and an equimolar ternary mixture to native PAGE to evaluate protein co-assembly qualitatively (Fig. 4B). A single band was observed in each lane corresponding to the individual proteins following Coomassie staining, whereas two bands were observed in the lane corresponding to the ternary mixture. Subjecting the gel to blue light transillumination and furimazine demonstrated that the lower band in the ternary mixture lane emitted yellow fluorescence (indicative of sfGFP and mRuby co-localization), as well as blue luminescence (indicative of NL co-localization). In contrast, the upper band emitted red fluorescence with a consistent hue and at the same location as the band for mRuby-C alone. We note that some unassembled sfGFP-B (green fluorescence) and NL-A-Gal3 (blue luminescence) were also detected in the lane corresponding to the ternary mixture via transillumination and furimazine, although not with Coomassie. These results demonstrate that NL-A-Gal3, sfGFP-B, and mRuby-C can co-assemble, although the assembly efficiency does not approach 100% and the integration of mRuby-C may be hindered by the presence of the Gal3 domain.
Physical characterization by size-exclusion chromatography and dynamic light scattering suggested that the heterotrimeric assembly of NL-A-Gal3, sfGFP-B, and mRuby-C has a molecular weight and hydrodynamic size in a range comparable to that of previously reported homotrimeric assemblies fused to Gal3 (ESI† S4).42,43 The lower molecular weight shoulder in the SEC trace may be due to unassembled NL-A-Gal3, sfGFP-B, and mRuby-C. The higher molecular weight shoulders in the SEC trace may reflect differences in the hydrodynamic shape of the heteroassemblies, and not the formation of larger (e.g., non-specific aggregates) or smaller (e.g., dimer) constructs. This is supported by the DLS measurements, which did not identify larger aggregates even though such aggregates would be expected to be detected with greater sensitivity because they scatter more light, as well as by filtration studies showing that the protein concentration did not change considerably after passage through a 0.2 micron filter (ESI† S4B). Caution should also be taken with the assumption that NL-A-Gal3 + sfGFP-B + mRuby-C can be approximated as a globular hard sphere, given that each reporter protein is linked to the 35 amino acid coiled-coil strand by a flexible 16 amino acid linker, while galectin-3 has an ∼130 amino acid globular C-terminal domain linked to an ∼110 amino acid long, intrinsically disordered N-terminal domain.52 Under the present conditions, we do not know and cannot adequately predict if the unstructured domain of galectin-3 is in a compact or extended conformation in these assemblies. As noted above for the SEC analysis of NL-A, sfGFP-B, and mRuby-C, an extended conformation of the galectin-3 domain would be expected to decrease the elution volume of the construct from the SEC column, which would result in observing a higher-than-expected molecular weight for the assembly.
Galectin-3 can bind to lactose immobilized on agarose chromatography beads and be eluted via a soluble lactose gradient.42,43 In contrast, neither sfGFP nor mRuby demonstrates lactose binding affinity. Thus, the co-assembly of sfGFP-B and mRuby-C with NL-A-Gal3 would be expected to endow sfGFP-B and mRuby-C with lactose binding affinity. For the ternary mixture of NL-A-Gal3, sfGFP-B, and mRuby-C, immobilized lactose affinity chromatography identified two elution peaks measured via UV absorbance (λ = 280 nm), one in the non-binding “void” fraction and the other at a soluble lactose concentration that was similar to the concentration of lactose required to elute wild-type Gal3 (red dashed line) (Fig. 4C compared to ESI† S5). The correlation of the ternary mixture bound fraction elution with that of wild-type Gal3 suggested that assembly did not significantly alter Gal3 carbohydrate-binding affinity, consistent with other previously reported Gal3 fusions.42,43 Calculation of the area under the curve indicated that ∼66% of the protein was in the bound fraction. In contrast, more than 98% of wild-type galectin-3 was in the bound fraction. Consistent with the native PAGE analyses, this suggested that some of the proteins were in the unassembled state in the ternary mixture, as would be expected for an equilibrium system. It is assumed that the proteins eluting in the void fraction were primarily sfGFP-B and mRuby-C, which lack carbohydrate-binding affinity.
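An area-under-the-curve estimate of the bound fraction can be sketched as below. The chromatogram here is synthetic (two Gaussian peaks with areas deliberately set in a 34:66 ratio), the split volume between void and bound peaks is chosen by hand, and all names are hypothetical; none of these values are the measured data.

```python
import math


def trapezoid_area(x, y):
    """Area under y(x) by the trapezoidal rule."""
    return sum((x[i + 1] - x[i]) * (y[i] + y[i + 1]) / 2 for i in range(len(x) - 1))


def bound_fraction(volume, a280, split_volume):
    """Fraction of the total A280 peak area eluting at or after split_volume."""
    total = trapezoid_area(volume, a280)
    idx = [i for i, v in enumerate(volume) if v >= split_volume]
    bound = trapezoid_area([volume[i] for i in idx], [a280[i] for i in idx])
    return bound / total


def gaussian(v, center, sigma, amplitude):
    """Simple Gaussian peak used to build the synthetic trace."""
    return amplitude * math.exp(-((v - center) ** 2) / (2 * sigma ** 2))


# Synthetic trace: void peak at 5 mL, bound peak at 20 mL (illustrative only).
volume = [i * 0.1 for i in range(301)]
a280 = [gaussian(v, 5.0, 1.0, 34.0) + gaussian(v, 20.0, 1.0, 66.0) for v in volume]

frac = bound_fraction(volume, a280, split_volume=12.5)
print(f"Bound fraction: {frac:.2f}")
```

In practice the result depends on baseline subtraction and on where the void and bound peaks are separated, and A280 area weights each protein by its own extinction coefficient rather than by moles.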
To confirm that the observed lactose-dependent elution of the mixture of NL-A-Gal3, sfGFP-B, and mRuby-C was due to specific Gal3:lactose interactions, we also characterized the elution of the control mixture of NL-A, sfGFP-B, and mRuby-C. All of the protein (100%) was identified in the void fraction (Fig. 4D), confirming that the observed bound fraction in the elution profile of the ternary mixture of NL-A-Gal3 + sfGFP-B + mRuby-C was due to the presence of the NL-A-Gal3 fusion protein.
Although lactose affinity chromatography demonstrated that the majority of the protein in the ternary NL-A-Gal3 + sfGFP-B + mRuby-C mixture eluted in the bound fraction, A280 measurements cannot reliably determine if NL, sfGFP, and mRuby are present in this fraction. To determine if NL-A-Gal3, sfGFP-B, and mRuby-C were co-assembled in the bound fraction, we treated the eluent with furimazine and measured BRET/FRET. We observed a strong peak corresponding to NL luminescence emission, as well as shoulders indicative of sfGFP emission (via BRET) and mRuby emission (via FRET) (Fig. 4E). This result demonstrates that NL-A-Gal3, sfGFP-B, and mRuby-C co-assembled into a heterogeneous complex that endowed sfGFP-B and mRuby-C with lactose-binding affinity while also maintaining BRET/FRET capability between the reporter domains. When taken in conjunction with the native PAGE (Fig. 4B) and SEC (ESI† S4) data for this mixture, these analyses suggest that NL-A-Gal3, sfGFP-B, and mRuby-C co-assembled into a heterotrimeric construct demonstrating the functional properties of each co-integrated protein domain.
Comparison of the spectra in Fig. 4E to that in Fig. 3B indicated that the relative sfGFP emission in the NL-A-Gal3 + sfGFP-B + mRuby-C sample was lower than that in the NL-A + sfGFP-B + mRuby-C sample. This could be due to a variety of factors, including a change in the intermolecular distance between NL and sfGFP in the fusion construct with the Gal3 domain which would be expected to decrease BRET efficiency. Additionally, a decrease in BRET could be due to a loss of NL in the NL-A-Gal3 + sfGFP-B + mRuby-C bound fraction eluent. Recall that the NL-A + sfGFP-B + mRuby-C mixture was not subjected to chromatography before BRET/FRET measurements, whereas nearly a third of the protein was eluted in the void fraction of the NL-A-Gal3 + sfGFP-B + mRuby-C mixture. The protein content of the void fraction was not analyzed, and although one would expect sfGFP-B and mRuby-C that lack lactose-binding affinity to dominate this fraction, it would not be unreasonable to assume some NL-A-Gal3 was also present therein. Finally, we would expect that NL-A-Gal3 and the heterotrimer of NL-A-Gal3 + sfGFP-B + mRuby-C would have similar affinity for immobilized lactose and, as such, a similar elution profile in response to the soluble lactose gradient. In contrast, sfGFP-B and mRuby-C would be eliminated in the void fraction. Thus, the bound fraction eluent of NL-A-Gal3 + sfGFP-B + mRuby-C would be expected to have excess NL-A-Gal3 that is in the unassembled state and would not contribute to BRET/FRET. Consistent with this, native PAGE demonstrated that some unassembled NL-A-Gal3 was present in the ternary mixture.
Informed by these data, we next tested whether the Gal3 domain could be fused to either the B or C tag by replacing sfGFP-B with sfGFP-B-Gal3 or mRuby-C with mRuby-C-Gal3 (Fig. 5A). Three bands were observed in the native PAGE lane corresponding to the mixture of NL-A + sfGFP-B-Gal3 + mRuby-C following Coomassie staining, whereas only one band was observed in the lane corresponding to the mixture of NL-A + sfGFP-B + mRuby-C-Gal3 (ESI† S6, top row). Two of the bands in the mixture of NL-A + sfGFP-B-Gal3 + mRuby-C were at the same locations as the bands for sfGFP-B-Gal3 and mRuby-C alone, albeit at lower staining intensity, suggesting that some fraction of these proteins was in the unassembled state in the ternary mixture. We note that the staining intensity of sfGFP-B-Gal3 and mRuby-C alone was significantly higher than that of NL-A, suggesting that the former may have been in molar excess of NL-A in this experiment. The third band was in a unique location, suggesting formation of a heteroassembly in this mixture. Likewise, the single band in the lane for the mixture of NL-A + sfGFP-B + mRuby-C-Gal3 was in a unique location relative to the bands for the proteins alone, suggesting that this trio also formed a heteroassembly.
To evaluate the co-localization of the NL, sfGFP, and mRuby fusion proteins, we subjected the native PAGE gel to blue light transillumination and furimazine (ESI† S6, middle and bottom row). Yellow fluorescence and blue luminescence were co-localized at the site of the unique Coomassie bands in the lane for each ternary mixture. Consistent with the Coomassie staining, sfGFP-B-Gal3 and mRuby-C were detected alone in the mixture of NL-A + sfGFP-B-Gal3 + mRuby-C, whereas no NL-A was detected. Notably, this suggested that sfGFP-B-Gal3 and mRuby-C did not form a dimer in the absence of A. Both unassembled sfGFP-B and NL-A were detected in the lane for the mixture of NL-A + sfGFP-B + mRuby-C-Gal3. Unexpectedly, we also observed an intermediate band in the lane corresponding to the ternary mixture of NL-A + sfGFP-B + mRuby-C-Gal3 that emitted yellow fluorescence (indicative of red and green fluorescence co-emission) and blue luminescence (indicative of NL co-localization), which was not detected with Coomassie staining. This could be indicative of a different physical state of the heterotrimer, for example, if the Gal3 domain was in a compact state versus an extended state during electrophoresis, where the latter would correlate to a larger hydrodynamic size and, in turn, shorter migration distance.
To further evaluate the co-assembly of these fusion proteins into a heterotrimer, we subjected the ternary mixtures of NL-A + sfGFP-B-Gal3 + mRuby-C and NL-A + sfGFP-B + mRuby-C-Gal3 to lactose affinity chromatography. Protein eluted in both the unbound void fraction and as a bound fraction released by a soluble lactose competitor (Fig. 5B). Both heterotrimers eluted from the lactose chromatography column at a similar soluble lactose concentration as wild-type Gal3 and the NL-A-Gal3 + sfGFP-B + mRuby-C heterotrimer, suggesting that the carbohydrate-binding affinity of Gal3 was not affected by fusion to either the B or C peptide. Quantification of the area under the curve indicated that ∼75% of the protein was in the bound fraction in the NL-A + sfGFP-B-Gal3 + mRuby-C mixture, whereas only ∼61% of the protein was in the bound fraction for the NL-A + sfGFP-B + mRuby-C-Gal3 mixture. When compared alongside the analysis of the NL-A-Gal3 + sfGFP-B + mRuby-C mixture, this suggested that the extent of heterotrimer assembly was highest when Gal3 was fused to sfGFP-B and lowest when Gal3 was fused to mRuby-C.
Finally, we measured BRET/FRET in the bound fraction eluent collected from the ternary mixtures of NL-A + sfGFP-B-Gal3 + mRuby-C and NL-A + sfGFP-B + mRuby-C-Gal3 to determine if the three proteins co-eluted. A strong NL emission peak, as well as weaker sfGFP and mRuby emission peaks, were observed when furimazine was added to the bound eluent fraction collected from both ternary mixtures (Fig. 5C). However, the mixture of NL-A + sfGFP-B-Gal3 + mRuby-C (Fig. 5C, top) yielded more pronounced green fluorescence emission than the mixtures of NL-A-Gal3 + sfGFP-B + mRuby-C (Fig. 4E) and NL-A + sfGFP-B + mRuby-C-Gal3 (Fig. 5C, bottom), which generally correlated with the trend for the percentage of protein in the bound fraction of each mixture. We also note that in contrast to the mixture analyzed in Fig. 4E, any unassembled NL-A was likely to be eliminated in the void fraction of the NL-A + sfGFP-B-Gal3 + mRuby-C and NL-A + sfGFP-B + mRuby-C-Gal3 mixtures. Thus, the BRET/FRET analysis in Fig. 5C is only assessing the activity of proteins in the assembled state. Taken together, the results presented in Fig. 4 and 5 demonstrate that an additional functional domain can be appended onto the C-terminus of the coiled-coil scaffold, increasing the total types of functionalities in the assembly from 3 to 4. However, the co-assembly of A, B, and C may be affected by attaching protein cargoes onto both the N- and C-termini and, although not demonstrated here, the extent of this effect is likely to depend on the physical properties of the protein.
To further extend the functional capabilities of this platform, we asked whether the lactose-binding affinity conferred upon NL-A, sfGFP-B, and mRuby-C via their heterotrimeric co-assembly could be tuned by varying the number of Gal3 domains, including dimeric and trimeric Gal3 (Fig. 6A). We recently demonstrated that the lactose-binding affinity of synthetic Gal3 homo-oligomers could be tuned by varying the number of domains from one to six.43 To test the tunability of heterotrimer carbohydrate-binding affinity, we evaluated different mixtures of NL-A, sfGFP-B, and mRuby-C with and without the Gal3 domain. Native PAGE suggested that any combination of A, B, and C fusion proteins with or without Gal3 domains could co-assemble, as indicated by unique Coomassie and co-localized luminescent/fluorescent bands in the mixtures when compared to the proteins alone (Fig. 6B and ESI† S7). As with the other ternary mixtures studied above, excess NL and sfGFP fusions were detected in the lanes corresponding to each ternary mixture in the native PAGE gels, suggesting that co-assembly does not approach completion under any condition tested.
We used lactose-affinity chromatography to further characterize the co-assembly of the mixtures of fusion proteins with different numbers of Gal3 domains. Here, we expected that the relative binding affinity, which correlates with the soluble lactose competitor concentration, would increase when two proteins with Gal3 domains were co-assembled together. When a mixture of NL-A + sfGFP-B-Gal3 + mRuby-C-Gal3 was passed through a lactose-agarose column, the elution profile shifted to the right relative to the elution for wild-type Gal3 (Fig. 6C, left), as well as to the heterotrimers with one Gal3 domain studied above (Fig. 4C and 5B). The rightward shift of this elution profile was similar to the elution profile for an sfGFP-Gal3 homodimer reported previously,43 suggesting that two Gal3 domains were co-integrated into the construct that eluted in the bound fraction from the ternary mixture studied here. Quantification of the area under the curve demonstrated that ∼71% of the protein was in the bound fraction, indicating that the presence of the second Gal3 domain did not adversely affect A + B + C co-assembly when compared to the relative efficiency of assembly of constructs with one Gal3 domain.
Likewise, when a mixture of NL-A-Gal3 + sfGFP-B-Gal3 + mRuby-C-Gal3 was passed through a lactose-agarose column, the elution profile shifted even further to the right, suggesting a further increase in apparent lactose-binding affinity (Fig. 6C, right). The rightward shift of this elution profile was similar to the elution profile for an sfGFP-Gal3 homotrimer reported previously,43 suggesting that three Gal3 domains were co-integrated into the eluted construct. Quantification of the area under the curve demonstrated that ∼84% of the protein was in the bound fraction, indicating that the presence of the third Gal3 domain did not adversely affect A + B + C co-assembly when compared to the relative efficiency of assembly of constructs with one or two Gal3 domains. Notably, this was the highest percentage of protein recovered from any bound fraction; the low level of detectable protein at the concentration of lactose required to release wild-type Gal3 suggested that the majority of proteins in this ternary mixture were either co-assembled and bound immobilized lactose, or were eluted in the void fraction. We did not characterize the state of the proteins in the void fraction, but suggest that their inability to bind lactose could be due either to the large size of the construct preventing efficient access to the immobilized lactose or to a steric impediment imposed by neighboring domains that prevented Gal3 interaction with the immobilized lactose.
Finally, we measured BRET/FRET to determine if all three proteins were present and spatially co-localized in the bound fraction eluent. Recall that in control samples containing NL, sfGFP, and mRuby, which cannot co-assemble, no BRET/FRET was observed because the domains were not in close enough proximity (Fig. 3B). Thus, any BRET/FRET observed here would be expected to be due to co-assembly of NL-A-Gal3, sfGFP-B-Gal3, and mRuby-C-Gal3. A strong NL emission peak, as well as weaker sfGFP and mRuby emission peaks, were observed when furimazine was added to the bound fraction eluents collected from the Gal3x2 and Gal3x3 mixtures (Fig. 6D). This indicated that the three different proteins were co-assembled in a manner that placed the NL, sfGFP, and mRuby domains sufficiently close for BRET and FRET. However, the efficiency of BRET was lower in these heterotrimers when compared to heterotrimers formed with either zero or one Gal3 domain. This could be due to an increase in the intermolecular distance between NL and sfGFP in assemblies with an increasing number of Gal3 domains, or other physical features that we have not yet considered.
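The distance argument above can be made concrete with the standard Förster relation for resonance energy transfer, which is commonly applied to both BRET and FRET. The symbols below are not defined in this work and are introduced only for illustration: E is the transfer efficiency, r is the donor-acceptor separation, and R_0 is the Förster distance of the pair, at which E = 0.5. Values of R_0 for the NL/sfGFP and sfGFP/mRuby pairs were not determined here.

```latex
E = \frac{1}{1 + \left( r / R_0 \right)^{6}}
\qquad\Rightarrow\qquad
E(r = R_0) = 0.5, \quad
E(r = 1.2\,R_0) = \frac{1}{1 + 1.2^{6}} \approx 0.25
```

Under this relation, a 20% increase in separation beyond R_0 would roughly halve the transfer efficiency, so a modest change in domain spacing upon adding Gal3 domains could plausibly account for a visibly weaker sfGFP emission shoulder. The relation also assumes a fixed relative dipole orientation, which may change with construct design.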
Nonetheless, there remains room for improvement with this approach. In particular, the results presented here demonstrate that protein co-assembly is not quantitative or near-quantitative under the employed conditions. Rather, some unassembled protein was detected in each ternary mixture studied, where the unassembled protein concentration was likely determined by the dissociation (or association) constant of the coiled-coil. Future efforts to design a peptide trio demonstrating higher heterogeneous co-assembly affinity and minimal off-pathway homogeneous self-association could lead to more stable constructs with greater extent of formation. Further, the results presented here suggest that appending proteins onto the termini of the A, B, or C peptides may affect their co-association, although the dissociation constants of the heterotrimers were not measured on a case-by-case basis here. Future efforts to optimize the peptide affinity and/or the construct design (e.g., linker length) could yield improvements in co-assembly irrespective of the appended protein cargo. Finally, BRET/FRET efficiency depends on intermolecular distance. Here, BRET/FRET was used as an analytical reporter and no effort was made to optimize these events. However, future efforts to optimize the construct design (e.g., linker length and protein domain orientation) could enable opportunities to create constructs demonstrating highly efficient coupled or synergistic chemical reactions.
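To illustrate how a dissociation constant would set the amount of unassembled protein, the following minimal sketch solves a deliberately simplified model. It assumes a single-step equilibrium A + B + C ⇌ ABC with equal total concentrations of the three fusion proteins and no homomeric or dimeric intermediates. The function name and the numerical values are hypothetical, concentrations are in arbitrary units, and no dissociation constants were measured in this work.

```python
def assembled_fraction(total_conc: float, kd: float) -> float:
    """Fraction of each protein incorporated into the ABC heterotrimer.

    Simplified model (an assumption, not a measured mechanism):
        A + B + C <=> ABC,  kd = [A][B][C] / [ABC]
    with equal total concentration `total_conc` of A, B, and C.
    `kd` carries units of concentration squared.
    """
    if total_conc <= 0 or kd <= 0:
        raise ValueError("total_conc and kd must be positive")
    # With t = [ABC], each free concentration is (total_conc - t), so
    # f(t) = (total_conc - t)**3 - kd * t decreases monotonically from
    # f(0) > 0 to f(total_conc) < 0; bisection locates the single root.
    lo, hi = 0.0, total_conc
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if (total_conc - mid) ** 3 - kd * mid > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi) / total_conc


# Illustrative values only: tighter binding (smaller kd) raises the
# assembled fraction at a fixed total concentration.
for kd in (1.0, 0.25, 0.01):
    print(f"kd = {kd:5.2f} -> assembled fraction = "
          f"{assembled_fraction(1.0, kd):.3f}")
```

In this model the assembled fraction approaches but never reaches 1 at finite concentration, which is qualitatively in line with the unassembled protein detected in each ternary mixture; a stepwise assembly pathway would require a more detailed treatment.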
Footnote
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1me00083g
This journal is © The Royal Society of Chemistry 2022