Open Access Article
Ruidi
Wang
ab,
Jiayu
Li
b,
Xiumei
Li
a,
Jin
Guo
b,
Junqiu
Liu
a and
Hongbin
Li
*b
aState Key Laboratory of Supramolecular Structure and Materials, College of Chemistry, Jilin University, Changchun 130012, P. R. China
bDepartment of Chemistry, The University of British Columbia, Vancouver, BC V6T 1Z1, Canada. E-mail: hongbin@chem.ubc.ca
First published on 20th August 2019
The giant muscle protein titin is the largest protein in cells and responsible for the passive elasticity of muscles. Titin, made of hundreds of individually folded globular domains, is a protein polymer with folded globular domains as its macromonomers. Due to titin's ultrahigh molecular weight, it has been challenging to engineer high molecular weight artificial protein polymers that mimic titin. Taking advantage of protein fragment reconstitution (PFR) of a small protein GB1, which can be reconstituted from its two split fragments GN and GC, here we report the development of an efficient, PFR-based supramolecular polymerization strategy to engineer protein polymers with ultrahigh molecular weight. We found that the engineered bifunctional protein macromonomers (GC-macromonomer-GN) can undergo supramolecular polymerization, in a way similar to condensation polymerization, via the reconstitution of GN and GC to produce protein polymers with ultrahigh molecular weight (with an average molecular weight of 0.5 MDa). Such high molecular weight linear protein polymers closely mimic titin and provide protein polymer building blocks for the construction of biomaterials with improved physical and mechanical properties.
The giant muscle protein titin is a representative of such tandem modular protein-based polymers. Titin has an ultrahigh molecular weight (MW): it is the largest single polypeptide chain in the cell and consists of more than 34
000 amino acid residues with a MW of more than 3.5 MDa.1,10,11 Titin is composed of more than two hundred individually folded immunoglobulin-like (Ig) and fibronectin-type III domains. These domains are each about 90 residues long and serve as the macromonomer for titin.11,12 The I-band part of titin is extensible and is largely responsible for the passive elastic properties of muscles.7,13 Truncation of titin, which leads to a much reduced MW, leads to the dysfunction of muscles, including cardiomyopathy.14
Due to its unique nanomechanical properties and its role in muscle mechanics, titin has attracted considerable interest as a potential building block for constructing protein-based biomaterials.15–17 However, it has been challenging to engineer titin-mimetic protein polymers with ultrahigh molecular weight. Even for the smallest titin isoform (the N2B cardiac titin), its I-band has a MW of ∼0.5 MDa with a contour length of ∼150 nm in its native state, due to the folded globular nature of the macromonomer.11 From the polymer length point of view, the I-band of human N2B cardiac titin is equivalent to a polystyrene of a MW of ∼110 kDa.
Various methods have been developed to engineer protein polymers, such as gene concatemerization and chemical conjugation (including disulfide bond formation and thiol–maleimide coupling).18–23 However, the resultant protein polymers have low degrees of polymerization. For example, the widely used polyprotein (I27)8 has a MW of 80 kDa and a contour length of ∼30 nm,15 which is equivalent to a polystyrene of a MW of only ∼21 kDa (a degree of polymerization of 200). Thus, such protein polymers in their folded states can only be considered as oligomers. The advantages/features associated with ultrahigh MW protein polymer titin thus cannot be fully realized in such oligomeric protein polymers. Here we report a protein fragment reconstitution-based supramolecular polymerization approach to synthesize artificial titin-mimetic protein polymers with ultrahigh molecular weight.
It is known that some proteins can be split into two fragments, which can then recognize each other via supramolecular interactions and reconstitute the folded conformation of the native protein spontaneously, a process called protein fragment reconstitution (PFR).24,25 The small protein GB1, the B1 IgG binding domain of protein G from Streptococcus (GB1), is one of such proteins.26 GB1 consists of 56 residues and assumes a β-grasp fold where an α-helix packs against a four stranded-β-sheet. When split into two fragments from the loop connecting the α-helix and β-strand 3, the two fragments A (1–40 aa) and B (41–56 aa) can reconstitute at a 1
:
1 ratio into the native GB1 fold, albeit with a nick in the loop.26–30 This non-covalent “GB1” fold has a dissociation constant Kd of ∼9 × 10−6 M.26 In our previous work, we engineered a loop elongation variant of GB1, termed GL5CC, where 5 residues were inserted into the unstructured loop and residues 42/44 were mutated to cysteines.31,32 When split into fragments GN (residue 1–42) and GC (residues 43–61), GN and GC can reconstitute into GB1's native fold (Fig. 1A). Under oxidizing conditions, the two cysteine residues can form a disulfide bond in the reconstituted GL5CC, converting the reconstituted noncovalent GL5CC to a covalently connected polypeptide chain.30 These unique properties make the PFR of GL5CC an attractive supramolecular driving force, which has been used to construct supramolecular hydrogels.30 Going a step further to fully explore the utility of PFR as a general supramolecular driving force, here we demonstrate the use of the PFR for engineering protein polymers with ultrahigh molecular weight via supramolecular polymerization. Our study not only enables the engineering of titin-mimetic protein polymers, but may also open up a new avenue toward addressing the long-standing challenge in engineering protein polymers with ultrahigh molecular weight for technological applications, such as spider silk production.
After transforming plasmids containing the desired gene into Escherichia coli strain DH5α, protein overexpression was carried out. The overnight starter was inoculated into the LB liquid medium with 100 μg mL−1 ampicillin and incubated until OD600 is about 0.6–0.8 at 37 °C. Then, 1 mM isopropyl-1-β-D-thiogalactoside was added to the culture to induce protein overexpression. Protein overexpression continued for about 4 hours. Then the cells were harvested by centrifugation at 5000 rpm, 4 °C for 10 min. The cells were lysed with 100 mg mL−1 lysozyme. Proteins were purified as soluble monomers from the supernatant using Co2+ affinity chromatography. Excess salts in the eluted proteins samples were removed by dialysis against water for 24 hours and the dialyzed protein samples were then lyophilized. Amino acid sequences of all the constructs used in this study are detailed in Table S1 in the ESI.† It is important to note that a temperature of 37 °C and the reducing cytoplasm of E. coli are essential in preventing the polymerization of GC-I27-GN in E. coli and ensure that GC-I27-GN is expressed as a soluble monomer, as the melting temperature of the reconstituted, reduced GL5CC is 23 °C.
:
1 ratio. For unfolding measurements, equal amounts of I27-GN and GC-I27 were mixed and reduced by 3 mM DTT first, then 4 M GdmCl solution was added to the I27-(GN–GC)-I27 complex in order to denature the reconstitution domain. The same processes were carried out for individual I27-GN and GC-I27 as negative controls. Protein concentrations in all tests were adjusted to 20 μM. The association kinetics curve was fitted to the second-order rate law. The dissociation kinetics curves were fitted to the first-order rate law.
The SEC calibration was achieved by using the following protein standards: (1) amylase: 200 kDa; (2) alcohol dehydrogenase: 150 kDa; (3) bovine serum albumin: 66 kDa; (4) carbonic anhydrase: 29 kDa and (5) cytochrome c: 12.4 kDa.
The PFR of GN and GC is directional, thus placing them in proper orientation relative to the macromonomer is essential for the construction of protein polymers with a high degree of polymerization. Fusing GN to the N-terminus of the protein macromonomer and GC to the C-terminus, the PFR of GL5CC will place the protein macromonomers right next to each other, resulting in a large steric hindrance that will likely prevent further reactions (Fig. S1†). In contrast, fusing GC to the N-terminus and GN to the C-terminus of the protein macromonomer will result in no steric hindrance, thus facilitating further reactions (Fig. 1).
To test if such an arrangement can lead to efficient PFR, we constructed mono-functional fusion proteins I27-GN and GC-I27. Since PFR relies on noncovalent supramolecular interactions, we used native polyacrylamide gel electrophoresis (PAGE), size-exclusion fast protein liquid chromatography (SE-FPLC) and stopped-flow fluorimetry to confirm the PFR of the I27-GN and GC-I27 under native and reducing conditions, where the disulfide bond formation is prevented. As shown in Fig. 2A, upon mixing I27-GN with GC-I27 at a 1
:
1 molar ratio, the original bands for I27-GN (lane 1) and GC-I27 (lane 2) disappeared. Instead, a new band emerged (lane 3), indicating that GN-I27 and I27-GC can recognize each other under reducing conditions and undergo effective protein fragment reconstitution. It is well known that protein mobility in native PAGE is determined by a combination of factors, including protein conformation, surface charge and MW; native PAGE is less suitable to determine the MW.34 Thus, the relative position of the three bands cannot be used to directly compare their MW.
To further confirm the native PAGE results, we carried out SE-FPLC experiments (Fig. 2B). The 1
:
1 mixture of I27-GN and GC-I27 eluted at the shortest time, followed by I27-GN and GC-I27. This result clearly indicated the successful reconstitution of I27-GN and GC-I27 at a 1
:
1 ratio, and is consistent with the expected MW of I27-GN, GC-I27 and their 1
:
1 reconstituted supramolecular complex. The stopped-flow spectrofluorimetry results also confirmed the successful and rapid reconstitution of I27-GN and GC-I27 under reducing conditions (Fig. S2†).
To further stabilize the reconstituted noncovalent GN–GC complex, we used air oxygen to oxidize the two engineered cysteines in GN and GC to a disulfide bond in the reconstituted GN–GC complex30,31 (Fig. 1A), thus converting a non-covalently associated GN–GC complex into a covalently linked folded protein domain. Fig. 2C shows the non-reducing sodium dodecyl sulfate (SDS)-PAGE results. After boiling and SDS denaturation, the oxidized reconstituted complex I27-(GN–GC)-I27 appeared as a single band with a MW of ∼28 kDa, the sum of the two interacting protein fragments (I27-GN: 15 kDa and GC-I27: 13 kDa), corroborating that GN–GC indeed formed a covalently linked reconstituted protein domain after oxidation.
Indeed, we found that the macromonomers spontaneously started to polymerize in aqueous solution right after the protein purification step. Fig. S3† shows the native PAGE image of the resultant polymeric assemblies under reducing conditions, which correspond to the non-covalent supramolecular protein polymers formed by the PFR. Gradual oxidation of the reconstituted GL5CC complex by air oxygen resulted in the formation of high molecular weight, covalently linked protein polymers. It is evident from SDS-PAGE that most macromonomer GC-I27-GN reacted with each other and was converted into protein polymers with a high degree of polymerization (Fig. 3A).
Fig. 3B shows the time course of the polymerization by PFR. It is evident that as the reaction proceeded, the amount of protein polymers with higher MW increased while the fraction of the macromonomer and lower MW species (dimer and trimer) decreased with time and even got completely consumed. This is characteristic of the classical condensation polymerization and confirms that the supramolecular polymerization via PFR indeed follows a mechanism similar to that of condensation polymerization. It is clear that after one day of reaction, a high degree of polymerization was achieved, and only a trace amount of dimers was still present. A ladder of protein polymers with different degrees of polymerization was clearly visible. Protein polymers with a degree of polymerization higher than 10 were not well-resolved in the SDS-PAGE. To characterize the MW of the protein polymers, we carried out SE-FPLC (Fig. 3C and D). Fig. 3D shows the molecular weight distribution of the resultant protein polymers. The number average molecular weight (Mn) and weight average molecular weight (MW) of the protein polymers are ∼505 kDa and 542 kDa, respectively, giving rise to a MW polydispersity index (PDI) of 1.09. Dynamic light scattering measurements revealed a diffusion averaged MW of 458 kDa, further corroborating the high molecular weight of the engineered protein polymers (Fig. S4, ESI†). Such a high molecular weight of the protein polymers constructed from the PFR is much larger than that of any other protein polymer reported so far constructed by either gene concatenation18,33 or chemical conjugation,19,20 representing an efficient and successful attempt to engineer a protein polymer with high molecular weight that is close to that of the I-band cardiac N2B titin.
To further prove that the polymerization is through the head-to-tail supramolecular condensation reaction, we included monofunctional species in the reaction mixture. The introduction of monofunctional reactants into a condensation polymerization reaction will terminate the growing polymer chain and significantly reduce the MW of the resultant polymers. Indeed, in the PRF-based supramolecular condensation reaction, the inclusion of 17% GC-I27 in the reaction mixture (the ratio of GC-I27-GN
:
GC-I27 is 5
:
1) significantly reduced the amount of the high molecular weight products (Fig. S5†). Further increasing the concentration of GC-I27 eliminated the high molecular weight products completely. These results confirmed that the titin-mimetic protein polymers are indeed obtained via the PRF-based supramolecular condensation polymerization reaction.
Since the protein polymers are obtained via the PRF-based supramolecular condensation polymerization reaction, protein monomer concentration may play important roles in determining MW and MW polydispersity. Our preliminary results showed that a lower concentration of the GC-I27-GN monomer led to a lower degree of polymerization. For example, the degree of polymerization of GC-I27-GN at 5 μM is quite low (Fig. S6A†). When the monomer concentration is high enough, further increasing monomer concentration appeared to increase the polydispersity of the molecular weight (Fig. S6B and C, Table S2†). However, to gain a better understanding of this supramolecular polymerization reaction, a detailed study of the effect of protein concentration on polymer length and distribution is needed.
The ultrahigh molecular weight titin-mimetic protein polymers we engineered here should allow for the engineering of protein hydrogels at much reduced protein concentration and with improved mechanical properties. To demonstrate this, we engineered protein-based hydrogels using the Ru(II)(bpy)32+-mediated photocrosslinking strategy. We found that GC-I27-GN-based polymer solution can be readily crosslinked into a solid hydrogel at a protein concentration as low as 1% (Fig. 5), and 1% protein hydrogels showed a storage modulus of 0.6 kPa. Higher protein concentrations lead to a higher storage modulus of the protein hydrogels (Fig. 5C). Compared with (GB1)8-based hydrogels (with the lowest gelation point of 5% and storage modulus of 4 kPa), the reduction in gelation concentration and improved mechanical properties of GC-I27-GN-based protein polymer hydrogels can be readily rationalized by the increased molecular weight of the protein polymers.
Another unique feature of this protein hydrogel lies in its responsiveness to the redox potential. Under oxidizing conditions, the GC-macromonomer-GN-based protein polymer is a covalently linked single polypeptide chain. However, in the reduced state, it is a supramolecular protein polymer assembled from the non-covalently associated GC–GN complex via PFR when the temperature is lower than the melting temperature Tm of the reconstituted reduced GL5CC (which is ∼23 °C).30 When the temperature is higher than Tm, the reconstituted GC–GN complex will dissociate into GN and GC, leading to the depolymerization of the protein polymer (GC-I27-GN)n. This property should entail the protein polymer and protein hydrogels with temperature and redox potential responsiveness. As shown in native PAGE (Fig. S7†), upon incubation in PBS buffer containing 5 mM DTT at 37 °C for 30 minutes, the protein polymers depolymerized to the monomer state. The protein polymer hydrogel showed similar behaviors. As shown in Fig. S8,† when incubated in PBS buffer containing 30 mM DTT overnight at room temperature (∼20 °C), the (GC-I27-GN)n hydrogel remained intact. However, when incubated in the same buffer at 50 °C for one hour, the protein hydrogel was completely dissolved, resulting from the disassembly of the non-covalent supramolecular polymer and the crosslinked network structure.
Here we have developed an efficient, protein fragment reconstitution-based supramolecular polymerization strategy to accomplish this goal. The bifunctional monomer state can be readily controlled via redox potential. Through supramolecular polymerization under ambient conditions, protein polymers of ultrahigh molecular weight (with a MW of 0.5 MDa), which is close to that of the I-band part of human cardiac titin, have been constructed. This study represents a successful attempt to engineer titin-mimetic protein polymers of ultrahigh molecular weight, which allows for the engineering of protein hydrogels with low protein polymer concentration and improved mechanical properties. Moreover, the engineered disulfide bond provides a redox-based method to depolymerize the engineered protein polymers to fully recover the monomers.
This PRF-based supramolecular polymerization is a step growth polymerization process in nature. Thus, the temperature and monomer concentration can, in principle, be used to tune the polymerization process to better control the molecular weight and molecular weight distribution. In particular, temperature will likely have an interesting effect on this polymerization reaction, as temperature will affect not only the GC–GN reconstitution (due to the relatively low melting temperature of the reconstituted GC–GN complex,∼23 °C), but also the oxidation kinetics to form a covalent GC–GN complex. For this, a detailed understanding of the rate law of this supramolecular polymerization is essential.
Moreover, this supramolecular polymerization approach is completely genetically encoded and modular. Different protein macromonomers, including protein chimera, can be readily fused with GC and GN to produce protein polymers with ultrahigh molecular weight. Thus, this method is general and should be applicable to a wide range of protein macromonomers. Compared with the protein shackle approach,23 which allows for the incorporation of other proteins as “side chains” into the protein shackle, the PRF-based method enables the incorporation of other proteins into the protein polymer backbone, thus enriching the toolbox for engineering high molecular weight protein polymers. Therefore, our approach may open up a new avenue, which can serve as an alternative to metabolic engineering, towards engineering proteins of ultrahigh molecular weight for technological applications, such as the engineering of artificial extracellular matrix protein fibronectin and spider dragline silk proteins. Thus, we anticipate that this PFR-based supramolecular polymerization method will find a broad range of applications in constructing protein-based biomaterials and functional assemblies.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc02128k |
| This journal is © The Royal Society of Chemistry 2019 |