Open Access Article
J. Bennett
Addison
a,
Thomas M.
Osborn Popp
a,
Warner S.
Weber
a,
Janice S.
Edgerly
b,
Gregory P.
Holland
a and
Jeffery L.
Yarger
*a
aDepartment of Chemistry and Biochemistry, Arizona State University, Tempe, Arizona 85287-1604, USA. E-mail: jyarger@gmail.com
bDepartment of Biology, Santa Clara University, Santa Clara, CA, USA
First published on 29th August 2014
Embiopterans produce silken galleries and sheets using exceptionally fine silk fibers in which they live and breed. In this study, we use electron microscopy (EM), Fourier-transform infrared (FT-IR) spectroscopy, wide angle X-ray diffraction (WAXD) and solid-state nuclear magnetic resonance (ssNMR) techniques to elucidate the molecular level protein structure of webspinner (embiid) silks. Silks from two species Antipaluria urichi and Aposthonia ceylonica are studied in this work. Electron microscopy images show that the fibers are about 90–100 nm in diameter, making webspinner silks among the finest of all known animal silks. Structural studies reveal that the silk protein core is dominated by β-sheet structures, and that the protein core is coated with a hydrophobic alkane-rich surface coating. FTIR spectra of native embiid silk shows characteristic alkane CH2 stretchings near 2800–2900 cm−1, which decrease approximately 50% after washing the silk with 2
:
1 CHCl3
:
MeOH. Furthermore, 13C ssNMR data shows a significant CH2 resonance that is strongly affected by the presence of water, supporting the idea that the silk fibers are coated with a hydrocarbon-rich layer. Such a layer is likely used to protect the colonies from rain. FTIR data also suggests that embiid silks are dominated by β-sheet secondary structures similar to spider and silkworm silk fibers. NMR data confirms the presence of β-sheet nanostructures dominated by serine-rich repetitive regions. A deconvolution of the serine Cβ NMR resonance reveals that approximately 70% of all seryl residues exist in a β-sheet structure. This is consistent with WAXD results that suggest webspinner silks are 70% crystalline, which is the highest crystalline fraction reported for any animal silks. The work presented here provides a molecular level structural picture of silk fibers produced by webspinners.
Webspinner silks are among the thinnest of all known silk-based biopolymers, but previous reports of fiber diameters are conflicting. Okada et al.4 found from SEM images that silks from the Australian webspinner Aposthonia gurneyi have fibers with a mean diameter of only 65 nm, while polarized optical microscopy data by Collin et al.5,6 found fiber diameters in the range of 500 nm for multiple embiopteran species. This discrepancy is curious, and is addressed in this work.
While the embiopteran taxonomy and behavior have been well studied,1–3,7–9 very little is known about the molecular-level structure of their silks. Significant insight was gained through cDNA sequencing of the primary silk protein sequence from multiple embiopteran species including An. urichi and Ap. ceylonica, which are the subjects of this study.6,10 Representations of the full (approximately 70 kDa) primary protein sequence for both species are seen in Fig. 1E. Like other silk-based biopolymers,11–13 webspinner silks are composed of highly repetitive protein sequences in which glycine, serine, and alanine are heavily represented.4,6,10 Although there exists very little experimental data characterizing webspinner silks, the obvious similarities between the embiopteran protein sequence and other well-characterized fibroin biopolymers allows us to make predictions. Silkworm and spider silks are also composed of highly repetitive proteins that contain runs of poly(GA) and poly(A).14,15 It is well known that these repeat motifs form water-inaccessable antiparallel β-sheet structures that are aligned along the fiber axis, and these rigid, nanocrystalline β-sheets are thought to be responsible for the impressive mechanical strengths of both spider and silkworm silks.16–18 When one looks at the cDNA sequences for many embiid silks,10 similarities are observed but alanine appears to have been mostly replaced by serine. Instead of runs of poly(GA), embiopteran primary protein sequences are dominated by runs of poly(GS) or GAGSGS repeats. Instead of runs of poly(A), some webspinner silks contain runs of poly(S). Serine has a higher propensity to adopt a β-sheet environment than alanine because of its ability to form side chain intra-sheet hydrogen bonds,19–21 therefore similar nanocrystalline β-sheet structures are fully expected from poly(GS), GAGSGS and poly(S) repeats found in embiopteran silks. Indeed, previous Fourier transform infrared spectroscopy studies on embiopteran silks have suggested that the fibers are dominated by β-sheet secondary structures within the protein core, presumably arising from said repetitive protein motifs.4–6,10
These β-sheet structures within webspinner silks are potentially nanocrystalline and well aligned with respect to the fiber axis, similarly to other well-characterized silks. The percentage of the total fiber content that is nanocrystalline varies significantly among animal silks. Studies on spider dragline silk fibers show that the crystallinity content ranges from about 30% (Nephila clavipes) to 40% (Latrodectus hesperus),23,24 while fibers from the domesticated silkworm are typically higher, in the range of 40–60% crystallinity.25,26 The crystalline fractions arise mainly from repeated sequences, thus the primary protein sequence for webspinner silks should yield predictive power; protein secondary structure within spider dragline silk fibers has been shown to correlate quantitatively with silk primary protein sequences.24,27 With the exception of short C-terminal domains, the protein sequences of many embiopteran silks are virtually entirely composed of repetitive motifs, which likely adopt a β-sheet structure. For example, if all of the repeat domains within embiopteran silk from the species An. urichi adopt a β-sheet nanocrystalline structure, then the percent crystallinity may be upwards of 90%. If true, embiopteran silks are substantially higher in nanocrystalline content than other well characterized silk biopolymers. To the best of our knowledge, no attempt has been made to measure the nanocrystallite size, orientation or crystalline fraction of any webspinner silks, which we address in this work through Wide Angle X-ray Diffraction (WAXD) techniques.
We are interested in better understanding the hierarchical structure of webspinner silks. Considering the many similarities in primary protein composition, it would not be surprising if the hierarchical structure of webspinner silk is similar to that of other well-characterized silks. The current skin-core model for spider dragline silks, for example, is a fibrous protein core that is encapsulated by a thin protective lipid-like coating.28 As discussed, the protein core of spider dragline silk is composed of nanocrystalline β-sheet structures with dimensions of a few nanometers aligned along the fiber axis, separated by loosely-organized sheet, helical or randomly-oriented domains.22,29–31 This protein core is surrounded by a thin protective coating that is rich in long chain lipids and alkanes.32–34 Very little work has been performed characterizing this outer protective coating on spider silks, although Schulz et al.34 conducted a thorough GC-MS analysis of the lipid and hydrocarbon content in silk from the orb weaver spider Nephila clavipes, revealing the presence of mostly long chain alkanes and methoxyalkanes of chain length between C28 and C34. The role of the lipid or alkane-rich surface coating is unclear, but it likely serves to both waterproof the silk fibers and maintain moisture within the silk core.32 Based on observations in the field and on laboratory colonies, embiopteran silken galleries are remarkably water-repellant, thus it is feasible that along with a β-sheet dominated protein core webspinner silk fibers possess a similar lipid or alkane-rich surface coating that serves a waterproofing purpose. Water-repellency would appear an especially adaptive trait for An. urichi, a tropical rainforest species from Trinidad and Tobago. Their colonies live on tree bark where they feed on epiphytic algae and lichens from within the protective covering of their silk. Rainfall is heavy, often torrential, and yet water appears to flow over the surface of the silk, leaving the insects dry beneath. Ap. ceylonica species are subtropical and tropical as well and are likely exposed to similar environmental conditions as the Trinidadian species. The availability of these two species in laboratory cultures and of published work on their silk provided us with an opportunity to further investigate the functional and structural aspects of embiopteran silk.
Interestingly, a few other silk-producing insects are known to contain a significant lipid content.35,36 It is often challenging to characterize a surface coating on spider silks with NMR techniques because this layer is thin relative to the overall fiber size and therefore hard to detect. The exceptionally fine fiber diameters of embiid silks provides us with a unique opportunity to observe and characterize any surface coating in addition to the protein core because of the higher surface area to volume ratio. In this present study, we use SEM, TEM, FT-IR, WAXD and solid-state NMR techniques to help elucidate both molecular-level protein structure within the silk fiber core, and to interrogate the alkane-rich surface coating surrounding the silk fibers.
:
2.5
:
2.5 TFA
:
TIS
:
H2O). The crude peptide was precipitated out of TFA using cold diethyl ether, and after several ether washes, the crude peptide was dried, dissolved in 6 M LiBr, and dialyzed against water at 4 °C for 4 days. Any water-insoluble peptide was centrifuged out, and solubilized peptide was recovered by lyophilization. This lyophilized peptide represents Ap. ceylonica silks in a random-coil structure. Some of this peptide was dissolved in formic acid and carefully pipetted onto a clean glass surface. The peptide crystallized into a β-sheet structure upon slow evaporation of formic acid. Peptides were then characterized using NMR spectroscopy.
240 scan averages. Direct Detection under Magic Angle Spinning (DD-MAS) experiments on hydrated embiid silk samples used a 2.5 μs π/2 pulse, 20 ms acquisition time, 100 kHz spectral width, a 1 second relaxation delay, and 40
960 scan averages. All CP and DD-MAS experiments were collected using 150 kHz TPPM38 proton decoupling during acquisition. To collect data on hydrated samples, silks were first soaked in DI water for at least 30 minutes then blotted with a Kim-Wipe to remove excess water.
To the best of our knowledge, the only available mechanical data obtained on silk produced by webspinner insects is unreliable due to improper fiber diameter measurements. Therefore in an attempt to better estimate embiid fiber tensile properties, we collected tensile stress–strain curves on silk bundles prepared for An. urichi. Samples were prepared by carefully brushing an E-shaped cardboard card across the tarsus of adult female insects of An. urichi. Fibers were superglued to each of the three anchor points on the E. Stress–strain curves were obtained by stretching one side of the E-shaped card at a rate of 1% per second, while the other unstretched side was analyzed using SEM to approximately obtain the number of fibers present. Additional experimental details and results are included as ESI.† Results suggest that webspinner silks are significantly stronger than previously thought; we observed an average of 500 MPa mean ultimate stress and about 30% extensibility over 14 measurements. Due to the small fiber diameters and extreme difficulty in obtaining consistent samples, this result should only be interpreted as a rough estimate.
The protein core structure was also analyzed using infrared spectroscopy. FT-IR is an extremely common technique used to study protein secondary structure of silk-based biopolymers; some examples include silks produced from silkworm,40–43 spiders,42,44 caddisfly larvae,45 silverfish,46 lacewing,36 and webspinners.4,5 The amide I, II and amide III absorbances are commonly used to infer protein secondary structure. There seems to be a general consensus in the literature regarding the amide I band, but significant debate in assigning secondary structures to components of the amide III band. For example, the amide III β-sheet and random coil absorbances were assigned at 1263 and 1230 cm−1 respectively for silkworm silk from Bombyx mori,40 but are assigned to 1222 and 1242 cm−1 for silkworm silk from An. pernyi. It is possible that the inversion of peak location is a result of different repetitive motifs; Bombyx mori silk is dominated by poly(GA) and GAGAGS units while An. pernyi contains runs of poly(A) similar to spider silks. In support, careful secondary structure assignments to the amide III band for poly(A)-rich spider silks agree well with An. pernyi but not with Bombyx mori.42 Thus the primary protein sequence has a major impact on FT-IR absorption peak positions, as does fiber strain and overall secondary structure.47 Considering significant variation in peak position, width, and shape, we think that a quantitative deconvolution of FT-IR absorbance bands will not give convincing results. Qualitative and especially comparative analysis of IR absorbance profiles, however, can be extremely powerful. Fig. 4 shows that the FT-IR absorbance profile of An. urichi is virtually identical to that of Ap. ceylonica. The strong amide I absorbance maximum at 1627 wavenumbers is identical to reports from Okada et al.,4 on webspinner silk from the congener Ap. gurneyi, which is similarly dominated by poly(GS) and poly(S) motifs. For comparison, the amide I band for β-sheet-rich silkworm silk is found at 1615 cm−1 (ESI†). The shift towards higher frequency absorbance for webspinner silk relative to silkworm silk could be attributed to a larger random coil component, as is often done in the literature through peak deconvolution. While this is a possibility, we think it is more likely that the shift in frequency is a result of differing primary protein sequences rather than a lower β-sheet fraction, especially when considering the high expected crystalline fraction for webspinner silks. Amide I absorbances from random-coil structures are typically found near 1650 cm−1 while β-sheet structures show absorbances at lower frequencies (1620 cm−1).48 Additionally, the shoulder absorbance at 1695 cm−1 is regularly assigned to β-sheet protein structures,40,48 thus it is quite clear that like silkworm silks, webspinner silk fibers are dominated by β-sheet structures, and are remarkably similar across species.
Φ)−1)/2 and Φ is the angle between the c axis of the nanocrystallite and the fiber axis. Difficulty manually aligning the silk fibers during sample preparation leads to fiber misalignment, which means this is a low estimate of the actual nanocrystallite orientation within the fiber. The overall crystalline fraction can be estimated as a fraction of the area represented by radial integration 15° on either side of the equator for the equatorial (200) and (120) crystalline peaks relative to the area represented by the integrated intensity of the full diffraction pattern resulting in 69% crystallinity. This high crystallinity content is likely due to the high fraction of repetitive motifs and significant serine content, both of which correlate to increased β-sheet content in the silk protein core.
:
MeOH-washed silks in their dry and hydrated states. Solid-state NMR is a very powerful tool for characterizing biopolymers; it has been widely utilized to elucidate molecular-level environments of repeat motifs such as poly(GA) and poly(A) in silkworm and spider silks, respectively. For embiid silks from both species discussed in this work, glycine and serine are the most dominant amino acids represented at approximately 45% and 36%, respectively.10 The 13C chemical shift of glycine Cα is not very sensitive to secondary structure, however both the alanine and serine Cα and especially Cβ chemical shifts are influenced dramatically by local conformation,50,5113C chemical shifts have been used extensively to identify secondary structures within silk-based biopolymers, and can be applied here to elucidate molecular level environments within embiid silks. Cleaned, native silk from Ap. ceylonica and An. urichi were studied with 13C solid-state NMR spectroscopy. 13C resonances for the three most abundant amino acids are assigned, and secondary structures are indicated (Fig. 6, 7). We are limited in the amount of information we can extract from 13C data on naturally-abundant samples, but we can still draw many strong conclusions. For one, serine exists predominantly in β-sheet structures based on the 13C chemical shifts observed for serine Cβ (Fig. 6). As indicated in Fig. 6, the serine Cβ resonances in the CP-MAS spectra lie at 64.5 ppm, a shift consistent with serine in a β-sheet environment. Serine from GAGSGS and poly(S) repeat motifs are most likely responsible for this observation. We also can see a minor shoulder at 62 ppm that is assigned to serine containing regions in a randomly-oriented environment.
Further confirmation of the secondary structural assignments where made by comparing 1H → 13C CP-MAS NMR data of peptides mimicking webspinner repetitive motifs that were either trapped in a random-coil confirmation or crystallized into a β-sheet structure from formic acid (Fig. 8). To mimic webspinner silk in a random-coil structure, a 30 AA peptide representing the protein sequence of Ap. ceylonica silk (illustrated in Fig. 1E) was synthesized using solid-phase peptide synthesis, dissolved in 6 M LiBr, dialyzed against DI water for 4 days, and then lyophilized. The peptide was then dissolved in formic acid and slowly dried, inducing a β-sheet structure. This method is commonly used to capture silkworm model peptides in both random-coil and β-sheet conformations.52,531H → 13C CP-MAS data was collected on both peptides (8A). Using DMFIT software,54 the spectra for random-coil and β-sheet model peptides were independently fit to extract precise 13C chemical shifts, peak widths, and % Gaussian vs. % Lorentzian line shapes. This information was then used to deconvolute NMR spectra for native silks to approximate β-sheet content; only peak amplitudes were allowed to vary when fitting native webspinner silk data. Deconvolution of NMR data obtained for Ap. ceylonica (8B) and An. urichi (data not shown) silks indicate that serine Cβ, the residue most isolated and most sensitive to secondary structure, is approximately 70% β-sheet and 30% random-coil for both species. This estimation is consistent with both our FT-IR and our XRD data in that FT-IR absorbance profiles for the two species suggest that the silk protein structures are virtually identical and are dominated by β-sheet structures, and that analysis of XRD results reveal that An. urichi silk is 70% crystalline. The alanine Cβ resonance shows a similar story in both silks, although it is more clear for An. urichi silk fibers where alanine is better represented (6C and 6D). Alanine, found almost exclusively in GAGSGS repeat motifs from An. urichi silk, clearly exists predominantly in β-sheet structures.
The contrast between CP and DD-MAS spectra in Fig. 7 is very useful to highlight rigid versus mobile regions of hydrated silks.55–58 Domains of the silk that are affected by water will typically have shortened 13C T1 and longer T2 relaxation times due to increased molecular motion, thus are easily seen as sharp peaks in the direct spectra. The DD-MAS experiment uses a fast (1 second) recycle delay so that 13C resonances from rigid regions of the silk with long T1 relaxation properties become saturated and therefore are not observed. While the majority of serine adopts a β-sheet structure within the silks, a new sharp resonance emerges in the direct spectra from both species at 62.2 ppm. This peak corresponds to serine Cβ in a random coil or loose helical environment. Serine is found almost exclusively in poly(GS), poly(GAGSGS) or poly(S) repetitive motifs in both silks, therefore not all seryl residues from these repeats exist in a β-sheet structure. A native silk gallery will naturally repel water due to its hydrophobic coating, but in this case we encouraged hydration of the fibers by completely submerging the silk samples in water. Perhaps when the silk becomes forcibly hydrated, poly(GS) and GAGSGS repeats from Ap. ceylonica and An. urichi silks are stable in both conformations, or can exchange between sheet-like and randomly-oriented structures on a slow timescale. Alanine resonances are weak from Ap. ceylonica silks, but we also observe a sharp peak emerge at 17.5 ppm for alanine Cβ from An. urichi silk in the direct spectra. Similar to the random coil serine Cβ peak evident by the sharp peak at 62.2 ppm, a portion of alanine also appears to adopt a random coil environment. Alanine from An. urichi silks comes from GAGSGS repeat domains, further supporting the idea that such repeats are found in both sheet-like and random structures when hydrated. For comparison, Bombyx mori silkworm silk is dominated by similar GAGAGS repeat domains, however a parallel experiment conducted on these fibers does not result in an emerging sharp helix or random-coil resonance upon fiber wetting. It appears then that when water penetrates into and solvates webspinner silk fibers, a higher fraction of silk protein becomes mobile as compared to similar regions of silkworm silk. Webspinner silk protein is much smaller than that of silkworm (70 kDa vs. 350 kDa). Perhaps for both silkworm and webspinner silks then, the dominant protein repeat units are embedded primarily in β-sheet nanostructures, but C- and N-terminal ends are less restricted. The shorter length of the webspinner protein and therefore higher percentage of protein near the C- and N-terminal domains may account for the apparent increase in random-coil protein content after fiber wetting.
Fig. 7 also yields information on the surface coating surrounding the silk fibers. For both Ap. ceylonica and An. urichi silks, one notices that the alkane CH2 resonance is significantly pronounced in the direct spectra when the silk is wet. The alkane CH2 resonance from both silks is both sharp and strong in the direct spectra, presumably because of fast molecular motion induced by the presence of water on the silk surface. Additionally, both An. urichi and Ap. ceylonica silks show a new sharp resonance in the direct spectra at 14 ppm, which we assign to terminal or branched alkane CH3 groups on the silk surface. Similar to FT-IR results in Fig. 3, NMR data shown in Fig. 7 on Ap. ceylonica silk fibers reveal the severe reduction of alkane CH2 and CH3 signals after washing the fibers with a 2
:
1 mixture of CHCl3
:
MeOH (red). We conclude that the surface coating surrounding the silk protein core is composed of long-chain lipids or alkanes that are non-covalently attached to the protein.
:
1 CHCl3
:
MeOH, suggesting that this surface coating is non-covalently adhered to the protein core. The hydrophobic nature of silken galleries appears to serve to protect the insects from being dislodged from their arboreal dwellings during heavy rain. It also protects the interior of their domiciles and their bodies from extreme wetting, which has multiple negative consequences including increased risk of suffocation and of fungal attack. During severe dry periods, the silk might also serve to limit desiccation of the insects sheltering inside. The basic hierarchical structure of webspinner silk fibers is illustrated in this study: the thin fiber core is composed of glycine and serine-rich repetitive protein motifs that primarily adopt a β-sheet nanostructure, and the protein core surrounded by a thin protective hydrophobic lipid or alkane-rich shell.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c4ra07567f |
| This journal is © The Royal Society of Chemistry 2014 |