Open Access Article
Aleksandra Hecelab,
Mariano Briccolab and
Eva Freisinger
*b
aFaculty of Chemistry, University of Wrocław, 50-383 Wrocław, Poland
bDepartment of Chemistry, University of Zurich, 8057 Zurich, Switzerland. E-mail: freisinger@chem.uzh.ch
First published on 13th April 2026
Metallothioneins (MTs) are a superfamily of cysteine-rich proteins essential for metal homeostasis and detoxification. While mammalian and plant MTs have been extensively characterized, fungal MTs remain comparatively understudied despite their remarkable sequence diversity. In particular, members of the fungal-IV MT family from Yarrowia lipolytica exhibit unusual sequence features, including a conserved CCC motif and an extended cysteine-free N-terminal region, raising the question of whether all cysteine residues participate equally in Cu(I) coordination and how sequence context influences metal binding. To address this, we investigated one representative protein (Ylip_MT) using a combination of potentiometric titrations and complementary spectroscopic techniques to resolve the stepwise recruitment of cysteine ligands during Cu(I) coordination. Truncated variants were employed to probe the role of the cysteine-free N-terminal segment, while a Y54C mutant was introduced independently to restore the otherwise conserved C-terminal cysteine and assess its influence on metal binding. Our results indicate the formation of a Cu4Cys8–9 cluster and demonstrate that both the extended cysteine-free N-terminal region and the C-terminal environment modulate cluster stability. This work represents the first application of potentiometry to a Cu(I)-MT system and provides new insight into how sequence-specific features govern metal binding in fungal MTs.
In contrast to our understanding of mammalian and plant MTs, fungal MTs remain relatively understudied. They exhibit considerable diversity in primary amino acid sequences among different fungal species. Fungal MTs are grouped into six families, with each family comprising only a few or even a single member.12 Fungi, in the most common sense, that are diverse molds but also mushrooms, are classified under family fungi-I. Notably, yeasts, which also belong to the fungal kingdom, occupy the remaining five fungal families (Candida glabrata, fungi-II and -III; Yarrowia lipolytica, fungi-IV; Saccharomyces cerevisiae, fungi-V and -VI).
Copper is an essential trace metal ion in all living organisms13,14 but can also be cytotoxic due to its redox reactivity. In particular, Cu(I) can catalyse the production of free radicals in Fenton reactions,15,16 and disruption in copper homeostasis has been implicated in severe diseases such as Menke's and Wilson's diseases.14,17–19 MTs play a critical role by tightly binding Cu(I) ions, thereby shielding cells from oxidative stress.20,21
Most characterized fungal MTs belong to the Cu–thionein class, with the CUP1 protein from baker's yeast (S. cerevisiae) being one of the most extensively studied examples.22,23 Its NMR solution structure revealed coordination of seven Cu(I) ions within a single cluster,24 while single crystal X-ray crystallography identified an additional Cu(I) ion, resulting in a Cu8Cys10 cluster topology. This structure shows a combination of linear and trigonal-planar thiolate-only coordination of the copper ions.25 Of interest here are also two smaller fungal MTs from the family fungi-I, for which copper binding capacities were determined. The MT from Agaricus bisporus (portobello mushroom) contains seven cysteine residues and binds 5.8 equivalents of copper ions when isolated from its native source.26 A highly homologous MT from the filamentous fungus Neurospora crassa contains the same number of cysteine residues and incorporates six Cu(I) ions when cultivated in Cu(II)-containing media.27 For comparison, the N-terminal β-domain of human MT3 shows an intermediate cysteine content of nine and is thus more similar to the protein investigated here (vide infra). It hosts the enigmatic Zn3Cys9 or Cd3Cys9 β-cluster but can also form well-defined major species containing four, six, or ten Cu(I) ions.28,29 Uniquely, the native form isolated from the human brain was found to contain a homometallic air-stable Cu(I)4–thiolate cluster in the β-domain, while the C-terminal α-domain hosts the enigmatic Zn4Cys11 α-cluster.30–32
This study focuses on a MT sequence from family 11 of fungal-IV MTs, a group of highly homologous sequences identified in the yeast Yarrowia lipolytica strain CLIB 122/E 150. Members of this family share a conserved cysteine distribution pattern (Fig. 1). The genes encoding MT1 and MT2 are located in close proximity on chromosome A, whereas the genes for MT3 and MT4 are adjacent on chromosome C (Fig. S1).33,34 Transcription of all four genes (MT1–4) is upregulated upon copper exposure.35 Genomic sequencing further revealed an additional putative MT gene on chromosome E, designated Ylip_MT in this study (B5FVH3 MT, Fig. 1).
Ylip_MT exhibits several distinctive sequence features that make it an attractive model system for studying Cu(I) coordination. First, it lacks histidine residues, including the otherwise highly conserved His at position 21 (MT1 and 3) or 22 (MT2 and 4), which is replaced by arginine. As a result, cysteine thiolates represent the only potential metal-coordinating ligands. Second, the sequence contains the conserved central CCC motif, raising the question of whether all three cysteine residues within this stretch participate equally in Cu(I) coordination. Third, in contrast to the other family members, Ylip_MT lacks a conserved C-terminal cysteine residue, which is replaced by tyrosine. Finally, the N-terminal region is unusually long and completely devoid of cysteine residues.
These features define three key structural elements whose roles in Cu(I) binding remain unclear: (i) the cysteine-free N-terminal segment, (ii) the central CCC motif, and (iii) the absence of the conserved C-terminal cysteine. In particular, it is unknown whether all cysteine residues contribute equally to Cu(I) coordination or whether specific sequence contexts modulate their reactivity.
To systematically address these questions, we designed a set of targeted variants. To probe the role of the C-terminal region, a Y54C mutant was generated to restore the canonical cysteine residue. To evaluate the contribution of the cysteine-free N-terminus, truncated constructs comprising only the C-terminal domain (C-term_Ylip_MT) were investigated, along with the corresponding Y54C variant (C-term_Y54C_Ylip_MT) (Fig. 1). This design enables us to disentangle the influence of global sequence architecture and local cysteine motifs on metal binding.
Despite these distinctive structural features, the spectroscopic and thermodynamic properties of Cu(I) binding to Ylip_MT and its variants have not yet been characterized, representing a significant gap in our understanding of fungal MTs. Here, we address this gap by combining spectroscopic and potentiometric approaches to elucidate the Cu(I) binding properties and thermodynamic stability of these MT constructs.
The proteins were expressed in LB or TB media containing 0.1 mg mL−1 ampicillin. The 15N-labeleled peptides for heteronuclear NMR experiments were expressed in M9 minimal media using 15N ammonium chloride as the nitrogen source. Overexpression was carried out for 6 h at 30 °C or overnight at 17 °C after induction with 1 mM isopropyl-β-D-thiogalactopyranoside (IPTG) at an OD600 value of approximately 0.6. Cell pellets were harvested by centrifugation and lysed by sonication. For initial purification of GST-tagged MTs, first a GST affinity column (GSTPrep FF 16/10, GE Healthcare, Glattbrugg, Switzerland) was used with 1× phosphate-buffered saline (PBS), pH 7.3, as the equilibration and loading buffer and 5 mL of 50 mM reduced glutathione (GSH) in 50 mM Tris-HCl (pH 8) as the elution buffer. Afterwards, samples were dialyzed against TRIS-HCl buffer to remove GSH. The GST tag was cleaved with TEV protease for 3 h at 34 °C in the dialysis buffer (50 mM TRIS-HCl, pH 8) by adding 0.5 mM 1,4-dithiothreitol (DTT) and 0.1 mM ethylenediaminetetraacetic acid (EDTA, pH 8) and using a TEV
:
peptide ratio of 1
:
30 (w/w). TEV protease was produced according to the standard procedure with some modification.37,38 Cleaved MTs were further purified via size exclusion chromatography (HiLoad 16/600 Superdex 75 pg, 10 mM ammonium acetate, pH 7.8) and lyophilized overnight. The apo-MTs were resuspended in 10 mM HCl and 100 mM tris(2-carboxyethyl)phosphine (TCEP) and purified with a Superdex Peptide 10/300 GL column under anaerobic conditions (a Coy Lab Vinyl Anaerobic Chamber equipped with a palladium catalyst and filled with a 5% hydrogen/95% nitrogen gas mixture). Protein masses were confirmed using electrospray ionization mass spectrometry (ESI-MS). The final concentrations of the purified proteins were determined by thiol quantification using the 2,2′-dithiopyridine (2-PDS) assay.39
:
1, 2
:
1, 3
:
1 and 4
:
1. The HYPERQUAD 2013 program was used for the stability constant calculations.43 Reported log
β values refer to the overall equilibria:| pCu + qH + rL = CupHqLr |
Standard deviations were computed using HYPERQUAD 2013 and refer to random errors only. The speciation and competition diagrams were generated using the HySS program.44
The details of the calculation of proton and complex formation constants have been described previously.45
200 rpm for 30 min under anaerobic conditions.
For MALDI-TOF MS, the Cu(I)-loaded protein was prepared in 50 mM ammonium acetate buffer at pH 4.5. A double-layer spotting technique was applied, whereby 0.2 μL of matrix solution consisting of saturated sinapic acid (SA) in EtOH was first deposited onto a ground steel MALDI target plate (Bruker) and allowed to dry completely. Then, 1 μL of the protein sample was mixed with 1 μL of SA matrix solution (saturated in H2O/MeCN at 2
:
1 containing 0.1% TFA). From this mixture, 0.5 μL was drop-cast onto the previously dried matrix layer and allowed to dry prior to analysis. MALDI-TOF MS experiments were performed using an Autoflex Speed time-of-flight mass spectrometer (Bruker Daltonics, Bremen, Germany) equipped with a Bruker Smartbeam™-II laser operating at a wavelength of 355 nm in positive linear mode. Instrument control and data acquisition were carried out using the Bruker Daltonics flexControl software (version 3.4). Spectra were accumulated over 10
000–15
000 laser shots and recorded between m/z 1000 and 20
000. Average masses were calibrated using signals from a protein mixture consisting of insulin ([M + H]+avg = m/z 5734.51 Da), cytochrome C ([M + 2H]2+avg = m/z 6180.99 Da), myoglobin ([M + 2H]2+avg = m/z 8476.65 Da), ubiquitin I ([M + H]+avg = m/z 8565.76 Da), cytochrome C ([M + H]+avg = m/z 12
360.97 Da), and myoglobin ([M + H]+avg = m/z 16
952.30 Da), all obtained from the Bruker Protein Calibration Standard I mixture (mass range ca. 4000–20
000 Da).
Ylip_MT4.1 ± 0.1
Y54C_Ylip_MT 4.43 ± 0.03
C-term-Ylip_MT 3.98 ± 0.08
C-term-Y54C_Ylip_MT 3.95 ± 0.03
Additionally, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) of Cu(I)-loaded Ylip_MT and electrospray ionization mass spectrometry (ESI-MS) of C-term-Ylip_MT and C-term-Y54C_Ylip_MT predominantly show the Cu4MT species as the main form (Fig. S4), further corroborating the stoichiometry determined by F-AAS. No ESI-MS spectrum was obtained for Y54C_Ylip_MT, potentially due to issues with ionization or sample stability during analysis. The binding capacities of the different Ylip_MT constructs are thus slightly lower compared to those of other small fungal MTs (e.g., from A. bisporus and N. crassa)26,27 but comparable to that of the Cu4Cys9 cluster observed in human MT3.28,30,31 These findings confirm that the N-terminal cysteine-free segment does not affect the maximum Cu(I) binding capacity and that the shorter constructs can serve as a robust model for studying metal ion coordination.
To better differentiate between LMCT contributions and cluster-centered transitions, the ratio ε262/ε295 was plotted as a function of added Cu(I) (Fig. 2d and S5d). This allows two important conclusions. First, the evolution of the ratio argues against strictly cooperative binding in which only a single dominant species forms. In a purely cooperative scenario, one would expect the spectral shape to remain constant during titration, with increasing Cu(I) leading only to a proportional increase in intensity. Under such conditions, the ratio of absorptivities at two selected wavelengths would remain constant throughout the titration. Second, in contrast to this expectation, the ε262/ε295 ratio shows changes up to approximately four equivalents of Cu(I), becoming constant only thereafter. This clearly indicates that the nature of the Cu(I) coordination environment evolves during titration. Notably, for all constructs, the ratio decreases markedly between three and four equivalents of Cu(I), reflecting a disproportionate increase in the 295 nm band. This behaviour is characteristic of Cu(I)–thiolate cluster formation, in which additional bridging thiolate ligands promote Cu(I)–Cu(I) interactions and stabilize multinuclear assemblies.46–49 At Cu(I) loadings above ∼4 equivalents, the ratio reaches a plateau. This behaviour likely reflects the onset of uniform scattering contributions, possibly due to minor Cu(I) precipitation, affecting the measured wavelength range similarly at both wavelengths.
Interestingly, differences between constructs are evident. For the Y54C mutant proteins, the ε262/ε295 ratio remains nearly constant during the initial titration phase (up to ∼3 equivalents), whereas both full-length and truncated wild-type sequences exhibit a gradual decrease over this range. The comparatively stronger early increase in cluster-centered transitions in the wild-type proteins suggests that some bridging cysteine residues may already participate in metal coordination during binding of the first three Cu(I) equivalents, possibly reflecting the reduced number of cysteine ligands available.
In summary, the UV data support a two-stage model of Cu(I) coordination. In the first stage, up to ∼3 equivalents, Cu(I) ions bind primarily to terminal thiolate ligands, resulting predominantly in LMCT transitions. In the second stage, completed at approximately four equivalents, recruitment of bridging thiolates leads to formation of multinuclear Cu(I)–thiolate clusters, reflected by the pronounced increase in absorptivity at 295 nm. The close similarity in spectral behaviour between the full-length proteins and their truncated counterparts further indicates that the cysteine-free N-terminal region of Ylip_MT does not directly participate in Cu(I) coordination.
The titrations of the two Y54C mutants yielded comparable spectra with subtle differences. For instance, the band at 315 nm was more pronounced at sub-saturation Cu(I) concentrations compared to the wild-type constructs, and in C-term_Y54C_Ylip_MT the negative band at 287 nm was slightly shifted to 289 nm. Moreover, maximum and minimum ellipticity values at 262 and 287 nm were reached at lower Cu(I) equivalents (2 and 2–3 for Y54C_Ylip_MT, and 2–3 and 3 for C-term_Y54C_Ylip_MT), suggesting that the additional C-terminal cysteine in these mutants reduces steric strain and facilitates earlier binding of all cysteine thiolates.
Finally, the comparable extrema observed during Cu(I) titrations of the two plant MTs, γ-Ec-1 and cicMT2, of the two domains of mouse MT1 and human MT3, as well as for the Cu(I)-thionein from S. cerevisiae in the UV-vis region (262 nm, 285–295 nm, and 300–400 nm) further corroborate these spectral features as characteristic signatures of Cu(I)–thiolate cluster formation.28,48–50,52
000 M−1 cm−1 at 262 nm, with 8–9 thiolate ligands contributing to cluster formation.47 Our estimates are therefore in good agreement with a Cu4Cys8–9 cluster model.To reduce experimental uncertainty, we used absorptivity values obtained from a curve fit of the titration data rather than relying solely on raw measurements (Fig. 2 and S5, Table S1). The resulting analysis for the two full-length constructs is shown in Fig. 4a (the truncated variants behave nearly identically and are therefore not shown). According to this model, binding of the first Cu(I) equivalent involves approximately three cysteine thiolates in both constructs, consistent with the typical trigonal-planar coordination geometry of Cu(I) in MTs.25 Addition of the second and third equivalents progressively increases the number of participating thiolates, reaching approximately seven (wild type) and eight (Y54C mutant) after three equivalents. Full saturation corresponds to engagement of all eight or nine cysteines, respectively.
![]() | ||
| Fig. 4 Estimated number of Cu(I)-coordinating thiolate groups in Ylip_MT and Y54C_Ylip_MT derived from UV absorption data (see Fig. 2, Tables S1–S3). (a) Calculation assuming that full thiolate participation is reached after the addition of 4 equivalents of Cu(I). (b) Calculation assuming that all thiolate groups are fully engaged by 3 equivalents of Cu(I). The plots are shown with a rastered background to facilitate visualization of the number of coordinating thiolates. | ||
An alternative scenario must also be considered. It is conceivable that most cysteine thiolates are already engaged after addition of three equivalents of Cu(I), and that the changes observed upon further metal addition primarily reflect structural rearrangements rather than recruitment of new ligands. Several observations support this possibility:
(i) the UV-vis data indicate that cluster formation becomes significant once more than three equivalents of Cu(I) are added (Fig. 2d),
(ii) the CD spectra reach maximum ellipticity at approximately three equivalents or earlier (Fig. 3d), and
(iii) the increase in ε262 may not exclusively report on the number of terminal thiolates but may also intensify when terminal ligands convert into μ-bridging coordination modes during cluster assembly. Under this scenario, the additional increase in absorptivity between three and four equivalents would primarily reflect formation of bridging thiolates rather than recruitment of additional cysteine residues for binding.
The calculation based on this alternative scenario is shown in Fig. 4b and results in approximately one additional coordinating thiolate per titration step compared to the first model.
The full-length proteins Ylip_MT and Y54C_Ylip_MT contain 19 (5 Lys, 8 Cys, 2 Asp, 1 Glu, and 1 Arg, plus the C- and N-termini) and 20 protonation sites (plus an extra Cys), respectively. This high number of titratable groups renders the potentiometric analysis complex. Since our spectroscopic data indicate that the cysteine-free N-terminal region does not significantly contribute to Cu(I) binding, we first focused on the corresponding C-terminal fragments. These truncated constructs contain fewer protonation sites, 15 for C-term_Ylip_MT (4 Lys, 8 Cys, and 1 Asp, plus the C- and N-termini) and 16 for C-term_Y54C_Ylip_MT (plus one additional Cys), thereby simplifying the system.
All titrations were initiated from the apo-proteins dissolved in 10 mM HCl (pH 2). Based on previous results showing a Cu(I) binding capacity of four equivalents for both Ylip_MT and its Y54C mutant, titrations were performed at five protein-to-metal ratios (1
:
0, 1
:
1, 1
:
2, 1
:
3, and 1
:
4). Although data were collected over a pH range of 2–11, only values up to pH 8–9 were included in the calculations to avoid complications from oxidative processes at higher pH. Within this evaluated pH range stepwise deprotonation of 11 sites was observed for both truncated constructs. For C-term_Ylip_MT, these correspond to the N- and C-termini, eight cysteine residues, and one aspartate. For C-term_Y54C_Ylip_MT, the deprotonating groups comprise the C-terminus, nine cysteines (or eight cysteines plus the N-terminus), and one aspartate. The most deprotonated species are defined as H4L for C-term_Ylip_MT and H5L for C-term_Y54C_Ylip_MT, while the fully protonated peptide species are denoted (H4L)H11 and (H5L)H11, respectively (Table 1). Accurate determination of individual pKa values in the alkaline region is challenging because the N-terminal –NH3+ group and the most basic cysteine residues deprotonate within a narrow pH window (8–9). In C-term_Y54C_Ylip_MT, the highest pKa (assigned to either the N-terminal–NH3+ group or the most alkaline cysteine) exceeds 9.3 (Table 1) and could therefore not be reliably determined under the applied experimental conditions.
| C-term_Ylip_MT | C-term_Y54C_Ylip_MT | |||||
|---|---|---|---|---|---|---|
| Species | log β |
pKa | Protonation site | pKa | log β |
Species |
| a N-terminus for C-term_Ylip, N-terminus or Cys-SH for C-term_Y54C_Ylip_MT. | ||||||
| (H4L)H11 | 75.3(1) | 3.5 | C-terminus | 2.7 | 76.6(1) | (H5L)H11 |
| (H4L)H10 | 71.8(1) | 3.5 | Asp-COOH | 2.8 | 73.9(1) | (H5L)H10 |
| (H4L)H9 | 68.3(1) | 4.3 | Cys-SH | 4.2 | 71.7(1) | (H5L)H9 |
| (H4L)H8 | 64.0(1) | 6.8 | Cys-SH | 6.8 | 66.9(1) | (H5L)H8 |
| (H4L)H7 | 57.2(1) | 7.5 | Cys-SH | 7.8 | 60.1(1) | (H5L)H7 |
| (H4L)H6 | 49.7(1) | 7.7 | Cys-SH | 7.8 | 52.3(1) | (H5L)H6 |
| (H4L)H5 | 42.0(1) | 8.0 | Cys-SH | 8.5 | 44.5(1) | (H5L)H5 |
| (H4L)H4 | 34.0(1) | 8.3 | Cys-SH | 8.5 | 36.0(1) | (H5L)H4 |
| (H4L)H3 | 25.7(1) | 8.5 | Cys-SH | 9.1 | 27.5(1) | (H5L)H3 |
| (H4L)H2 | 17.2(1) | 8.6 | Cys-SH | 9.1 | 18.4(1) | (H5L)H2 |
| (H4L)H1 | 8.6(1) | 8.6 | N-terminus/Cys-SHa | 9.3 | 9.3(1) | (H5L)H1 |
β) and corresponding pKa values for apo-C-term-Ylip_MT and apo-C-term-Y54C_Ylip_MT.The two lowest proton dissociation constants – 3.5 for apo-C-term-Ylip_MT and 2.7/2.8 for apo-C-term-Y54C_Ylip_MT – are assigned to the carboxylic acid groups of the C-terminus and the aspartic acid side chain. These values are consistent with the literature data of C-termini and carboxylic groups in folded proteins.53 For C-term-Ylip_MT, nine additional protonation constants were determined that are consequently assigned to the eight cysteine residues and the N-terminal amino group. While for C-term-Y54C_Ylip_MT ten additional protonation constants were expected in the covered pH range, only nine were obtained. These correspond either to all nine cysteine residues or to eight cysteines plus the N-terminal-NH3+ group. A reliable differentiation between these possibilities is not feasible due to their overlapping deprotonation ranges. The pKa values for the four lysine residues in both MTs – as well as the remaining undetected group in C-term-Y54C_Ylip_MT (either the additional cysteine or the N-terminus) – are expected to exceed 9.5–10 and therefore lie outside the pH window evaluated here.
The pKa values of cysteine thiolate and N-terminal-NH3+ groups are usually found in a range of 8–9.53–57 However, deviations from this range have been reported, particularly in enzymes containing redox-active cysteine residues such as the thiol disulfide oxidoreductase DsbA.53,58 Lowered pKa values may result from hydrogen bonding to other residues or by surrounding positive charges of other side chains as shown for, e.g., a protein from the thioredoxin-fold family.59 More recently, we observed a pronounced downward shifted pKa also in the apo-form of a MT. Using potentiometric pH titrations similar to the study performed here, a pKa value of 4.4 was determined for one cysteine thiol group in an eight cysteine residues containing small MT from the aquatic fungus Heliscus lugdunensis, while the other seven cysteines had pKa values >6.6.45 Similarly, a pKa value of 4.7 was recently assigned to a thiol in the apo-form of a small cysteine-rich motif from the CopY repressor, again using potentiometry.60 These findings clearly demonstrate that the pKa range of cysteine residues in metal-binding proteins can be considerably broader than commonly assumed.
Notably, also in the constructs investigated here, one residue in each case exhibits a markedly lower pKa value (4.3 for C-term-Ylip_MT and 4.2 for C-term-Y54C_Ylip_MT). Consistent with previous observations and by exclusion of other ionizable groups, this low pKa can only be attributed to a cysteine thiol group. A plausible explanation is strong hydrogen-bond stabilization of the thiolate, which would significantly lower its pKa. Potential hydrogen-bond donors include one of the lysine side chains or a neighboring, still protonated cysteine residue.58,61 The reason why such a pronounced effect is observed for only one single thiol among many remains unclear. Interestingly, however, a comparable phenomenon has been reported for polyhistidine-tagged peptides, where one histidine residue displays a substantially lower pKa (e.g., 4.70 for the DHDHDHHHHHHPGSSV-NH2 peptide (N-DpH)) relative to the remaining histidines (5.40 to 7.51).62
:
Cu(I) ratios (from 1
:
1 to 1
:
4) are shown in Fig. S9 and S10. These diagrams illustrate the pH-dependent abundance of the different protonation states.For several protonation states, individual stepwise dissociation constants were not resolved because two or more protons were released within a very narrow pH range. In these cases, the reported pKa values are the logarithm of an average one-proton dissociation constant. Upon Cu(I) binding, the pKa values of individual cysteine thiol groups decrease relative to the values of the apo form. Previous studies have reported that coordination of Cd(II) and Zn(II) to metallothioneins decreases the apparent pKa of thiols to roughly 3.5 and 4.5, respectively.45,63,64 Given the stronger Cu(I)–Cys interaction, even lower pKa values are expected. One of the sparse examples is the apparent pKa of 3.1 for a cysteine thiol group in the abovementioned small cysteine-rich motif from the CopY repressor in the presence of one Cu(I) equivalent.60
Cu(I)–thiolate complexes in MTs typically adopt digonal or trigonal coordination geometries.25,65 Therefore, stepwise coordination of the first Cu(I) equivalent is expected to involve two or three cysteine residues and to lower their pKa values accordingly. The first pKa values that were fitted from the titration data are 3.1 for the Cu(H4L)H7 species of C-term_Ylip_MT and 3.2 for the Cu(H5L)H7 species of C-term_Y54C_Ylip_MT. Accordingly, two cysteine thiol groups each have pKa values that are even lower, i.e. below ∼3. That very low pKa values cannot be determined precisely is commonly observed in complex-formation studies in cases where multiple species coexist within a narrow pH range. Taken together, we can safely conclude that these three thiolates with significantly lowered pKa values in each construct are directly involved in Cu(I) binding.
Nevertheless, additional protonation sites also display somewhat lower pKa values. For C-term_Ylip_MT, an average pKa value of 3.8 is obtained for the Cu(H4L)H6 and Cu(H4L)H5 species. We may assign one pKa to a coordinating thiolate or a thiolate that encounters proximity effects in the folding polypeptide. These may include intramolecular hydrogen bonding or electrostatic stabilization by nearby positively charged residues (e.g., lysine side chains) or coordinated Cu(I) ions. In addition, the CCC motif may further depress neighboring cysteine pKa values through proximity effects of adjacent, still protonated thiols.53 The second pKa value of 3.8 may also still originate from the low cysteine pKa seen in the apo-form. In this way we can align the results with the absorption data, i.e. 3–4 coordinating cysteine thiolate groups upon addition of one Cu(I) equivalent (Fig. 5). The pKa values of the Cu(H4L)H4, Cu(H4L)H3, and Cu(H4L)H2 species range from 5.0 to 7.3 and likely correspond to cysteine residues not directly involved in Cu(I) coordination. The final pKa of 7.5 for the Cu(H4L)H is attributed to deprotonation of the N-terminal amine, which also does not participate in metal binding and therefore remains close to its free-ligand value. A similar pattern is observed for C-term_Y54C_Ylip_MT. The Cu(H5L)H6 and Cu(H5L)H5 species have low values as well (3.7 and 4.1), and the non-coordinating cysteine residues and the N-terminal amine exhibit pKa values between 5.4 and 7.6.
At a 2
:
1 metal-to-peptide ratio, five cysteine residues display markedly lowered pKa values (≤3.5; 3.4 for the mutant) and hence are clearly directly involved in the coordination of Cu(I) at low pH. The next two cysteines exhibit slightly higher values (4.2 for C-term_Ylip_MT and 4.1 and 4.4 for C-term_Y54C_Ylip_MT) and are thus in the range discussed for thiolates either coordinating or in close proximity to coordinated Cu(I) ions. That results in a maximum number of 6–7 coordinating thiolate groups. The remaining cysteines, not involved in coordination, retain higher pKa values (>6.2 and 6.3). Comparison with the number of coordinating thiolates estimated from UV data (Fig. 4 and Table S3), and assuming full thiolate engagement at three Cu(I) equivalents, suggests that two Cu(I) ions are coordinated by six thiolate ligands in the wild-type fragment and seven in the mutant, hence numbers that align with the potentiometric data.
In the presence of three equivalents of Cu(I), six residues in C-term-Ylip_MT display pKa values ≤3.2, and seven have values ≤3.3 in C-term_Y54C_Ylip_MT, while also the pKa values for the last observed deprotonation step remain relatively low: 3.65 (average) and 3.7 for C-term_Ylip_MT and 3.6 (average) for C-term_Y54C_Ylip_MT. This indicates that, in C-term_Ylip_MT, all eight cysteine thiolates as well as the N-terminal amine group are deprotonated in the presence of three equivalents of Cu(I). On one hand, this agrees well with the UV data. On the other hand, it clearly demonstrates the unexpected – and to our knowledge unprecedented – coordination of Cu(I) by the N-terminal amine. For C-term_Y54C_Ylip_MT, nine sites are deprotonated at low pH, suggesting that either all nine cysteine thiolates, or eight thiolates plus the N-terminus, are involved in Cu(I) binding. However, based on the UV results, it is most likely that all nine thiolate groups are coordinated.
No major changes are observed when evaluating the 4
:
1 metal-to-peptide ratio, except that all pKa values further decreased, strongly indicating Cu(I) coordination (≤3.23 for C-term_Ylip_MT and ≤3.3 for C-term_Y54C_Ylip_MT).
:
1 and 4
:
1 metal-to-peptide ratios), the flexible N-terminus of the full-length protein becomes deprotonated as well and may participate in metal ion binding despite its larger distance from the metal–thiolate cluster in the amino acid sequence.Interestingly, the CD spectra of the apo-forms closely resemble those of the Cu(I)-bound species. All samples display a pronounced negative ellipticity band at 198–199 nm, accompanied by weaker negative contributions in the 218–224 nm range. These spectral features indicate a substantial degree of structural disorder that is largely retained upon metal binding.
Deconvolution of the spectra allows assignment of the observed signals to different secondary structure elements. A strong negative band near 200 nm is characteristic of random coils, whereas α-helices exhibit negative bands at 208 and 220 nm with a positive band near 190 nm. β-sheets are associated with a negative band around 218 nm and a positive band near 195 nm, while β-turns show a strong negative band near 191 nm and a positive band around 207 nm.72,73 The calculated secondary structure contents indicate that all four samples contain approximately 40% unstructured regions and around 20% β-turns. The α-helical content is generally low (<6%), with the notable exception of Cu4Y54C_Ylip_MT, which shows an increased α-helical contribution of ∼13%. The β-strand content varies between 27% and 39%, with the lowest value observed for Cu4Y54C_Ylip_MT (Fig. 7b).
Our F-AAS, ESI-MS, and UV spectroscopic analyses demonstrate that both full-length proteins and their truncated variants bind the same number of Cu(I) ions, indicating that metal coordination occurs exclusively through the cysteine residues located in the C-terminal region. In analogy to previously characterized MT structures, this cysteine-rich C-terminal domain is expected to remain largely flexible to accommodate Cu(I)-thiolate cluster formation with minimal steric strain. Such a structural requirement is consistent with the observed enrichment in random coil and β-turn conformations. Consequently, the observed content of well-ordered secondary structure elements detected by CD is most plausibly attributed to the cysteine-free N-terminal region.
Within this framework, the strong similarity between the apo and Cu(I)-bound spectra becomes readily understandable: Cu(I) binding primarily reorganizes the flexible C-terminal cluster-forming region, while leaving the secondary structural elements of the N-terminal region largely unaffected. Thus, the CD data support a modular organization of Ylip_MT, consisting of a structurally flexible, metal-binding C-terminal domain and a partially structured N-terminal region that is largely independent of metal coordination.
To complement these results with 3D structural data, 15N-labeled samples of Ylip_MT, Y54C-Ylip_MT, C-term-Ylip_MT, and C-term-Y54C-Ylip_MT were prepared and [15N,1H]-HSQC spectra recorded (data not shown). However, the observed signal dispersion and line widths suggest the presence of conformational exchange processes. Only the spectrum of Cu4-Y54C-Ylip_MT shows a modest degree of dispersion, but even in this case, the data were insufficient for further structure calculations (Fig. S11).
While numerous studies have determined stepwise Cu(I) binding constants for human metallothioneins, employing competition assays with chromophoric ligands,74 ESI-MS with competing ligands,75 isothermal titration calorimetry,76,77 and simulations based on high-resolution MS data,29 direct potentiometric analysis of Cu(I)-MT systems was missing. Only recently has potentiometry been applied to a small cysteine-rich Cu(I)-binding motif from a CopY repressor.60 To the best of our knowledge, the present work represents the first successful application of potentiometric titrations to a Cu(I)-MT system, enabling direct access to protonation equilibria and stepwise metal-binding constants in these highly cysteine-rich proteins.
By combining spectroscopic evidence with thermodynamic data, we arrive at the following key conclusions:
(i) Maximum Cu(I) loading and cluster formation. All four constructs bind up to four Cu(I) ions. Both UV-vis spectroscopy and potentiometric data consistently support the formation of a Cu4Cys8–9 cluster. Importantly, this implies that all cysteine residues, including those within the CCC motif, participate in coordination. Thus, the initially raised question regarding the involvement of the complete CCC stretch can be answered affirmatively.
(ii) Functional role of the N-terminal segment. Comparison of full-length proteins with their isolated C-terminal domains demonstrates that the long, cysteine-free N-terminal region enhances Cu(I) binding and stabilizes the resulting complexes. Across the investigated pH range, the Cu4 complex of full-length Ylip_MT shows the highest thermodynamic stability. While it does not directly contribute thiolate ligands, the N-terminal segment clearly modulates the stability of the metal-binding core. Significantly decreased pKa values in full-length Ylip_MT and its C-terminal fragment strongly support coordination of Cu(I) by the N-terminal amino group.
(iii) Influence of the C-terminal tyrosine residue. In the truncated constructs, the C-terminal tyrosine in the wild-type fragment appears to contribute to complex stabilization at near-physiological pH. While the Y54C variant exhibits slightly higher Cu(I) binding efficiency under strongly acidic conditions (up to pH 3.5), the Cu4 complex of the wild-type C-terminal fragment becomes thermodynamically more stable at higher pH. This suggests that the aromatic residue may provide additional stabilization, possibly through secondary interactions that become relevant once the thiolate cluster is largely formed.
Overall, this work establishes potentiometry as a powerful tool for dissecting Cu(I) coordination in MTs and provides new insights into how subtle sequence features such as CCC motifs, terminal residues, and non-coordinating extensions fine-tune Cu(I) cluster formation and stability.
| This journal is © The Royal Society of Chemistry 2026 |