 Open Access Article
 Open Access Article
      
        
          
            Farid F. 
            Khoury‡
          
        
        
       a, 
      
        
          
            Bradley S. 
            Heater‡
a, 
      
        
          
            Bradley S. 
            Heater‡
          
        
       b, 
      
        
          
            Daniel R. 
            Marzolf
          
        
      b, 
      
        
          
            Sameera 
            Abeyrathna
b, 
      
        
          
            Daniel R. 
            Marzolf
          
        
      b, 
      
        
          
            Sameera 
            Abeyrathna
          
        
       a, 
      
        
          
            Jonathan W. 
            Picking
          
        
      b, 
      
        
          
            Piyush 
            Kumar
a, 
      
        
          
            Jonathan W. 
            Picking
          
        
      b, 
      
        
          
            Piyush 
            Kumar
          
        
       a, 
      
        
          
            Steven A. 
            Higgins
a, 
      
        
          
            Steven A. 
            Higgins
          
        
       b, 
      
        
          
            Randy 
            Jones
          
        
      b, 
      
        
          
            Alan T. 
            Lewis
             Jr
          
        
      b, 
      
        
          
            Katarzyna H. 
            Kucharzyk
b, 
      
        
          
            Randy 
            Jones
          
        
      b, 
      
        
          
            Alan T. 
            Lewis
             Jr
          
        
      b, 
      
        
          
            Katarzyna H. 
            Kucharzyk
          
        
       *b and 
      
        
          
            Scott 
            Banta
*b and 
      
        
          
            Scott 
            Banta
          
        
       *a
*a
      
aDepartment of Chemical Engineering, Columbia University, New York, NY 10027, USA. E-mail: sbanta@columbia.edu
      
bBattelle Memorial Research Institute, Columbus, OH 43201, USA. E-mail: kucharzyk@battelle.org
    
First published on 28th May 2025
Rare earth elements (REEs), which include the 15 lanthanides plus scandium and yttrium, are critical components commonly used in permanent magnets and play a significant role in electronics and green energy technologies. Due to the similarities of these ions, conventional separation processes are chemically- and energy-intensive and generate large quantities of waste. The lanthanides share physical and chemical similarities with calcium ions, which allow REEs to replace calcium in calcium-binding peptides and proteins. In this study, we conducted a bioinformatic search to identify calcium-binding peptides with high affinity to bind and separate REEs. Seven unique domains representing different calcium-binding geometries were selected for evaluation. The results revealed a strong correlation between the charge of the binding loop and its affinity for REEs. We concluded that highly charged, aspartic acid-rich loops exhibit greater electrostatic repulsion, which creates higher affinity due to the increased stabilization effect of ion binding. Binding affinity across the lanthanide series was highest for ions with radii similar to that of calcium (∼1 Å), consistent with the evolutionary optimization of calcium-binding proteins for selective ion recognition. While selectivity varied among proteins in solution, immobilized proteins demonstrated higher selectivity toward intermediate REEs. One notable candidate identified in the bioinformatic search was HEW5 from Nocardioides zeae. We leveraged the selectivity of HEW5 in a 7 mL column to demonstrate a single-stage, chelator-free separation of an equimolar lanthanum–neodymium mixture, achieving a high purity (>90%) and yield (90%) of REEs. Additionally, immobilized HEW5 was used to remove non-REE ions from a simulated leachate stream and separate lanthanum (>90% purity) from other REEs in a single separation stage.
Biological approaches are attractive as they offer the potential to mitigate the environmental impact of current methods. Proteins can be readily engineered for customized functions, and to meet demand of large-scale applications.11 It has been estimated that one-third of all proteins require a specific metal ion to perform their functions, and these binding interactions have evolved to be highly selective.12,13 Therefore, metalloproteins provide an exciting opportunity to develop new REE separation and recovery technologies.
Recently, a high-affinity lanthanide-binding protein,14 lanmodulin (LanM), was identified from the methylotroph Methylobacterium extorquens following the discovery of a lanthanide-dependent methanol dehydrogenase in the same host.15,16 This Mex-LanM and, more recently, a Hans-LanM, are EF-hand domains with picomolar affinities to lanthanides that have demonstrated over a millionfold selectivity for lanthanides over non-REEs and some selectivity within the lanthanides series.17 This has enabled single-stage separation of Nd and Dy ions.18,19 In multiple stages, Mex-LanM has showed the potential for high purity separation of Sc and grouped separation of other REEs. The extreme affinity of LanM for lanthanides is remarkable, but this can also lead to challenges for the elution of the REE ions. The release of bound ions requires sharp pH gradients or the use of strong chelators such as citrate or malonate.20 This adds process costs and requires additional steps to separate and recover the ions from the chelators for recycling.
Trivalent lanthanide ions share physical and chemical similarities with divalent calcium ions, such as ionic radius, coordination geometry, and oxophilicity.21,22 These properties enable lanthanides to replace calcium in many proteins, often with higher affinities.22–24 Several studies have explored the loops of calmodulin (CaM) and other EF-hand domains and engineered them for lanthanide binding.25–30 These peptides can be used to separate REEs from non-REEs and their moderate affinities compared to LanM enable simpler elution strategies. However, these peptides tend to lack selectivity within the lanthanide series. The most well-known example is the development of the lanthanide binding tag (LBT), which was created through a combinatorial screen of the calcium-binding loops of calmodulin.29 These engineered LBTs exhibit nanomolar affinities for REEs and high selectivity within the lanthanide series, up to a 60-fold difference between La and Lu.26,31 LBTs have been successfully used for REE recovery via cell surface display.32 However, in a chromatographic setting, their low binding capacity per peptide would necessitate large column volumes to achieve meaningful capture. Moreover, LBTs have been reported to lose approximately 50% of their binding capacity under moderately acidic conditions (pH 5).32 While their selectivity is impressive, there are limited studies demonstrating effective separation of REE mixtures using LBTs in a chromatography format.
Recently, we reported that the Block V repeats-in-toxins (RTX) domain of adenylate cyclase from Bordetella pertussis, which has been engineered for calcium-induced hydrogel formation,33–35 calcium-controlled phase-separation peptides,36 and modulation of enzymatic activity.37 RTX is also capable of binding REEs with high selectivity against non-REEs and can differentiate between the light lanthanides.38–40 The ability of CaM and the RTX domain, two calcium-binding peptides with different binding modalities, to bind REEs and selectively separate them from non-REEs including calcium, motivates a broader search for calcium-binding peptides with alternative binding motifs. In this study, we explored a large library of calcium-binding domains to identify additional peptides capable of separating REEs. We performed a bioinformatic search to identify potential high capacity, high affinity, and high selectivity REE binding domains. We characterized the most promising peptides and immobilized the best candidates for single-stage chelator-free separation of REEs. The most notable candidate was able to achieve high-purity and high-yield separation of La and Nd, two major components of bastnäsite ore leachate.
The search identified 3035 calcium binding domains (CBDs), which consisted of 0.4% C2 domain, 3% epidermal growth factor (EGF), 3.3% βγ-crystallin domain, 6.5% PPE-SVP, 42% Excalibur, and 45% thrombospondins (Fig. S1†). The search also identified over 19![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 pyrroloquinoline quinone (PQQ) calcium-binding domains, which were excluded due to the dependency of the metal coordination on the small molecule pyrroloquinoline quinone (PQQ). Three to five representative CBDs were chosen from each category, yielding 27 candidates. Of those, 14 candidates had high-confidence AlphaFold2 (ref. 47) structures and predicted48 to be cytoplasmic. Seven representative domains were manually selected for characterization based on their similarity to other reported CBDs and their diverse binding geometries.
000 pyrroloquinoline quinone (PQQ) calcium-binding domains, which were excluded due to the dependency of the metal coordination on the small molecule pyrroloquinoline quinone (PQQ). Three to five representative CBDs were chosen from each category, yielding 27 candidates. Of those, 14 candidates had high-confidence AlphaFold2 (ref. 47) structures and predicted48 to be cytoplasmic. Seven representative domains were manually selected for characterization based on their similarity to other reported CBDs and their diverse binding geometries.
A0A7 (Uniprot ID: A0A7L4YJY0) was truncated from a 295 amino acid hypothetical protein from Epidermidibacterium keratini.49 The parent protein comprises three repeating anti-parallel thrombospondin domains, and the selected sequence (S111–S196) forms an anti-parallel β-sheet domain with six calcium binding sites (Fig. 1A). The calcium ions are modeled to bind in the loops connecting the β-strands with an octahedral binding geometry, where one of the coordinating oxygens extends from an adjacent loop.
|  | ||
| Fig. 1 Predicted structures of the seven selected domains using AlphaFold3 (ref. 56) with calcium ions (red spheres). (A) A0A7 is from a repeating anti-parallel thrombospondin domain from Epidermidibacterium keratini, (B) HEW5 is a cytosolic calcium regulator from Nocardioides zeae, (C) K3T(VN) and (D) K3T(VV) are thrombospondin type 3 repeat domains from Ulvibacter litoralis, (E) HJH0 is an Excalibur calcium-binding domain-containing protein from Nocardioides zeae, CaM(III, IV) is an EF-hand domain truncated from full length calmodulin from Xenopus laevis, and (G) RTX is the Block V domain of the adenylate cyclase protein from Bordetella pertussis (PDB ID 5CVW). PDB files for the predicted structures (A–F) can be found in the ESI.† | ||
HEW5 (Uniprot ID: A0A6P0HEW5) is a hypothetical protein presumed to regulate calcium ion concentrations in the cytosol of Nocardioides zeae.50 The native protein is made of 137 amino acids, forming a β-sheet with eight octahedral calcium binding sites (Fig. 1B). The truncated domain (P14–G125) excludes disordered regions not affiliated with calcium binding. The binding sites are highly conserved across the protein, with each loop bearing the amino acid sequence D(A/T)DGDGY(V/I/T/A)D. All of the coordinating oxygens in a binding site of HEW5 belong to the same binding loop, making the sites more independent than the binding sites in the A0A7 domain.
K3T(VV) and K3T(VN) (Uniprot ID: A0A1G7K3T7) were both truncated from a 400 amino acid hypothetical protein from Ulvibacter litoralis.51 The protein is predicted to be a thrombospondin type 3 repeat and is made of a conserved sequence repeated in a chain 6.5 times with only slight variations. The N-terminal repeat (Unit A) was truncated to yield the K3T(VV) domain (V1–V63), and the C-terminal repeats (Units E–G) were trimmed to produce the K3T(VN) domain (V253–N402). The selected domains are aspartic acid-rich and bind four and eight calcium ions, respectively. The domains lack secondary structure characteristics, and the calcium-binding sites are connected via disordered loop regions (Fig. 1C and D), unlike the β-strands in the A0A7 and HEW5 domains and α-helices in EF-hand domains. One key distinction between the two selected domains is a disulfide bridge linking the two units of the K3T(VN) domain.
HJH0 (Uniprot ID: A0A6P0HJH0) is a 95 amino acid Excalibur calcium-binding domain-containing protein from Nocardioides zeae.50 Excalibur domains are distant relatives of EF-hand domains with a conserved calcium-binding loop DxDxDGxxCE.52 The cysteines form a disulfide bond, giving rigidity to the binding loop. The selected domain (G52–R95) omits the disordered region outside the binding pocket and can bind a single calcium ion (Fig. 1E).
CaM(III, IV) and the RTX domains were not involved in the bioinformatics search and were manually selected for the study. CaM(III, IV) (PDB: 1CFF53) was selected as a representative EF-hand domain. It was truncated from the full-length CaM protein (M76–K148), yielding a domain with two calcium ion binding sites (Fig. 1F). The RTX domain (PDB: 5CVW54) is the Block V domain of the adenylate cyclase protein from Bordetella pertussis. It was selected for its recently reported ability to bind REEs.38,40 The RTX domain forms a β-roll structure upon binding eight calcium ions comprised of a tandem repeat of a nine amino acid sequence GGxGxDx(L/F/I)x. The bound ions experience pentagonal-bipyramidal coordination geometry with one oxygen extending from an adjacent loop (Fig. 1G). The Block V RTX domain was not truncated further to exclude the non-calcium binding portion of the C-terminus as this capping group influences the calcium binding by entropically stabilizing the motif.55 The oligonucleotides and amino acid sequences for all constructs are tabulated in Tables S2 and S3.†
|  | ||
| Fig. 2 Competition experiments with xylenol orange (XO). (A) Number of high affinity binding sites per domain. The (*), (**), and (***) represent three statistical groups where members of different groups have statistically significant differences between their means via a one-way ANOVA test (p-values are reported in Table S4†). Titrations of XO with YbCl3 for (B) K3T(VN), (C) A0A7, (D) HEW5, (E) K3T(VV), (F) CaM(III, IV), (G) RTX, and (H) HJH0. The error bars represent the standard deviations from three independent trials. | ||
| Protein | Uniprot ID | # residue (MW kDa) | pI | # theoretical sites | Capacity (# theoretical sites/100 AA) | # high-affinity sites via xylenol orange | # sites via ITC | Max. selectivity via FRET ↓ | 
|---|---|---|---|---|---|---|---|---|
| HEW5 | A0A6P0HEW5 (P14–G125) | 117 (12.1) | 2.98 | 8 | 6.8 | 2.3 ± 0.5 | 8.6 ± 0.6 | 3.4 ± 1.1 | 
| RTX | PDB: 5CVW | 152 (15.9) | 3.96 | 8 | 5.2 | 0.47 ± 0.12 | 5.1 ± 0.6 | 3.1 ± 0.5 | 
| A0A7 | A0A7L4YJY0 (S111–S196) | 87 (9.3) | 2.69 | 6 | 6.9 | 3.7 ± 0.1 | 7.8 ± 0.7 | 1.7 ± 0.1 | 
| K3T(VN) | A0A1G7K3T7 (V253–N402) | 150 (15.6) | 2.82 | 8 | 5.3 | 4.2 ± 0.3 | ND | 1.7 ± 0.2 | 
| CaM(III, IV) | PDB: 1CFF (M76–K148) | 74 (8.5) | 4.16 | 2 | 2.7 | 0.55 ± 0.10 | ND | 1.7 ± 0.3 | 
| K3T(VV) | A0A1G7K3T7 (V1–V63) | 63 (6.6) | 2.82 | 4 | 6.3 | 1.9 ± 0.1 | ND | 1.3 ± 0.2 | 
| HJH0 | A0A6P0HJH0 (G52–R95) | 43 (4.7) | 3.85 | 1 | 2.3 | 0.14 ± 0.10 | ND | 1.1 ± 0.1 | 
The calcium-binding loops in many peptides are often disordered due to electrostatic repulsion from negatively charged aspartic and glutamic acids. The binding of the metal ion stabilizes the electrostatic repulsion and allows the protein to adopt a more compact structure.57,59–62 This stabilization correlates with increased binding affinities.58,63 For instance, A0A7 and HEW5, which have four aspartic acids per bind loops, showed higher affinities than the RTX domain, which has only one aspartic acid per loop. If the magnitude of the electrostatic stabilization is assigned to the isoelectric point of the protein, a clear pattern emerges: highly charged proteins with a low isoelectric point (A0A7, HEW5, K3T(VN), and K3T(VV)) have more high-affinity binding sites. In contrast, domains with higher isoelectric points (HJH0, CaM(III, IV), RTX) lacked high-affinity sites (Table 1). Additionally, entropic forces play a role in determining the affinities as upon metal ion binding, the ions and the sidechains in the binding pocket release their hydration shells, creating entropically favorable reactions. For instance, the binding of a lanthanide releases eight or nine water molecules, while calcium only releases six water molecules and this entropic driving force creates a higher affinity for REEs over calcium.31 The composition of non-coordinating sidechains in the loop, which must be dehydrated in the binding reaction, and other structural elements of the protein also affect the affinities.
The coordination geometry of the ion is critical for determining both the selectivities and affinities of the binding sites.57 The coordination geometries observed in the predicted structures for the selected domains varied between octahedral (CN = 6, where CN stands for coordination number) and pentagonal-bipyramidal (CN = 7), with all coordinating atoms being oxygens. Given potential errors in the structure prediction models (AlphaFold3), such minor variations in coordination geometries are unlikely to impact the affinities significantly. The charge of the coordinating oxygens, whether partial or full, and the presence of polydentate ligands likely play a more prominent role in this case. The affinity gradients within the K3T(VN) and K3T(VV) can be explained by the charge of the binding loop. The domains have eight and four binding sites, respectively, while only half of the predicted sites are detected in the competition assay. A closer examination of the structure of the binding sites yields two distinctly different binding modes. The high-affinity binding sites are coordinated by seven oxygens in a pentagonal bipyramidal geometry with up to four fully charged oxygen ligands, while the other set of sites are coordinated by only five partially charged oxygen atoms (Fig. 1C). For the other domains, sequential binding of the metal ions with cooperative binding could yield an affinity gradient within the protein.
The maximum selectivity for the domains, measured by FRET, ranged from 1.1 for HJH0 to 3.4 for HEW5 (Table 1). The Mex-LanM had a maximum selectivity of about 3.0 when comparing the same ions in this study via the CFP/YFP FRET system,18,64 indicating HEW5 has a higher selectivity amongst the REEs screened. Therefore, the selectivity results are encouraging for protein-based separation of REEs.
While the CFP/YFP FRET method enables high-throughput screening of apparent affinities of REE-binding proteins at physiological pH, it is essential to recognize the limitations. The two large proteins (CFP and YFP) on the ends of the peptides may affect folding, which in turn impacts the affinities and selectivities of the binding sites, as previously reported for the entropic effect of the C-terminus capping group of the RTX.55 Additionally, although no precipitation was observed, heavy REEs are more prone to form insoluble hydroxides at physiological pH (Visual MINTEQ).
HJH0 and CaM(III, IV) were capable of binding the tested lanthanides as indicated by FRET, but neither domain exhibited high-affinity binding sites via XO competition assay. HJH0 could not differentiate between the tested ions, having the lowest measured selectivity (1.1 ± 0.1). In contrast, CaM(III, IV) demonstrated moderate selectivity (1.7 ± 0.3) with a preference for light REEs. The binding capacity of both proteins was the lowest among the domains, with only 2.5 binding sites per 100 amino acids. Consequently, HJH0 and CaM(III, IV) were excluded from further testing on chromatography columns.
K3T(VN) and K3T(VV) exhibited moderate selectivities, ranging from 1.3 to 1.7, with a preference for heavy REEs. These domains are high-capacity binders, with 5–6 binding sites per 100 amino acids, and contain multiple high-affinity binding sites. Despite meeting all the requirements, they present multiple binding modes that could complicate chromatography column performance. Specifically, half of the ions in these domains are coordinated in a pentagonal-bipyramidal (CN = 7) geometry with multiple bidentate aspartic acids. The other half are coordinated in a square-pyramidal (CN = 5) geometry, with all coordinating oxygens being monodentate and partially charged. This variation in coordination could lead to differences in affinity, as indicated by XO results, and selectivity between sites, potentially resulting in broad and multipeak elution profiles in chromatography. For these reasons, the domains were not pursued for further characterization. However, their unique properties make them worthy of separate study.
A0A7, HEW5, and RTX domains were selected for further characterization and immobilization on chromatography resin. The three domains are high-capacity binders, with 5–7 binding sites per 100 amino acids, while EF-hand domains average less than three binding sites per 100 amino acids. The domains demonstrate high selectivity ranging from 1.7 to 3.4, and A0A7 and HEW5 display multiple high-affinity binding sites. Additionally, the binding loops of the domains are highly conserved and may be less likely to exhibit variation in selectivity across the protein. The repetitiveness and modularity of the sequences would enable the introduction of additional binding loops to increase the binding capacity per protein, as previously reported for the RTX domain.66 The HEW5 and RTX are the most selective and have opposing preferences for light and heavy REEs, respectively, presenting an opportunity to use different scaffolds tailored to the composition of the source material.
|  | ||
| Fig. 4 CD spectra of the proteins (60 μM) in their apo (—), Ca-bound (— —), and Nd-bound (— -) states for (A) A0A7, (B) HEW5, and (C) RTX. The reported spectra are the means of three scans. | ||
While the binding affinities were determined using a FRET-based method described above, the complications of having two larger proteins on either side and the requirement to run CFP/YFP FRET at near physiological pH conditions raised the need to pursue other direct routes with the unmodified peptides. Isothermal titration calorimetry (ITC) can be used to more directly measure the interaction between the ion and the binding site. The affinities of the down-selected proteins to REEs for nine lanthanide ions spanning light, intermediate, and heavy REEs in MES buffer (pH 6) were determined. HEW5 had the highest affinity with average dissociation constants ranging from 5–40 μM, with a preference for light REEs (Fig. 5A). In contrast, the RTX domain preferred intermediate and heavy REEs, with the highest affinity for europium. The results for the RTX domain agree with previously determined apparent dissociation constants at pH 6 via FRET.40 A0A7, similarly to HEW5, preferred light REEs and ranged from 10–70 μM, with the highest affinity for cerium and praseodymium. It is notable that the affinity of A0A7 to lanthanides had a different trend via ITC from what was measured via FRET, which could be attributed to change in pH conditions or that ITC measures the heat of the interaction between the amino acids and the ion while FRET measure the induced conformational change (apparent dissociation constants).
At low molar ratios, an initial increase in the enthalpy was observed. A similar response was reported for LanM peptides and other metalloproteins and is attributed to a large conformational change following the binding of the first ion.67,68 The data points were excluded from the isotherm fits; therefore, the estimated dissociation constants for A0A7 and RTX do not account for the first site. In case of negative cooperativity, as anticipated for the RTX, the affinity would be underestimated by ITC. The overall selectivity, the highest to lowest calculated lanthanide dissociation constants ratio, was similar for the three domains and ranged from five to eight (Fig. 5B). A summary of the calculated thermodynamic properties and individual isotherm fits are presented in ESI (Table S6 and Fig. S11–S17†).
Calcium interaction with the proteins could not be studied by ITC due to the weak affinity between the protein and calcium at pH 6. The resultant isotherms were incomplete sigmoidal curves (Fig. S13, S15 and S17†), and the dissociation constants were undetermined. The required amount of protein to increase the c-value ([protein]/kd) to produce a suitable isotherm was prohibitively high. Therefore, the dissociation constants for the interactions between the proteins and calcium were estimated as the concentration of ions needed to induce conformational change in CD. The lanthanide to calcium selectivity, reported as the ratio of calcium ion required to induce a conformational change in CD to the average dissociation constant for lanthanide, was the highest for A0A7 and HEW5 at 60-fold selectivity for lanthanides over calcium (Fig. 5B). In contrast, the selectivity of the RTX domain for lanthanides was only 15-fold over calcium. The results illustrate the ability of the domains to distinguish REEs from each other and to separate them from contaminants, including calcium.
The binding of the lanthanides to the proteins was an entropically driven interaction (Fig. S11†), and a similar behavior was observed for LBT.31 The finding was expected since the dehydration of the ions and the sidechains, releasing water molecules into the solvent, and the burial of hydrophobic residues due to the folding of the disordered domains are entropically favorable processes. The REE ions and side chains are highly hydrated in solution, and the disruption of their tightly bound water shells during binding imposes a significant enthalpic penalty, which must be entropically overcome to form the protein–ion complex. Interestingly, these domains evolved to bind calcium and have a preference for REEs. A0A7 and HEW5 prefer light REEs, while RTX showed a higher affinity for intermediate to heavy REEs. Replotting the dissociation constants (Fig. 5A) as a function of the effective ionic radius,69 where the radius of each ion is taken at appropriate coordination number (CN = 6 for A0A7 and HEW5, and CN = 7 for RTX, where CN stands for coordination number), the highest affinity for all three domains emerges at the same ionic radius (Fig. 6), which closely matches the ionic size of calcium (∼1 Å). These results suggest that the size of the ions is a major determining factor for the preference. It is important to note that coordination numbers were predicted via AlphaFold3 and should be confirmed by experimental validation.
|  | ||
| Fig. 6 Dissociation constants as a function of effective ionic radius.69 Average dissociation constants determined via ITC for A0A7, HEW5, and RTX. The coordination numbers (CN) were determined from AlphaFold3 predicted structures. The error bars represent 95% confidence intervals. | ||
|  | ||
| Fig. 7 Immobilized apparent dissociation constants measured under isocratic elution conditions (pH 3) for A0A7 and HEW5. The error bars represent the propagated standard deviations from three independent trials. The uncertainties stem from the experimentally determined capacity (Bt) and the standard deviations from the retention peaks (VR) of each element. Triplicate measurements of apparent dissociation constants and elution profiles are presented in ESI (Fig. S19†). | ||
The elution profiles of the A0A7 and HEW5 domains were studied in MES buffer at pH 6, under the same conditions as the ITC experiment. However, as anticipated from the XO competition experiments, the tight binding of the two domains prevented facile REE elution. Optimal conditions for REE elution from these domains were found around pH 4, which is near the pKa of aspartic acid. In contrast, elution from the RTX domain began at just below pH 5. The RTX domain demonstrated poor separation efficacy, exhibiting broad and multipeak elution profiles (Fig. S20†). This behavior is likely due to a large affinity gradient of the binding sites within the RTX domain, as previously observed in studies using calcium.38,54 Additionally, ITC revealed a stoichiometry of five functional binding sites out of eight theoretical sites in the RTX domain (Table 1), suggesting that the remaining three weaker or non-functional sites may contribute to premature elution.
The selectivity of A0A7 and HEW5 shifted from a preference for light REEs in solution to intermediate REEs upon immobilization. This shift may be attributed to the entropic effect of anchoring the protein, the adoption of a new conformational state (which could cause an increase in coordination number favoring heavier REEs, as discussed in Fig. 6), a shift in second-shell coordinating amino acids, and other factors such as column properties, pH, and environmental conditions. A recent study on the binding of lanthanides to the methanol dehydrogenase (Xoxf) using DFT showed a higher affinity to intermediate REEs, which is attributed to the higher spin state of these ions.72 This change in selectivity creates an affinity gradient within the light REE series, which can be exploited for separation. Light REEs constitute the majority of REEs found in bastnäsite, the most common rare earth source, according to the U.S. Geological Survey.73 Since the RTX domain demonstrated poor separation efficacy under the conditions tested, and A0A7 and HEW5 exhibit similar selectivities, HEW5 was chosen for further separation tests due to its higher capacity and stronger affinity.
|  | ||
| Fig. 8 Single-stage separation of an equimolar mixture of lanthanum (LaCl3) and neodymium (NdCl3) using immobilized HEW5 (7 mL column volume). Shaded areas represent pooled fractions used for calculating purity (>90%) and yield (>90%) of each element. In this context, purity is defined as the molar percentage of the specified ion in the pooled fractions, while yield represents the molar ratio of the pure fraction to the total loaded onto the column. Results are representative of triplicate trials; additional details are provided in ESI (Fig. S21†). | ||
Bastnäsite ores are the primary source of REEs and are particularly rich in light lanthanides. The mine concentrate, or “refined ore,” primarily contains cerium (Ce), lanthanum (La), neodymium (Nd), and praseodymium (Pr), along with minute amounts of heavier REEs such as samarium (Sm), europium (Eu), gadolinium (Gd), and yttrium (Y). It also includes significant quantities of non-REEs, mainly calcium (Ca) and strontium (Sr).75 Commonly, concentrated hydrochloric acid is used to leach the ore, releasing REEs into two streams: a cerium dioxide-rich stream and a REE chloride stream with lower cerium concentration.74 These elements are separated from impurities and one another through multiple stages of solvent extraction—an energy-intensive and environmentally taxing process.6–8
To further demonstrate the capabilities of this platform, a synthetic solution simulating the chloride stream from the hydrochloric acid leaching of ores (Table S7†) was separated using HEW5 with a loading of approximately 15% of the column capacity. Non-REE ions “impurities” were effectively separated from REE ions by initially operating the column at pH 5.5 where REEs remained bound to the column (Fig. 9A, Zone A). A linear elution gradient from pH 4.75 to pH 3.5 was then applied followed by constant elution at pH 3.0. Lanthanum (LaCl3) was separated from the remaining REEs (Zone B) with a purity ranging from 92–96% and a yield exceeding 80% (Fig. 9B). The remaining REEs were group-eluted under pH 3.5 elution conditions (Zone C). In conventional processes, removing impurities alone would require multiple solvent extraction stages. Here, HEW5 demonstrates its ability to achieve impurity removal and separation of the most abundant REE in the mixture in a single stage, highlighting its efficiency and potential for simplifying REE separation workflow.
|  | ||
| Fig. 9 (A) Single-stage separation of lanthanum from a simulated bastnäsite leachate solution using immobilized HEW5 achieves high yield and purity. (B) Molar composition of Zone B, showing significant lanthanum enrichment (∼95%) compared to the feed, which contains less than 30% lanthanum. Results are representative of triplicate trials; additional details are provided in ESI (Fig. S22†). | ||
The lanthanides share physical and chemical similarities with calcium ions, such as ionic size and coordination geometry, which allow REEs to substitute for calcium ions in calcium-binding proteins. This property opens the door to utilizing the extensive, well-studied library of calcium-binding proteins for protein-based separation of REEs.
In this study, we performed a bioinformatic search to identify promising candidates for REE binding and separation. The search results were narrowed to seven domains representing different coordination geometries of calcium-binding proteins. In vitro testing revealed a strong correlation between the charge of the binding loop (isoelectric point) and its affinity for REEs. We concluded that highly charged, aspartic acid-rich binding loops exhibit greater electrostatic repulsion, influencing the stabilization energy upon ion binding and correlating with affinity. Selectivity within the lanthanide series appeared to be a function of ionic size, with optimal binding near 1 Å, corresponding to the ionic size of calcium. While selectivity varied among proteins in solution, immobilized proteins demonstrated higher selectivity toward intermediate REEs. This behavior may be attributed to the higher valency states of intermediate REEs, entropic effects of immobilization, or the increased prevalence of carboxyl groups (aspartic acids) contributing to selectivity. Interestingly, a similar selectivity trend to the formation constants for acetate (Visual MINTEQ) within the REE series was observed, which might suggest that REE preference is due to the strength of the REE–carboxylate interaction.
One notable candidate identified in the bioinformatic search was HEW5, a protein presumed to regulate calcium ion concentrations in the cytosol of Nocardioides zeae. HEW5 is a high-capacity binder, with nearly seven binding sites per 100 amino acids compared to fewer than three in EF-hand domains. Competition experiments with XO revealed sub-micromolar affinity of HEW5 for lanthanides and the highest selectivity within the lanthanide series among the tested proteins via FRET and ITC.
We leveraged the selectivity of HEW5 for a single-stage, chelator-free separation of an equimolar lanthanum–neodymium mixture, two major components of bastnäsite leachate. This process achieved high purity (>90%) and yield (90%) for both elements. Additionally, immobilized HEW5 was employed to remove non-REEs from a simulated leachate stream and separate lanthanum (>90% purity) from other REEs in a single stage under mild conditions.
| Footnotes | 
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5sc02315g | 
| ‡ These authors contributed equally. | 
| This journal is © The Royal Society of Chemistry 2025 |