Open Access Article
Bernard D. G. Eenink
a,
Josephin M. Holsteinb,
Magdalena Heberleinab,
Carina Dilkautec,
Joachim Jose
c,
Florian Hollfelder
b,
Bert van Loo
d,
Erich Bornberg-Bauer
*ae,
Tomasz S. Kaminski
*f and
Andreas Lange
*a
aInstitute for Evolution and Biodiversity, University of Münster, Germany. E-mail: andreas.lange@uni-muenster.de; ebb@uni-muenster.de
bDepartment of Biochemistry, University of Cambridge, UK
cInstitute of Pharmaceutical and Medicinal Chemistry, University of Münster, Germany
dDepartment of Applied Sciences, Northumbria University, Newcastle-upon-Tyne, UK
eDepartment of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
fDepartment of Molecular Biology, Institute of Biochemistry, Faculty of Biology, University of Warsaw, Warsaw, Poland. E-mail: ts.kaminski2@uw.edu.pl
First published on 5th February 2026
Characterizing the dynamics and functional shifts during protein evolution is essential, both for understanding protein evolution and for rationalizing efficient strategies for e.g. enzymes with desired and effective functions. Most proteins organize in families, sets of divergent sequences which share a common ancestor and have a similar structural fold. Here, we study aryl sulfatases, a subfamily of the large and evolutionary old alkaline phosphatase superfamily. We demonstrate how ultrahigh-throughput droplet microfluidics can be used for studying aryl sulfatases and their computationally reconstructed putative common ancestors. We compare the evolvability and robustness of three ancestors and three extant aryl sulfatases which all exhibit catalytic promiscuity towards a range of substrate classes. Using varying mutations rates, eleven libraries were constructed and expressed in single-cell microdroplets. In general, higher mutation rates resulted in wider distribution of active variants but fewer improved variants overall. However, the impact of mutation rate differed between enzymes, with some benefiting from higher and others from lower mutation rate, underscoring the need to test diverse mutagenesis regimes.
Enzyme promiscuity is a key factor in the evolution of new functions,5,6 providing organisms with moderate activity toward non-standard substrates and lowering the threshold for selective advantage.6–8 By turning over non-standard substrates, even before gene duplication,6 promiscuous activity can enable the rapid development of biocatalysts for green chemistry and xenobiotic degradation.9
Directed evolution mimics natural evolution10,11 to improve traits such as catalytic efficiency,12 stability,13 enantioselectivity,14 and medium tolerance.15 Success of directed evolution relies on both the kinetic parameters of the starting enzyme but also on the shape of the local fitness landscape.9,16 For example, shallow fitness peaks may be easier to reach and variants may be easier maintained due of the sheer number of viable variants in the near vicinity while steeper fitness peaks may be difficult to reach and variants more easily lost (“survival of the flattest”).17,18
Model substrates for ASs are widely available, and can be used to detect sulfate, phosphate and phosphonate esterase activity using a variety of detection methods (Fig. 1).20
Recently, ASR has been used to create highly stable and functional proteins.38–42 Additionally, mutagenesis has been used to further improve proteins derived from ASR, using both rational design,38 directed evolution,39 or both.40 However, previous studies mostly revolve around introduction of mutations in an ancestral backbone, and directed evolution on ancestral enzymes is compared with campaigns previously conducted on extant enzymes, rather than direct comparison in one experimental setup.
In this study, we perform parallel directed evolution campaigns on ancestral and extant enzymes from the AS family using identical parameters, conditions, and experimental setups. This approach enables a direct, high-throughput comparison of the evolutionary trajectories and performance of ancestral versus extant enzymes.
Recent studies show inoculating droplets with a single cell followed by cell growth inside droplets can improve sensitivity and recovery while preserving genotype-phenotype linkage.50,51 The E. coli autodisplay system (Fig. 2)48,49 is used to present enzymes, including multimeric enzymes52,53 on the cell surface. This technique enables the recovery of live, intact cells after sorting.
![]() | ||
| Fig. 2 Schematic overview of the sulfatase enzymes displayed on the outer membrane of E. coli cells. A signal peptide directs the fused protein towards the periplasmic space where the AIDA β-barrel folds into the outer membrane and displays the passenger enzyme at the surface.49,51 | ||
000g, 4 °C), the supernatant was passed through a 0.45 µm syringe-driven filter. The proteins were purified using affinity chromatography with a Strep-Tactin column (IBA).
Peak fractions were combined, concentrated down to 100 µM. The buffer was exchanged to 50 mM Tris-HCl pH 8.0 by passing the protein solution through PD MiniTrap G-25 columns (GE Healthcare). Protein concentration was determined from absorption at 280 nm. The extinction coefficients and molecular weights (MW) were calculated using ProtParam at ExPASy (https://web.expasy.org/protparam/). Protein aliquots were either used immediately or frozen in liquid nitrogen and stored at −80 °C.
:
1 with a solution of 40 µM fluorescein disulfate in SID buffer pH 7.5 just upstream the flow-focusing (FF) droplet generation junction. At the FF junction, fluorinated oil HFE-7500 (3M) containing 1% (v/v) fluorosurfactant-008 (RAN Biotechnologies) was used to break the continuous stream aqueous phase and generate monodisperse 40 pL droplets with an expected cell occupancy of λ = 0.35 (assuming Poisson distribution). Flow rates were following: 40 µL min−1 for both aqueous phases and 160 µL min−1 for the oil/surfactant phase. The droplet formation was monitored on an inverted microscope (SP981, Brunell Microscopes) equipped with a high-speed camera (Miro ex4, Phantom Research). The droplets were collected in a droplet chamber (as described in ref. 62). Generated droplets were incubated for 1 to 3 days at 30 °C.
The droplets were incubated for 3 days in dedicated droplet chambers.62 The stability and monodispersity of microfluidic droplets was checked using inverted fluorescence microscopy (EVOS FL, Thermo Fisher) before and after incubation. Additionally, droplet stability was verified during flow and fluorescence detection. Droplets entering the detection channel that deviated from the defined size threshold, due to issues such as unwanted merging during incubation, were not sorted. This additional gating was enabled by automated, high-throughput processing of fluorescence signals using field-programmable gate array (FPGA) electronics and LabVIEW software.
During incubation the cells were aerated50 by flushing with fluorinated oil HFE-7500 (3M) containing 1% (v/v) fluorosurfactant-008 at a rate of 4 µL per minute. After 2–3 days the droplets were sorted using FADS as previously described in van Loo et al., 2019.12 The LABVIEW script used is available online https://github.com/droplet-lab/spinDrop/tree/main/LabVIEW%20FADS. Droplets were re-injected from the droplet chamber onto a microfluidic chip (design: https://openwetware.org/wiki/DropBase:droplet_electrosorting_3) using HFE7500 (3M) containing 1.5% (v/v) fluorosurfactant-008 (RAN Biotechnologies). Droplets were spaced with oil and pushed through a narrow detection channel to allow single droplet measurement. As the droplets were pushed past a 488 nm laser in the sorting Y-junction, the fluorescent activity inside each droplet was measured by recording emission at 497–553 nm to quantify substrate turnover. When fluorescent activity exceeded a threshold (dependent on wild-type enzyme) pulse and function generators were triggered and generated a square pulse at 8 volts, which was amplified 100-fold by a high-voltage amplifier (610E, Trek) and applied on the sorting device via salt-water electrodes (5 M NaCl). This pulse pulled the selected droplets into a collection channel. The threshold was set to 0.1% of total droplets during a calibration and stabilization phase. Droplets sorted during this phase were discarded, once droplet flow was stabilized and threshold established, collection tubes were added to the positive channel channel and waste channel and droplet collection started. Droplets were collected at a high frequency until a total of 1
000
000 droplets were sorted.
A FADS sorting threshold of 0.1% of total droplets was selected. This stringent threshold ensured that only substantially improved variants were sorted and recovered. Considering an average occupancy of 0.35 cells per droplet, this corresponded to sorting approximately the single most active variant per ±300 screened variants. The choice of sorting threshold represents a trade-off between stringency and yield. A lenient threshold risks inclusion of wild-type or wild-type-like variants, thereby increasing the burden of downstream rescreening. Conversely, an overly stringent threshold necessitates screening an excessive number of droplets to recover sufficient positive variants. We believe that a 0.1% threshold represents a suitable compromise under the conditions used here. Future studies may further evaluate how varying this parameter influences the recovery and diversity of improved variants.
The droplets in the positive collection channel were collected in a solution of 100 µL Lucigen recovery medium, 45 µl HFE-7500 (3M) and 5 µL PFO surfactant. The collection tubing was flushed with HFE-7500 (3M) to collect all cells. An additional 500 µL of recovery medium was added after collection and the solution was mixed. The liquid phase was plated on LB + amp plates and incubated overnight.
To compare the evolvability of extant and ancestral enzymes we selected three ancestral members (Anc497, Anc498 and Anc499, descending from to the presumed common ancestor of ASPMH) and three extant members (SpAS2, SaAS and AkAS) of the AS family of the previously described AP superfamily.12,20,63 These extant enzymes were selected to obtain a balanced representation of the AS family, and have previously been successfully expressed and purified61 (Fig. 4). Each represents a sub-group of the ASs that descend from ASPMH. We reconstructed the ancestors from SpAS1 towards the root of our set of ASs until the common ancestor of ASs (see Fig. 4). We selected the ancestral enzymes to provide a range of the evolutionary history of ASs.
We successfully expressed each enzyme on the surface of E. coli. We confirmed expression and activity of the enzymes by observing the successful turnover of 4-NPS by intact E. coli cells surface displaying AS and purification of membrane fractions (Fig. S1). We confirmed turnover of 4-NPS for all ancestral (Anc496, Anc497, Anc498 and Anc499) and extant (SpAS2, SaAS and AkAS) ASs in the surface display assay. Observed enzyme activities were roughly proportional to those observed in cytosolically expressed and purified proteins.
To assay the thermostability of the enzymes, the melting temperature (Tm) of each enzyme was determined using a thermal shift assay (as described in Van Loo et al.20). The Tm of the wild-type enzymes selected was between 45.6 °C, and 57.4 °C. All enzymes exhibited Tm values well above the 30 °C assay temperature, indicating sufficient thermal stability under assay conditions. Among the enzymes tested, Anc497 showed the highest thermostability (57.4 °C), whereas AkAS was the least thermostable (45.6 °C). Anc498, Anc499, SpAS2, and SaAS displayed similar thermostabilities, with Tm values within 2 °C of one another (Table 1). Initial kinetic parameters varied between enzymes, both ancestral and extant. The Km values of the extant enzymes differed by more than an order of magnitude, ranging from 9.9 × 10−5 for AkAS to 2.0 × 10−3 for SpAS2. The Km's of ancestral enzymes were more similar to each other and fell in between the Km's of the extant enzymes. The initial Kcat values of the ancestral enzymes were lower than those of the selected extant enzymes. Moreover, a trend was observed in which deeper ancestral nodes corresponded to lower initial Kcat values, whereas the extant enzymes displayed slightly higher and more comparable Kcat values.
| Properties of wild-type and ancestral sulfatases | Kcat | Km | Kcat/Km | Tm |
|---|---|---|---|---|
| Anc497 | 0.26 ± 0.01 | (3.2 ± 0.4) × 10−4 | (8.1 ± 1.1) × 102 | 57.4 |
| Anc498 | 0.56 ± 0.08 | (5.6 ± 0.37) × 10−4 | (1.01 ± 0.14) × 103 | 49.6 |
| Anc499 | 1.01 ± 0.4 × 101 | (8.4 ± 0.8) × 10−4 | (1.3 ± 0.1) × 104 | 48.6 |
| SpAS2 | (4.2 ± 0.4) × 102 | (2.0 ± 0.3) × 10−3 | (2.1 ± 0.4) × 105 | 51.7 |
| SaAS | (2.00 ± 0.04) × 102 | (2.8 ± 0.1) × 10−4 | (7.0 ± 0.4) × 105 | 49.5 |
| AkAS | (9.7 ± 0.2) × 101 | (9.9 ± 0.6) × 10−5 | (9.9 ± 0.6) × 105 | 45.6 |
After verifying the size and integrity of each library we transformed the remainder of the ligation mix into E. coli and divided the cells equally on agar plates. Sulfatase activity was monitored using the FADS with fluorescein disulphate (FDS) as a fluorescent model substrate, in microtiter plates using both FDS and 4-nitrophenol sulfate (4-NPS) as substrates and on agar plates using X-sulfate (5-Bromo-4-chloro-3-indoxyl sulfate), as substrate (Fig. 1).
Ancestral AS sequences were previously inferred towards the root of our set of ASs until the last common ancestor of ASs and PMHs (see Fig. 4) ancestor of AS and PMH. Detailed phylogenetic analysis was performed on extant AS family members.22 In this study we describe a parallel directed evolution campaign on three ancestral and three extant AS members for improved catalytic efficiency towards FDS. Autodisplay in E. coli, combined with microfluidic FADS, is used to screen and sort living E. coli cells presenting unique enzyme variants, with a focus on the shape of the activity distributions of enzyme variants (local fitness landscapes) obtained from FADS sorting (Fig. 1).
All mutant libraries reached the target size of 105 unique variants (Table S1). We determined the amino acid substitution rates for dPTP and 8-oxo-dGTP libraries using Sanger sequencing. In total, we screened 12 libraries, representing two different mutational conditions: (i) dPTP nucleotides (2.3 ± 1.7 mutations) and (ii) 8-oxo-dGTP nucleotides (3.7 ± 2.6 mutations) and six different enzymes, each either inferred ancestral or extant members of AS group of the AP-superfamily. We verified the occupancy of the droplets by observing a sample of droplets using fluoresence microscopy (Fig. S3). For each library we screened 106 occupied droplets (λ = 0.35). As a control and to form a baseline of wild-type activity we screened 105 occupied droplets of each respective wild-type enzyme in the same session using the same device for each set. To ensure reproducibility of the microfluidic screening, detector sensitivity was set to comparable levels for each experiment and adjusted when necessary to achieve similar background signal intensities for empty droplets, which accounted for approximately 70% of droplets in each library. When required, measured fluorescence values were additionally normalized to the background signal of empty droplets to enable direct comparison between libraries. Therefore, the reproducibility of these measurements appears to meet the standards of conventional droplet-based microfluidic screening experiments.
With the help of FADS up to 106 variants could be tested for each library, guaranteeing a large coverage of all possible single mutations and full coverage of the targeted 105-member mutant libraries. Usage of the E. coli autodisplay system48,49 allowed for the direct recovery of recovered intact cells after screening.
All enzymes that had been previously shown to express solubly and active in cytosolic form12,61 also expressed well as autodisplay constructs. By using FADS with living cells, it became possible to directly recover the screened cells and plating them on agar instead of recovering and re-transforming the plasmid.
While the rates of improvement varied widely (Fig. 5), the overall trend being observed showed a larger increase in improvement in sulfatases having a lower initial activity (Table 1) before introducing mutations. The two libraries with the greatest degree of improvement, Anc498 and SaAS (Fig. 5) came from wild-type enzymes with a Tm in the middle of the enzymes considered (Table 1). A qualitative effect of mutation rate on enzyme evolvability can also be observed between dPTP (lower mutation rate) and 8-oxo-dGTP (higher mutation rate). Here, we observed that a higher mutation rate leads to a lower improvement in some libraries.
The distribution of extant SaAS broadly followed the theorized pattern, i.e. that an increased mutation rate leads to a lower number of improved variants. However, the distribution shows variants with higher activity than in the lower mutation rate library (Fig. 5). This implies that variants with more than one mutation are leading to a maximum improvement, and that the coverage of the library is sufficient to find improved variants with multiple mutations. Even though a majority of variants contain multiple mutants, the library size of 105 can still be assumed to cover the majority of the ±5400 possible single mutants due to oversampling.12 Paradoxically, even though most sorted variants contain multiple mutations, the library still does not exhaustively cover all possible (±3 × 107) double mutations. In Anc497 and Anc498 a different trend is found. In the case of Anc497 most improved variants are seen in the dPTP library while for Anc498 most improved variants are seen in the 8-oxo-dGTP library (Fig. 5). This distribution sheds light on the shape of the local fitness landscape, as different mutational rates lead to a quantitatively different outcome. Notably, for Anc499, which had the highest initial catalytic efficiency among the ancestral enzymes, little to no improved variants were observed in the library (Fig. 5). This is reflected in plate re-screening, where no variants with improved activity towards FDS were found (Table 2). Among the other extant enzymes, SpAS2 and AKAS we found a shape similar to SaAS although the proportion of improved variants and the highest improvement were much lower, especially in the case of SpAS2. This is reflected in the top improvements found for each variant in microtitre plate rescreening (Table 2).
| FDS | 4-NPS | Library | |
|---|---|---|---|
| Anc497 | 4.47 | 4.83 | dPTP |
| Anc498 | 1.24 | 4.00 | dPTP |
| Anc499 | 0.74 | 1.68 | dPTP |
| SpAS2 | 52.46 | 2.07 | 8-Oxo-dGTP |
| SaAS | 289.17 | 5.05 | dPTP |
| AkAS | 58.04 | 20.59 | 8-Oxo-dGTP |
However, blue-white screening provides only qualitative results and does not yield quantitative information. Therefore, it was used as a pre-screening step prior to microtitre plate screening to ensure that all active variants were selected. Because the wild-type activity levels of some ASs were relatively low, certain variants may appear white due to reduced activity toward X-sulfate, even if they exhibit improved activity toward FDS or 4-NPS. Consequently, blue-white screening was used to identify only positive variants and not negative ones.
Enrichment, defined here as the proportion of recovered cells that show activity towards X-sulfate, varied widely between variants. A trend was observed that dPTP libraries (low mutation rate) consistently showed a higher degree of enrichment than their 8-oxo-dGTP library counterparts (high mutation rate, see Table 2). AkAS was the exception, with a higher enrichment in the 8-oxo-dGTP library, consistent with the relatively poor performance of the AkAS dPTP library in screening (Fig. 5). Furthermore, enrichment was higher in variants with higher initial activity, while several wild types had low activity towards X-sulfate near the detection limit. Hence, the lower enrichment may be caused by catalytic efficiency toward the X-sulfate model substrate not increasing proportionally with the catalytic efficiency towards the FDS substrate, that was used in microfluidic sorting. Thus, white colonies can still contain variants with improved catalytic efficiency toward FDS and 4-NPS.
Subsequently, preliminary microtitre plate screening was carried out on variants picked from the sorted libraries. 176 picked clones of each library were screened. Variants that showed improved catalytic efficiency towards 4-NPS were found in each library, while variants that showed improved catalytic efficiency towards FDS were found in each library except Anc499 (Table 2). The largest improvements were found in variants with low initial activity. When comparing ancestral enzymes to extant enzymes the most notable difference was that while all extant enzymes showed their greatest improvement towards the FDS substrate also used in FADS screening, ancestral enzymes showed their greatest improvements in catalytic efficiency over their wild-types towards the 4-NPS substrate. This difference between substrates is most pronounced in ancestral enzymes Anc498 and Anc499, and reversely extant enzymes SpAS2 and SaAS. The biggest improvements towards both 4-NPS and FDS were observed in ancestral enzymes Anc497 and Anc498 for both FDS and 4-NPS, and extant SaAS for activity towards FDS and AkAS for activity towards 4-NPS, respectively. Coincidentally, these four enzymes showed low initial activity towards FDS. Although only anecdotal, these results hint that variants with lower initial catalytic efficiency may ‘catch up’ and exceed wild-type enzymes with high initial catalytic efficiency even after one round of directed evolution.
Using high-throughput fluorescence-activated droplet sorting (FADS), we conducted parallel screening of modern and ancestral enzyme libraries and performed preliminary re-screening of the recovered variants.
The variation in mutation rates highlights the distinct shape of the local fitness landscape for each enzyme. Earlier studies on malate dehydrogenase and lactate dehydrogenase enzymes already showed that a single crucial mutation can switch substrate specificity and greatly impact enzyme activity.30 Given that most mutations are either neutral or destabilizing, stabilizing mutations may be necessary to achieve a functional protein with improved or distinct functions. This contrast may explain the disparity between enzyme backbones capable of attaining novel functionality in one step and those needing additional backbone stabilization to achieve novel function.34,65
As expected, the observed mutational landscape varied depending on the enzyme and mutational regime. Naturally, increasing the mutation rate would flatten the curve and possibly widen the activity distribution. Given that most mutations are either neutral or detrimental, accumulating multiple mutations increases the likelihood of encountering a deleterious mutation that impairs variant function.
In addition to an increase in deleterious mutations, one would expect an increase in highly improved variants if multiple point mutations are needed to achieve a substantial increase in function. These mutations can either serve as compensatory mutations, necessary to counteract stability loss or folding issues, or as additional mutations directly facilitating changes in function. By introducing multiple mutations in a single round, variants requiring compensatory mutations or epistatic interactions become available. However, each mutation may also cause the enzyme to be non-functional. Here, a balance between the possibility of jumping evolutionary ‘ratchets’ and avoiding the accumulation of deleterious mutations appears. Extra mutations can either be compensatory mutations that offset thermostability loss or folding issues, or all mutations can directly facilitate changes in function. Often, while enzymes with different functions differ in many residues, only a few mutations are responsible and sufficient for a change in function.30,66 When comparing this phenomenon, in which a few mutations are able to greatly shift the activity of an enzyme, to our library screening we observed a similar phenomenon especially in the case of SaAS and AkAS (Fig. 5). An interesting case is observed in Anc497, where the dPTP library led to greatly improved variants up to five-fold. Yet, the 8-oxo-dGTP library led to much less improved variants, with the top variant showing only half the improvement of most active dPTP variants, as well as showing significantly fewer improved variants (Fig. 5). These results are consistent with a specific fitness landscape for each variant, such that the optimal number of mutations to achieve an improvement may vary between enzymes. The histograms indicate that ancestral libraries yielded on average fewer improved variants than the extant enzyme SaAS (Fig. 5). However, the ancestors yielded more improved variants than SpAS2 or AkAS. Interestingly, in most ancestral enzymes the lower mutation rate dPTP library yielded more improved variants, while the higher mutation rate 8-oxo-dGTP library yielded more improved variants in extant enzymes.
Furthermore, when improved variants were screened in microtiter plate format, increases in catalytic efficiency towards both FDS as well as 4-NPS (that was not used for microfluidic screening) were found. This effect was most pronounced in ancestral variants Anc497, Anc498 and Anc499, in which the increase catalytic efficiency towards 4-NPS was more substantial than the increase in catalytic efficiency towards FDS (Table 2).
We calculated the percentage of variants exceeding the wild-type (WT) activity threshold (the percentage of non-empty library variants that exceeded the average WT-activity), which ranged from 0% for Anc499 to 38.7% for the SaAS ptp library and 56.5% for the AkAS 8-oxo-dGTP library. Overall, the proportion of variants exceeding WT correlates well with the likelihood of identifying the best-performing variant during microtiter plate screening using the FDS substrate. For Anc499 the top variant identified was below the WT average, whereas for SaAS, the best variant, showing nearly 290-fold enrichment, was also the library with the highest percentage of variants above the WT threshold in droplet-based screening. Together, these results demonstrate quite strong agreement between droplet-based screening and whole-cell assays performed in microtiter plates.
Overall, these results show that ancestral enzymes can show promise as starting points for directed evolution. When it came to extant enzymes, our results showed that the enzymes with greatest starting kinetic parameters showed the least overall improvement, with SaAS outperforming these initially more fit enzymes. Notably, libraries with a lower mutation rate showed a greater proportion of highly improved variants in ancestral libraries, whereas higher mutation rates showed greater proportion of highly improved variants in extant libraries. This characteristic makes ancestral enzymes especially interesting when screening is limited to lower throughputs due to limitations such as substrate or product detection.
We demonstrated that cell-surface-displayed enzyme libraries, grown within droplets, are effective for sorting large libraries—each containing over 105 variants—in parallel within a practical time-frame, as well as applying droplet microfluidics to screen proteins derived from ancestral sequence reconstruction for the first time.
We conclude that, in addition to screening of directed evolution libraries, microfluidic FADS sorting and recovery of autodisplayed proteins can be utilized for a variety of applications in high-throughput screening and sorting of proteins. An example would be the screening of synthetic libraries of proteins derived from multiplexed gene synthesis.67,68
When comparing the variant activity histograms of the tested Arylsulfatases (ASs), we found that evolvability was maximized under a lower mutation rate for ancestral enzymes while the extant enzymes benefited from a higher mutation rate. On the other hand, for the ancestral libraries the lower mutation rate dPTP library resulted in a greater proportion of highly improved variants (Fig. 5), demonstrating a qualitative difference in the ideal mutation rate of different enzymes.
The interpretation of data from histograms and microtiter plate rescreens points towards qualitative differences in ideal mutation rate, in which the ancestral enzymes studied show a higher rate of improvement at a lower mutation rate. This work lacks detailed kinetic and structural analysis of improved variants. Further work is required to elucidate the kinetic parameters and structural properties of the improved variants and solidify the conclusions found.
In the future, this study could be followed up with a much more thorough biochemical characterization of improved variants, including detailed kinetics of purified enzymes and elucidation of crystal structures, as well as introduction of the mutations found in the screening into corresponding positions in different enzyme backbones.
Reconstructed ASs showed a propensity towards requiring fewer mutations for improved function. Therefore, ancestral enzymes, particularly in the context of ASs, can serve as valuable initial scaffolds for rapidly achieving significant enhancements in enzyme activity.
Additionally, “imperfect reconstruction”—stemming from less than 100% certainty in assigning residues at ambiguous sites69—can be refined through high-throughput directed evolution. Comprehensive coverage of single mutations includes all one-amino acid-off alternative reconstructions, allowing for the selection of variants that enhance functionality.
An initial directed evolution campaign with multiple (both ancestral and extant) enzymes increases the odds of—success, as the candidate with optimal initial catalytic efficiency may not have a traversable, single/double mutation route towards desired function.
Accuracy versus throughput is a consideration throughout the experiment. This leads to a funnel-like progress where high-throughput screening is used to select improved variants from a large pool, which are then further characterized using lower throughput microtitre plate screening of variants and finally purified protein.
As ASs typically show promiscuity towards other substrate classes the concept of a low initial activity construct with a broad fitness landscape can be extended towards different enzymes with a promiscuous activity towards the desired reaction.
It should be cautioned that screening towards a limited number of substrates only investigates biological specificity and not intrinsic specificity.37 As of such, while ancestral enzymes may be more biologically generalist, a trend linking between ancestrally reconstructed sequences and reduced intrinsic specificity has not been shown, and might not be expected.37
These results also open up several lines of inquiry for future work. A first line of inquiry is a longer directed evolution campaign starting from improved variants of the best performing ancestral and extant enzyme. Another avenue is expanding the screening from sulfate esters (4-NPS and FDS) towards substrates for promiscuous activity towards phosphonate esters and phosphate mono- and di-esters. A further path is screening in parallel through a larger array of conditions such as higher mutation rates or differently biased libraries.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d5an00865d.
| This journal is © The Royal Society of Chemistry 2026 |