Open Access Article
Wei Zhou†
a,
Yan Wang†b,
Shuning Zhang†
c,
Chengwei Zhang†d,
Jiacheng Panga,
Shaoneng Houa,
Jie Lib,
Ying Yaob,
An Su
*a,
Peixiang Ma
*c,
Hongtao Xu
*b and
Wei Hou
*a
aZhejiang Key Laboratory of Green Manufacturing Technology for Chemical Drugs, College of Pharmaceutical Science & Green Pharmaceutical Collaborative Innovation Center of Yangtze River Delta Region, Zhejiang University of Technology, Hangzhou 310014, China. E-mail: ansu@zjut.edu.cn; houwei@zjut.edu.cn
bShanghai Institute for Advanced Immunochemical Studies & School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China. E-mail: xuht@shanghaitech.edu.cn
cShanghai Key Laboratory of Orthopedic Implants, Department of Orthopedic Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 201210, China. E-mail: mapx@shsmu.edu.cn
dCollege of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
First published on 3rd November 2025
Chemical reactions compatible with multiple functionalities are essential for rapid, programmable, and automatable synthesis of functional molecules. However, achieving such reactivity poses significant challenges. Here, we developed a novel multi-orthogonal C(sp2)–Se bond formation reaction between benzoselenazolones and boronic acids via Ag(I)-catalyzed selective selenium(II)–nitrogen exchange. This chemistry is compatible with diverse functionalities, enabling sequential and programmable synthesis. Moreover, it features modular, high-yielding (485 examples, with yields or conversions exceeding 70% in 95% of cases), and switchable reaction systems under mild conditions. Its practical utility was exemplified through late-stage functionalization of natural products, peptide modification and ligation, diversified synthesis, sequential click chemistry, protecting group-free syntheses of sequence-defined oligo selenides (nonamers), on-plate nanomole-scale parallel synthesis (200 nmol, 412 selenides), and DNA-encoded library (DEL) synthesis (10 nmol, 92 examples). Notably, a target-based screening identified SA-16 as a potent CAXII inhibitor with an IC50 value of 72 nM. Furthermore, a machine learning-based model (SeNEx-ML) was established for reaction yield prediction, achieving 80% accuracy in binary classification and 70% balanced accuracy in ternary classification. These results demonstrated that this chemistry serves as a powerful tool to bridge the selenium chemical space with the existing chemical world, offering transformative potential across multidisciplinary fields.
Programmable synthesis is an emerging synthesis pattern that has attracted significant attention due to its great potential in the alterable and tunable synthesis of advanced functional small molecules, oligomers, and polymers.17 As depicted in Fig. 1b, it primarily contains two patterns. The first pattern involves the successive functionalization of the multiple functionalities decorated on the starting substrates by leveraging their distinct reactivity,18,19 while the second pattern entails iterative building block (BB) connecting reactions, wherein the requisite functionalities for the subsequent connecting reaction are introduced in preceding steps.20–22 Both patterns heavily rely on multi-orthogonal reactions with the ability to selectively functionalize one handle while preserving the others, which is quite challenging. Therefore, only a limited number of such syntheses have been reported to date. To meet the ever-growing demands for constructing advanced Org-Se compound libraries with expanded chemical space,23 there is an urgent need for a synthetic toolbox that enables rapid, modular, and potentially automatable access to structurally diverse Org-Se compounds. In this context, despite the challenges, the development of highly selective and predictable selenylation chemistry capable of programmable functionalization of building blocks (BBs) bearing multiple transformable handles without the requirement for mandatory protection/deprotection becomes an important and appealing task. This will not only enrich the synthetic toolbox of multi-orthogonal selenylation chemistry but also hold great significance for seleno-medicinal chemistry and chemical biology.24–26
Inspired by the biochemical reaction between ebselen and cysteine (Cys) residues,27,28 our group has designed benzoselenazolone (BSEA, the core structure of ebselen) as a novel bifunctional Se source for efficient selenylation reaction development. Based on the robust performance of these reactions, we have put forward the concept of Se(II)–N exchange (SeNEx) chemistry, which encompasses substitution events at an electrophilic Se(II) center to break Se–N bonds, enabling the rapid and flexible formation of Se–X bonds, especially Se–C bonds around the Se(II) center.29–31 Especially, by leveraging the SeNEx strategy, we have developed several C(sp2)–Se bond formation reactions by employing in situ formed C(sp2)–Rh(III) species32 and free indole as nucleophiles,33,34 respectively. However, flaws such as elevated temperature requirements32 and relatively narrow substrate scope (limited to indole and electron-rich aromatic rings)33,34 have restricted their application in large-scale library construction while hindering the exploration of broader seleno-chemical space. To overcome these obstacles, we hypothesized that boronic acid would serve as an ideal coupling partner due to its ability to undergo a trans-metalation under mild conditions,35 generating nucleophilic carbon–metal species that are less reliant on the electric properties of aromatic rings (Fig. 1d). Notably, although selenylation reactions between aryl boronic acids and some Se electrophiles (e.g., selenyl chloride,36 diselenide,37 and selenocyanate38) have been reported, these methods typically suffer from some intrinsic drawbacks, such as poor atom economy, harsh conditions (up to 100 °C), toxic byproduct (CN−) and poor functional group tolerance, which hindered their practical application in large-scale library construction such as on-plate parallel synthesis (Fig. 1c). Therefore, a mild, general and multi-orthogonal C(sp2)–Se bond formation chemistry is still highly desirable and can be devised from commercially available boronic acids and readily available BSEAs39 via a SeNEx strategy.
However, three significant challenges need to be addressed:
(i) Achieving multi-orthogonality and high chemo-selectivity on Se–N over a wide range of competitive electrophiles, including ester, ketone, –CHO, –SO2F, S–N, –NO2, –N3 and –CN;
(ii) Potential protodeboronation issue in the presence of H2O and certain groups with active hydrogen such as –COOH, –OH, –NH2, amide, and unprotected heterocycles;
(iii) Coordination interference caused by the Se atom and adjacent amide group in the product leading to reduced catalyst reactivity.
We proposed that these formidable intractable obstacles can be overcome through fine-tuning the catalytic systems. Herein, we present an unprecedented multi-orthogonal SeNEx chemistry between BSEAs and boronic acids via Ag(I) catalysis, which features modular, predictable, robust, high-yielding, mild and switchable reaction conditions and is operationally simple and orthogonal to a broad scope of reactive handles, such as carboxylic acids, free amines, azides, sulfonyl fluorides, silanes, aromatic (pseudo)halogens, and others, thus allowing this chemistry to be orthogonal to copper-catalyzed azide–alkyne cycloaddition (CuAAC),40 sulfur–fluoride exchange (SuFEx),41 and nearly all top 20 reactions in medicinal chemistry (e.g., amide bond formation and cross-coupling),42 enabling the sequential and programmable synthesis of advanced org-Se compounds efficiently when combined with other established reactions. The practical application was successfully demonstrated through natural product and peptide modification, multi-orthogonal and programmable synthesis, on-plate nanomole-scale parallel synthesis, and DEL synthesis (Fig. 1d).
:
H2O = 2
:
1, double-distilled H2O and borate buffer (pH 9.4)) are workable (90–95%, Table S1, entries 15–19 and 39–40). Importantly, this reaction could also give good conversions when conducted at 10 times dilution (10 mM, Table S1, entry 20) or in miniaturization (200 nmol, 4 mM, Table S1, entry 21), indicating its potential in nanomole-scale in situ drug screening library synthesis. For the CuI catalytic system, the ligand is a critical factor in promoting the transformation. 5,5′-Dimethyl-2,2′-bipyridine (L3) was the best ligand in the reaction (95%, Table S1, entries 22–28). Likewise, both polar protic (EtOH and iPrOH) and aprotic (EA, DMF, DMSO, and MeCN) solvents are feasible for this reaction (92–95%), while a mixed solvent system (96%, EA
:
DCM = 1
:
1) is the optimum (Table S1, entries 29–38).
With the established reaction conditions in hand, we next explored the substrate scope and generality (Fig. 2 and SI, Fig. S3). For BSEAs, N-substituents, including alkyl (1–7, 23–26, 17–18, and 69), aryl (8–12 and 27), heterocycles (13–16, 19–22, 27, 28, 71, and 72), were all well tolerated, affording the desired selenides in 66–99% yields. Besides, the electronic effect of substituents on the phenyl moiety of BSEAs is not obvious, and both electron-withdrawing groups (EWGs, 23–24 and 27–28) and electron-donating groups (EDGs, 25–26) are compatible, delivering the desired selenides in high to excellent yields (85–95%). Notably, the N-unsubstituted BSEA A58 is an inferior coupling partner in this reaction, affording the desired product 73 in a low yield (25%) under standard conditions. When conducted at 60 °C, the yield of 73 can be increased to 55%, and compound 74 was obtained in a satisfactory 95% yield when 2-naphthaleneboronic acid B80 was employed. For arylboronic acids, a broad scope of substituents regardless of their location (o, m, and p) and electronic properties are all proved to be compatible coupling partners, delivering the corresponding selenides in good to excellent yields (29–52 and 70, 73–98%). To our delight, both heteroarylboronic acids (53–59) and alkenylboronic acids (60–62) underwent this SeNEx reaction favorably to furnish the corresponding products in 75–93% yields. The structure of 52 was unambiguously determined by X-ray crystal analysis. Notably, substrate A28 orthogonally underwent this SeNEx click chemistry to afford the selenylation product 28 in 88% yield. At the same time, the benzo[d]isothiazolone (BITA) moiety in 28 is completely unaffected, albeit the potential electrophilicity of the S–N moiety. This excellent selectivity may originate from the larger size of Se than S, which results in a reduced orbital overlap, longer bond length, and weaker bond strength of Se–N than the S–N bond. The weaker bond strength indicates that the σ* orbital of the Se–N bond is lower in energy than that of the S–N bond, making BSEA Se more reactive than its S analog.10 Significantly, this SeNEx reaction is also suitable for late-stage modification (LSF) of complex molecules, as exemplified by the efficient LSF of (−)-arctigenin (63, 88%) and dehydroabietic acid (64, 92%), the famous ligand (R,R)-TsDPEN (65, 83%), S-methyl cysteine (75), tryptophan (76), lysine (77), tyrosine (78), dipeptide Val–Tyr (66, 93%), Gly–Leu (79), Phe–Trp (80), Val–Trp (81), Tyr–Phe (82), Tyr–Lys (83), Tyr–Trp (84), Tyr–Ala (85), Tyr–Arg (86) and tripeptide Tyr–Glu–Trp (67, 85%) and Tyr–Asp–Ala (87, 88%). Impressively, two dipeptides bearing BSEA (Val–Trp) and boronic acid (Tyr–Ala) respectively can also undergo SeNEx smoothly to give the desired ligation product 68 in 81% yield, indicating its potential application in peptide ligation. Significantly, a broad scope of transformative functionalities and/or pharmacophores including terminal alkenyl (3), adamantyl (7), primary alcohol (5 and 17), phenol (12 and 39), carboxyl (6 and 38), free amine (18 and 40), pyridine (13–16, 22, and 55), pyrimidine (56), free pyrazole (19), 2′,3′,5′-tri-O-acetyl-D-adenosine (20), free indazole (21), azide (41), free indole (53), quinoline (28 and 54), carbazole (57), furan (58), thiophene (59), nitro (24, 36, 46, and 52), ester (27, 37, 63–64, and 73), amide (all cases), ketone (44), cyano (47 and 63–65), silyl (49), sulfonamide (65) and halo (11, 14, 27, 31–33, 35, 42, and 43) are all satisfactorily compatible, highlighting the excellent substrate scope of this SeNEx chemistry. It is noteworthy that many synthetic transformations suffer from a predicament in that polar and highly functionalized compounds fail to afford desired products, leading to the enrichment of hydrophobic compound sets, which are less likely to become successful drug candidates.43 In contrast, the good tolerance of polar functionalities (e.g., hydroxy, phenol, amine, carboxyl, pyridine, and pyrimidine) as well as complex natural products indicated that this SeNEx chemistry may cross this predicament. Moreover, the compatibility with terminal alkenyl, hydroxy, phenol, halogens (Cl, Br, and I), nitro, ester, ketone, silyl, free pyrazole, free indazole, free indole, –CHO, –OH, –NH2, –N3 and –COOH offers flexible and diverse handles for programmable synthesis.
Reliable strategies for the controlled synthesis of sequence-defined oligomers and polymers, which have promising applications in materials chemistry and biological research, are uncommon but highly desirable.22 Inspired by the good orthogonal reactivity of this Ag(I) catalyzed SeNEx chemistry to SuFEx, we next reason that SeNEx and SuFEx are ideal coupling reactions for building sequence-defined oligomers in a protecting group free and iterative-growth pattern. As illustrated in Fig. 4, we chose compound A47 (a tryptophan derived BSEA), –OTBS containing phenylboronic acids (B81 and B82) and fluorosulfate-modified BSEAs (A49 and A51) as coupling monomers for the programmable nonamer synthesis. Trimer 4b (A–B–C) was first prepared in 71% yield over two steps from SeNEx between A47 and B81 and then SuFEx with A49. Next, a second round of the SeNEx–SuFEx reaction sequence involving 4b, B82, and A51 afforded the desired pentamer 4d (A–B–C–D–E) in 60.7% yield over two steps. Starting from 4d and repeating this SeNEx–SuFEx–SeNEx–SuFEx reaction sequence ultimately delivered the desired oligo-selenide 4h (nonamer, A–B–C–D–E–B–C–D–E) in 14% yield over four steps, containing a BSEA group for potential further chain growth. Overall, the successful generation of the nonamer once again confirms the good orthogonality between SeNEx and SuFEx, and it also confirms the good feasibility of combining SeNEx with SuFEx to build sequence-defined oligomers in a protecting group free fashion. In addition, the investigation of combining SeNEx with SuFEx to prepare sequence-defined polymers will be a focus of our future research.21
Encouraged by the robust performance and high yields of this SeNEx chemistry as well as our previous experience in on-plate parallel synthesis,29,33,34 we subsequently performed a nanomole-scale parallel synthesis on 96-well microplates via the combination of 16 BSEAs and 24 boronic acids (16 × 24, 200 nmol, 4 mM, Fig. 5 and SI, Fig. S4 and S5). From the perspective of molecular diversity, the selected 16 BSEAs contain alkyl (Se-A–Se-G), aryl (Se-H–Se-K), heteroaryl (Se-L–Se-N), and amino acid (Se-O and Se-P) side chains. Meanwhile, the selected 24 boronic acids contain ortho- (BA-2–BA-4), meta- (BA-5–BA-9), para-substituted (BA-10–BA-19), electron-neutral (BA-1), EWGs (BA-2, BA-3, BA-7–BA-9, and BA-12–BA-19), EDGs (BA-4–BA-6, BA-10, and BA-11), electron-rich heterocycles (BA-21–BA-23), electron-deficient heterocycles (BA-20), and alkenyl (BA-24), respectively. The conversions were analyzed by liquid chromatography-mass spectrometry (LC-MS). Among the 384 selenides, 272 (70.8%) attained >90% conversion and 368 (95.8%) achieved >70% conversion—a generally accepted threshold for in situ biological screening47,48 (Fig. 5a). Notably, all of the 384 selenides are new structures; therefore, they could further enrich the Org-Se chemical space. In addition, it is reported that carbonic anhydrase XII (CA XII), a tumor-associated membrane protein applicable for hypoxic tumor therapy and imaging, can be recognized by aromatic sulfonamides.49,50 Given the good tolerance of this SeNEx reaction towards the sulfonamide group and to explore its potential in the discovery of bioactive molecules, we further conducted a parallel synthesis (14 × 2, 14 BSEAs and 2 sulfonamide-boronic acids, Fig. 5b) to search for novel CA XII inhibitors via in situ target-based screening. To our delight, 22 out of the prepared 24 compounds exhibited over 80% inhibition at 1 μM, and 3 out of the tested 24 compounds exhibited over 48% inhibition at 100 nM (Fig. 5c). Further resynthesis and IC50 determination indicated both SA-16 and SA-18 are potent CAXII inhibitors, with IC50 values of 72 nM and 90 nM, respectively (Fig. 5d). Overall, these positive results highlight the great potential of SeNEx chemistry in in situ nanomole-scale high-throughput medicinal chemistry. With the experimentally measured dataset comprising 447 reaction yields, we further developed a machine learning (ML)-driven reaction yield predicting model, termed SeNEx-ML, to forecast the yields of these reactions. As illustrated in Fig. 6a, the research workflow comprises four main stages: (1) structural digitization, where BSEAs, boronic acids, and selenylation products are converted to SMILES strings; (2) chemical fingerprint encoding (Fig. 6c), in which molecular structures are represented using molecular additive fingerprints (MAFs) to capture global molecular features and differential MAF (DiffMAF) fingerprints to emphasize the electronic environment around the Se(II) center;51–53 (3) model development and evaluation, where machine learning models are trained on these fingerprints to predict absolute yields and perform categorical classification of the selenylation reactions. The dataset is divided into a training set and a test set in an 8
:
2 ratio; (4) interpretation, where principal component analysis (PCA) is performed to visualize data clusters and identify structural features associated with high or low yields.54–56
To address the interpretability challenge in machine learning, we use PCA and visualization to analyze the model's learned abstract features in SeNEx reaction prediction. As shown in Fig. 6b, PCA reveals pronounced multi-cluster distributions that extend beyond simple steric considerations. High-yield reactions are clustered in the upper-left and lower regions, whereas low-yield reactions predominantly aggregate in the upper-right quadrant. Further inspection of representative reactants in high- and low-yield clusters highlights key structural trends: high-yield reactions are favored by N-substituents on BSEAs with low steric hindrance or aromatic moieties, and boronic acids are generally well tolerated (except for BA-6 and BA-20). Conversely, BSEAs bearing bulky substituents (such as Se-O and Se-P) exhibited diminished reactivity. To address these data characteristics, we propose a modular modeling framework: regression tasks are constructed to predict absolute yield values, while classification tasks grade reaction outcomes using threshold-based categories. Specifically, binary classification partitions the dataset into high and low yields at a 90% yield threshold, whereas ternary classification introduces an intermediate category, with thresholds set at 80% and 90%. Model assessment (Fig. 6d) demonstrates that the support vector machine (SVM) combined with MAF encoding attains 80% accuracy in binary classification, while NeuralNetTorch combined with DiffMAF encoding achieves 70% balanced accuracy in ternary classification. For regression, support vector regression (SVR) combined with DiffMAF encoding delivers experimental-to-predicted yield correlations with a root mean square error (RMSE) of only 6.2%, fulfilling the requirements for rapid reaction outcome prediction in practical scenarios. Representative prediction distributions across the three tasks are displayed in Fig. 6e.
In addition, the DNA-encoded library (DEL), conceptualized by Lerner and Brenner in 1992,57 has emerged as a prevalent technology in current drug discovery due to its merits of huge capacity, easy storage, and efficient screening.50,58–61 Typically, DELs are constructed via 3 to 4 cycles of interactive enzymatic DNA encoding and on-DNA combinatorial synthesis.62–65 As a result, the development of DNA-compatible reactions is one of the key tasks in DELs.66–68 Although a set of DNA compatible reactions such as diarylether synthesis,69 cross-coupling reaction,70–75 C–H activation and functionalization,76,77 photo-promoted reaction,78,79 cycloaddition reaction,80 and the on-DNA synthesis of privileged heterocycles81–83 have been developed in the past few years, only a few good reactions have been used in DEL synthesis,84 due to the highly functionalized and sensitive DNA barcodes, low reaction scale and highly diluted conditions.75,80,85 Nevertheless, the mild and robust conditions and the excellent functional group tolerance of this Ag(I) catalyzed SeNEx chemistry indicate its potential for an ideal DNA-compatible reaction development.
To our delight, by using 20 equivalents of AgNO3, 1000 equivalents of DIPEA, 1000 equivalents of boronic acid, and MeCN–ddH2O (6
:
1) as solvents (SI, Table S2, entry 1), the on-DNA SeNEx reaction proceeded smoothly at room temperature, affording the desired DNA-conjugated selenide D1 in 99% conversion (10 nmol scale), which was confirmed by the co-injection experiment (SI, Fig. S9). As delineated in Fig. 7, most of the tested boronic acids (multiple substitution types) reacted smoothly with DNA-conjugated BSEA Hpi1 to produce the corresponding DNA-conjugated selenides in good to excellent conversions (D1–D56), except for DNA-conjugated selenides D18 (46%), D36 (52%), and D56 (48%). Impressively, sterically hindered boronic acids (e.g., 2-acetyl phenylboronic acid) are known to be inferior coupling partners in on-DNA Suzuki coupling.74 However, in this on-DNA SeNEx reaction, a series of ortho-substituted boronic acids (D2–D9), including sterically hindered nitro (D3), acetyl (D4), methylsulfonyl (D5), isopropoxyl (D9) and even 2,6-dimethyl substituted (D35) boronic acid, are all well tolerated (90–99%). Notably, apart from halo, alkyl, and alkyloxy groups, functional groups (FGs) including aldehyde (D6, D39), ketone (D4), nitro (D3, D10, D23, D44, D56), cyano (D15, D24), carboxy (D27), thiophene (D29), furan (D30), free indole (D31), quinoline (D32), carbazole (D33) and pyridine (D34) are all well compatible. Besides, the on-DNA parallel synthesis between 9 DNA-conjugated BSEAs and 4 representative boronic acids (NO2, CN, CF3, and OCH3) also proceeded well, delivering the target selenides (D57–D92) in 80–99% conversions. Among the 92 tested on-DNA SeNEx reactions, 75 (82%) attained greater than 90% conversion, and 87 (95%) reached greater than 80% conversion. Moreover, a mock DEL construction was performed by using 3 cycles of chemistry including amidation, SeNEx, and Suzuki coupling. The desired conjugates D93–D96 were obtained in 63–85% conversions over 3 steps on-DNA synthesis. More importantly, the gel electrophoresis analysis of the enzymatic ligation of D1 with a 50-basepair primer showed complete ligation (SI, Fig. S10), and the qPCR results revealed that a DEL library has ca. 86% remaining amplifiable material under this DNA compatible SeNEx reaction, which is much higher than the threshold (30%) for practical DEL synthesis86 (SI, Fig. S11). In addition, we further conducted two sets of 1 × 2 × 2 (Fig. S12) and one set of 1 × 2 × 3 (Fig. S21) mock library synthesis,79 both using 3 cycles of chemistry including amidation, amidation and SeNEx chemistry. UPLC-MS data verified the structures of all corresponding members. Overall, this SeNEx click chemistry is a paradigm of both reaction-centric and product-centric on-DNA synthesis that features broad substrate scope and satisfactory DNA compatibility. Thus, it will have great application prospects in the synthesis of SeDEL for probing ultra-large Org-Se chemical world.
The data that support the findings of this study are available in the supplementary information (SI) of this article. Supplementary information: materials and methods, experimental procedures, characterization data, and NMR and MS spectra. See DOI: https://doi.org/10.1039/d5sc05512a.
Footnote |
| † These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2025 |