 Open Access Article
 Open Access Article
Johannes Bingolda, 
Erik Mafenbayera, 
Wibke Langenkamp b, 
Lisa Lianga, 
Chun Zhanga, 
Malte Mildner
b, 
Lisa Lianga, 
Chun Zhanga, 
Malte Mildner c, 
Julia Isabel Bahnerde, 
Mohamed Akmal Marzouka, 
Bettina Böttcher
c, 
Julia Isabel Bahnerde, 
Mohamed Akmal Marzouka, 
Bettina Böttcher de, 
Ann-Christin Pöppler
de, 
Ann-Christin Pöppler c, 
Ralf Weberskirch
c, 
Ralf Weberskirch b and 
Andreas Brunschweiger
b and 
Andreas Brunschweiger *a
*a
aInstitute of Pharmacy and Food Chemistry, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany. E-mail: andreas.brunschweiger@uni-wuerzburg.de
bDepartment of Chemistry and Chemical Biology, Polymer Hybrid Systems, TU Dortmund University, Otto-Hahn Straße 6, 44227 Dortmund, Germany
cInstitute of Organic Chemistry, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
dRudolf-Virchow-Center, Julius-Maximilians-Universität Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany
eBiocenter, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
First published on 28th October 2025
Chemical diversification of DNA-conjugated substrates is key in DNA-encoded library (DEL) synthesis and other nucleic acid-based technologies. One major challenge to the translation of synthesis methods to DNA-tagged substrates is the lack of solubility of the highly charged DNA oligomer in most organic solvents. A neutral acrylate block copolymer, devoid of any canonical nucleic acid-binding structure, tightly interacted with DNA oligonucleotides in their ammonium form, and solubilized them in nonpolar solvents such as dichloromethane, chloroform and toluene. The ternary DNA–copolymer–ammonium salt interactions led to the formation of aggregates in organic solvents whose size correlated with DNA oligomer length. This method for DNA solubilization was successfully applied to diversify DNA-tagged starting materials by three isocyanide multicomponent reactions (IMCR) with broad scope and excellent yields. The copolymer does not require tailored DNA conjugates and solubilized DNA oligomers of up to 80 nucleotides length. It will likely broaden the toolbox of DEL-compatible synthesis methods well beyond IMCR chemistry and it has application potential in other nucleic acid-based technologies that require a broadened solvent scope for nucleic acid conjugate synthesis.
In the early times of DNA-encoded chemistry, DNA-templated reactions have been shown in a 95:5 mixture of acetonitrile and water.11 Pehr Harbury pioneered reversible immobilization of DNA tags by Coulomb interactions on ion-exchange solid phase. This approach was subsequently adopted and refined by several DEL research groups (Fig. 1B).12–15 It requires an immobilization and a high-salt desorption step, and the reaction conditions need to be compatible with the solid support. In the solution phase, cationic lipids like didodecyldimethylammonium bromide have been shown to form tight ion pairs with oligonucleotides (6mer, 14mer and 22mer) and to solubilize these ion pairs in polar organic solvents such as DMF, THF and DMSO for polymer conjugation reactions. This approach facilitated a photoredox reaction on DEL barcodes (Fig. 1C).5,16
Polymers and copolymers are a further class of structures that have been shown to solubilize DNA in organic solvents (Fig. 1D). For instance, block copolymers composed of a polyethylene glycol block and a cationic block formed nanoparticles with calf thymus DNA in polar organic solvents and benzene.17 Long, 10 kDa PEG polymers were covalently attached to DNA oligomers of up to 21 nucleotides length, including G-quadruplex DNA. These DNA PEG–polymer conjugates were soluble in dichloroethane and polar solvents,18 and they allowed for investigating DNA structures in organic solvents. DNA PEG-conjugates were shown to allow for amide couplings on DNA barcodes in polar organic solvents, but required a tailored DNA barcode-PEG conjugation strategy.19 Alternatively, DELs may be synthesized via covalently coupled barcodes on solid phases.20–24 These solid phase approaches may need tailored substrates, and they require a cleavage step if the libraries are not screened on the solid phase.25 This research work shows the ongoing interest of DEL and nucleic acid chemists in approaches to expand the solvent scope for reactions on nucleic acids. The very few approaches encourage investigating new avenues to “DNA in dichloromethane” for applications in nucleic acid-based technologies. Here, we deliver a conceptually new approach to solubilization of DNA oligomers in non-aqueous solvents for e.g. DNA-encoded chemistry that exploits ternary interactions between a DNA, a neutral block copolymer and selectable nitrogenous cations.
We have previously investigated poly(N,N-dimethylacrylamide)–poly(n-butyl acrylate) block copolymers (Fig. 1E) as micellar nanoreactors for DNA-encoded chemistry in aqueous solvents.26 These copolymers formed micellar structures in water and localized a sulfonic acid moiety either in the core or in the corona of the micelles. The copolymer micelles promoted Povarov and Groebke–Blackburn–Bienaymé reactions with DNA-tagged aldehydes. Investigations in this micellar reaction system revealed that the DNA oligomers tightly interacted with the copolymer, and indeed more than 99.9% of the DNA were associated with the copolymer micelles under low-salt conditions.27
The observation that DNA oligomers tightly interacted with the acrylate copolymer led us to hypothesize that the copolymer–DNA complexes might allow for exchanging the bulk solvent from an aqueous environment to a pure organic solvent without precipitation of the DNA oligomer. In this manuscript, we present our findings that the block copolymer shown in Fig. 1E, which is devoid of any canonical DNA-binding structures such as positive charges or intercalators, solubilized DNA oligomers of up to 80 nucleotides length as salts of organic amines in non-aqueous solvents. In organic solvents, the DNA oligomers tightly interacted with the copolymer, forming aggregates that correlated with DNA oligomer length. To investigate the potential of this DNA solubilization method for DNA-encoded chemistry, we selected hydrolysis-sensitive imine multicomponent reactions (MCRs) with slow reaction kinetics (Fig. 1E), i.e. reactions that do not show desirable properties for DEL synthesis.1,2,8,9 These MCRs are underdeveloped in the DEL context, but workhorse reactions in drug research, because they give access to a wide range of structurally diverse scaffolds from simple, readily available starting materials (Fig. 1F).28–31 Diverse DNA-tagged aldehydes were diversified in the copolymer–DNA interaction system in an operationally simple manner by Ugi- (U-4CR) and Ugi-azide (UA-4CR) isocyanide multicomponent reactions (IMCRs). To probe the compatibility of the copolymer system with an acid catalyst we investigated the Groebke–Blackburn–Bienaymé reaction (GBB-3CR). All reactions proceeded with high, in most cases even quantitative conversions of the DNA-tagged starting materials. DNA-ligation and sequencing experiments showed that the reaction conditions were DNA-compatible, and successful reactions with mixtures of substrates suggested DEL-compatibility.
The solubilization of DNA in chloroform by the copolymer was investigated by dynamic light scattering (DLS), NMR and transmission electron microscopy (TEM) analysis. DNA oligomers of different length were solubilized in chloroform or toluene with the copolymer, and indeed particle formation could be observed by DLS measurements in all experiments where copolymer and DNA were present (Fig. S13–S23). It should be noted at this point that the block copolymer alone did not form aggregates in the organic solvents chloroform, DCM, and toluene. Copolymer aggregation was solely found to be induced by the presence of DNA oligomers.
Curiously, the size of the particles was strictly depending on the length of the DNA oligomers, and we observed an approximate correlation between DNA oligomer length and particle size (Fig. S13–S23). For example, a 10mer DNA formed particles with an average diameter of 14 nm, a 20mer DNA formed particles with an average diameter of 35 nm, and a 40mer DNA formed particles with an average diameter of 74 nm. Even the counter ions showed a small, albeit measurable impact on particle size (Fig. S14–S16). Furthermore, we observed a measurable impact of amine substrates on the particle size for the U-4C, UA-4C, and GBB-3C reactions (reaction development vide infra) which hinted at a close DNA–copolymer–amine substrate interaction (Fig. S17).
The interaction of DNA and copolymer was investigated by 1H and 1H DOSY NMR (Fig. 2C, S24, S25 and Table S2). We selected a short hexathymidine (hexa-T) DNA for these experiments, because the signals of a nucleobase homomer can be readily detected even at the typically low DNA concentrations. The 1H NMR spectra of the pure polymer in CDCl3 (upper spectrum) and pure hexa-T in D2O (lower spectrum) were used for signal assignment and identification of relevant signal areas. The spectrum of the same concentration of hexT (TEA salt) incubated with 20 equivalents of the polymer in CDCl3 showed only weak and broadened signals for the hexT DNA (e.g. blue boxes around 7.6 ppm). This indicates that hexa-T was part of a larger aggregate, resulting in a shorter T2 relaxation and thus broader lines. Triethylamine signals could not be observed in the spectra due to overlap with the polymer signals. We then performed DOSY experiments. The signals highlighted by coloured boxes were chosen for the further analysis as they show little signal overlap. Fitting of the respective DOSY signal decay curves yielded values of 2 × 10−10 m2 s−1 for the pure DNA oligomer in deuterated water and 1.8 × 10−10 m2 s−1 for the pure polymer in deuterated chloroform. For the two mixtures with 50 and 100 nmol hexa-T and 20 eq. of polymer, the diffusion of the polymer was slightly lower (∼1.4 × 10−10 m2 s−1). Despite the low signal intensity, for the mixture with 100 nmol hexT, a diffusion coefficient for hexa-T itself could also be estimated. The diffusion coefficient was 2.9 × 10−11 m2 s−1 and thus much lower than the diffusion coefficient of hexa-T in water at the same concentration. Therefore, we concluded hexT to be part of larger aggregates (Table S2). Due to the 20-fold excess of copolymer, the diffusion value of the copolymer was faster than hexa-T, which was consistent with a population of polymer chains in solution and a population of polymer chains that formed aggregates with the hexa-T DNA. Because DLS and NMR analyses supported the formation of larger structures by the DNA–copolymer interaction, we used transmission electron microscopy (TEM) to gain insight into the shape of these structures. TEM pictures readily confirmed that the copolymer and both a 20mer and 80mer DNA aggregated in chloroform (Fig. 2D). The particles induced by the 80mer DNA appeared to be elongated aggregates compared to the more round-shaped structures formed by the 20mer DNA and copolymer, confirming the observation of the impact of DNA oligomer length on particle size. The particle sizes visible in the TEM pictures agreed with the DLS measurements (Fig. 2D).
We are focused on the development of a barcoding strategy that uses a chemically stabilized tag (csDNA), consisting of pyrimidine nucleobases and 7-deazaadenine, because this barcode tolerated acidic conditions for compound synthesis and protective group removal.21 Therefore, we used a short 10-mer pyrimidine (TC) DNA as a readily available surrogate of a chemically stabilized DNA barcode tag to optimize reaction conditions for the three IMCRs and to investigate the reaction scope. With optimized conditions in hand, the IMCRs were tested on barcode oligomers and with mixtures of both csDNA-tagged and native DNA-tagged substrates to show the compatibility of the reactions with genetic tags and pooled substrates. Finally, the DNA tags were post reaction ligated, PCR-amplified and sequenced to show compatibility with a DNA barcoding process.
| No. | Solventa | Eq. (2a, 3a, 4a) | T (°C) | Cat. | Conv. (%) | 
|---|---|---|---|---|---|
| a Alcohol/CHCl3 (3:1, vol/vol).b All reactants were added simultaneously, and the reaction was run for 48 h. | |||||
| 1 | MeOH/CHCl3 | 250 | 50 °C | — | >95% | 
| 2 | MeOH/CHCl3 | 500 | 50 °C | — | >95% | 
| 3 | MeOH/CHCl3 | 1000 | 50 °C | — | >95% | 
| 4 | MeOH/CHCl3 | 2000 | 50 °C | — | >95% | 
| 5 | MeOH/CHCl3 | 2000 | 37 °C | — | >95% | 
| 6 | MeOH/CHCl3 | 2000 | 25 °C | — | >95% | 
| 7 | MeOH/CHCl3 | 2000 | 25 °C | ZnCl2 | >95% | 
| 8 | MeOH/CHCl3 | 2000 | 25 °C | MgCl2 | >95% | 
| 9 | MeOH/CHCl3 | 2000 | 25 °C | FeCl2 | >95% | 
| 10 | EtOH/CHCl3 | 2000 | 50 °C | — | >95% | 
| 11 | iPrOH/CHCl3 | 2000 | 50 °C | — | >95% | 
| 12b | CF3CH2OH/CHCl3 | 3000 | 60 °C | — | >95% | 
The DNA-tagged aldehyde 1a was condensed with the aniline 2a for imine formation, and then both the isocyanide 3a and the carboxylic acid 4a were added to the preformed imine. Already in the first experiments we obtained the target product with full conversion, and without side product formation. The U-4CR reaction was highly robust to several conditions. These included variation of starting material equivalents, reaction temperature, solvents, and addition of a few Lewis acid catalysts to the reaction (Table 1). In all experiments full consumption of the DNA-tagged aldehyde and clean conversion to the target Ugi dipeptide could be observed. Conveniently, the starting materials could be added simultaneously, i.e. without the separate imine formation step (Table 1, entry 12). This finding encouraged investigations into a broad substrate scope, including sterically hindered and electron-rich substrates (Fig. 3, extended scope in Table S4). First, several aldehydes were coupled to the TC-DNA (TC-1a–TC-1l). The scope of arylaldehydes was focused on sterically hindered ortho-substituted substrates (TC-1b-1f and TC-1i and TC-1j) and included substrates with low reactivity such as electron-rich indol-3-carbaldehydes (TC-1k and TC-1l).
All these DNA-conjugates were successfully reacted with 4-ethylaniline 2a, cyclohexylisocyanide 3a, and hexanoic acid 4a, giving the target Ugi dipeptides with quantitative consumption of the DNA-tagged aldehydes. In the next step, we varied simultaneously and with the same success both the aldehyde and the amine component, including aliphatic bifunctional amines (TC-5m–TC-5o). Then, 4-isocyano-N-Boc-piperidine 3b was validated as a bifunctional starting material which would allow for a further synthesis cycle, by e.g. carbonyl chemistry, starting from TC-5p-5q. The product TC-5r contained a tert-butyl protected dicarboxylic acid, which could be diversified by reverse amide synthesis. Finally, DNA-tagged aldehyde TC-1a was reacted with gabapentin and isocyanide 3a yielding the target lactams TC-5s and TC-5t by Ugi four-center three-component reaction (U-4C-3CR). Furthermore, diverse carboxylic acids, including a Boc-protected amino acid (TC-5v) were productive substrates for the U-4C reaction and gave the desired products with excellent to quantitative conversion rates (83% → 95%) (TC-5u–TC-5y, see SI part for an extended substrate scope). Taken together, the U4-CR was in the copolymer reaction system a very robust reaction regarding the reaction conditions and the substrate scope which numbered more than 50 examples and included aldehyde starting materials with low reactivity, starting materials with low solubility in aqueous solvents (e.g. Fmoc-protected building blocks), and diverse starting materials with protected functional groups for further library synthesis.
| No. | Solventa | Eq. (3a, 6a, 7) | T (°C) | Cat. | % conv. | 
|---|---|---|---|---|---|
| a Alcohol/CHCl3 (3:1, vol/vol).b All reactants were added simultaneously, and the reaction was run for 48 h. | |||||
| 1 | MeOH/CHCl3 | 250 | 50 °C | — | 61% | 
| 2 | MeOH/CHCl3 | 500 | 50 °C | — | 70% | 
| 3 | MeOH/CHCl3 | 1000 | 50 °C | — | 67% | 
| 4 | MeOH/CHCl3 | 2000 | 50 °C | — | 90% | 
| 5 | MeOH/CHCl3 | 2000 | 37 °C | — | 87% | 
| 6 | MeOH/CHCl3 | 2000 | 25 °C | — | 67% | 
| 7 | MeOH/CHCl3 | 2000 | 25 °C | ZnCl2 | 51% | 
| 8 | MeOH/CHCl3 | 2000 | 25 °C | MgCl2 | 38% | 
| 9 | MeOH/CHCl3 | 2000 | 25 °C | FeCl2 | n.d. | 
| 10 | EtOH/CHCl3 | 2000 | 50 °C | — | 67% | 
| 11 | iPrOH/CHCl3 | 2000 | 50 °C | — | 38% | 
| 12b | CF3CH2OH/CHCl3 | 3000 | 60 °C | — | 25% | 
All these DNA-tagged conjugates were successfully reacted with cyclohexyl isocyanide 3a, piperidine 6a and TMS-N3 7 with conversion rates of 70% → 95%. Then, we varied the secondary amine component and DNA-aldehyde TC-1g was reacted with amines 6b–6g, isocyanide 3a and TMS-N3 7 (Fig. 3 and Table S7). The set of amines also included several bifunctional amines that were mono-Boc-protected. These would allow for a further DEL-synthesis cycle by e.g. carbonyl chemistry. We observed conversion rates of >95% in most cases. Only a few building block combinations gave lower conversions which were still synthetically useful for DEL synthesis (e.g. TC-8n, 58% conversion).
| No. | Solventa | Eq. (3a, 9a) | T (°C) | Cat.b | % conv. | 
|---|---|---|---|---|---|
| a Alcohol/CHCl3 (7:1, vol/vol).b 1% of acetic acid. | |||||
| 1 | MeOH/CHCl3 | 250 | 25 °C | AcOH | 81% | 
| 2 | MeOH/CHCl3 | 500 | 25 °C | AcOH | 89% | 
| 3 | MeOH/CHCl3 | 1000 | 25 °C | AcOH | >95% | 
| 4 | MeOH/CHCl3 | 2000 | 25 °C | AcOH | >95% | 
| 5 | MeOH/CHCl3 | 2000 | 15 °C | AcOH | 95% | 
| 6 | MeOH/CHCl3 | 2000 | 5 °C | AcOH | >95% | 
| 7 | EtOH/CHCl3 | 2000 | 25 °C | AcOH | >95% | 
| 8 | iPrOH/CHCl3 | 2000 | 25 °C | AcOH | 95% | 
The substrate scope of the GBB-3R was first tested with 10 diverse DNA-tagged aldehydes (TC1a–c, e, f, h–j, m, n), cyclohexylisocyanide 3a, and 2-aminopyridine 9a (Fig. 5). Again, we placed emphasis on ortho-substituted aldehydes to probe steric hindrance. Electron-rich indol-3-carbaldehydes that react sluggishly but are attractive structures from a compound library screening perspective were tested as well. We observed quantitative conversions to the target heterocycles TC-10a–j with all these substrate combinations, even with the more challenging aldehydes. The isocyanide component was exchanged to a Boc-protected piperidine, giving the option of a further scaffold diversification step (TC-10k and TC-10l). These two compounds were obtained quantitatively, too. As the last step, we also varied the heteroaromatic amine (azole component) 9. Here, building blocks 9b–9e with hydrolysis-sensitive functional groups were tested, which were inaccessible in the previously published solid-phase approach, because of the cleavage step with methylamine.38 These building blocks contained either a chloro-substituent, which could be substituted by a nucleophile (9b, 9c), or a methyl ester (9d, 9e) in ortho- or para-position. As expected, the heterocycles TC-10m–TC-10p were obtained with quantitative conversions in the non-aqueous reaction medium without side-reactions, and the functional groups remained intact. These compounds showed that the solution phase GBB-3R tolerated inclusion of different diversifiable functional groups for a future library design. Finally, we diversified the core heterocyclic scaffold and synthesized imidazopyrazines, TC-10q and TC-10r with quantitative conversion, too. To synthesize further core scaffolds, we tested five-membered aminoazoles 4H-1,2,4-triazol-3-amine, and 1,3,4-thiadiazol-2-amine which afforded the annulated 5 + 5 ring systems TC-10s and TC-10t with quantitative conversion.
In the copolymer reaction system, the GBB-3CR showed excellent robustness to different reaction conditions. A diverse scope of starting materials, including 10 different DNA-tagged aldehydes, functionalized building blocks such as a Boc-protected isocyanide substrate and halide- as well as ester-substituted aminopyridines was productive and furnished the desired heterocyclic products in straightforward manner and with almost quantitative conversions in all cases.
We selected two DNA barcodes. These were a chemically stabilized 14mer 7deATC barcode that consisted of pyrimidine nucleosides and 2′-deoxy-7-deazaadenosine (denoted as 7dATC), and an ATGC sequence that contained all four native nucleosides (denoted as ATGC). Each four carbaldehyde building blocks were coupled to both sequences giving 7dATC-1–4 and ATGC-1–4. These conjugates were reacted under the same conditions of the GBB-3CR, U4C-R, and UA-4CR as previously used for the TC oligomer (Fig. 6A and Tables S10–S24). In the case of the GBB-3CR, the reaction outcome of the TC-coupled starting materials could be reproduced with the barcode sequences, i.e. the DNA-tagged aldehydes were fully converted to the desired imidazopyridine (Tables S10–13). The U-4CR (Tables S15–S18), and UA-4CR (Tables S20–23) gave synthetically useful 60–70% conversion to the target compounds under the conditions that gave full conversion to the corresponding pyrimidine DNA conjugates TC-5 and TC-8. For the U-4CR, we extended the reaction time to 48 h, increased the temperature to 60 °C and substituted methanol by trifluoroethanol. Gratifyingly, these conditions allowed us to add all reactants as one single mixture to the DNA-tagged aldehyde, i.e. we could omit the separate imine formation step, and the reaction conversions for the U-4CR to were improved to 85%. In all cases, the DNA oligomers stayed intact, i.e. we did not observe formation of 8-oxopurines or abasic sites, which would be clearly visible by mass-spectrometric analysis.
To investigate the application potential of the U-4CR, UA-4CR and GBB-3CR for DNA-encoded library synthesis, the three reactions were performed with DNA-tagged mixtures of each four aldehyde starting materials as described in Fig. 6B–D. To simulate DEL synthesis, the starting materials were pooled, and the reaction was carried out according to the optimized conditions. These mixture experiments delivered the expected products, as the calculated masses of all products were confirmed by MALDI-TOF-MS and the reaction conversions were in the range of 80–90% for the GBB-3CR (Table S14) and the U-4CR (Table S19), and approximately 80% for the UA-4CR (Table S24).
For further proof of the compatibility of IMCR chemistry with the DNA-barcoding process, we ligated primer sequences to exemplary 7dATC-tagged GBB-3CR-, U-4CR-, and UA-4CR-products, amplified the DNA barcode by PCR, performed qPCR analysis (Fig. S67–S75) and sequenced the amplicon by Sanger sequencing (Fig. 6D, S76 and S77). The sequencing result showed that the barcode was intact. Thus, the reaction conditions of the IMCRs did not lead to nucleobase deamination, a lesion that can be detected better by sequencing than by mass spectrometric analysis. The successful DNA barcode ligation, amplification and sequencing results in combination with the high-yielding IMCR chemistry gave experimental evidence for the utility of the copolymer reaction system for DNA-encoded libraries that are designed by IMCR chemistry.
The DNA solubilization method that we call CECOS “copolymer-mediated encoded chemistry in organic solvents” encourages further research in the design of ternary systems consisting of a nucleic acid oligomer, a counterion and a (co)polymer to broaden the solvent scope for nucleic acid solubilization. Currently, we assume based on the NMR and TEM analyses that the aggregates are composed of a core-like structure that contains the polar polyacrylamide part of the polymer, the DNA oligomer, and the amine, while a shell-like structure surrounding the core is formed by the hydrophobic poly-butyl ester. This hypothesis of the aggregate structure might explain the exclusive solubility of the aggregates in non-polar solvents. Investigations in the hydrophilic–lipophilic balance of the copolymer will be done to improve our understanding of the aggregates and possibly extent the solvent scope. CECOS offers visible potential for reaction development and application in DEL synthesis. As the copolymer system does not require tailored DNA barcodes or substrates and tolerated 6 nt- to 80 nt-long DNA oligomers, it is likely compatible with different DNA-barcoding strategies. The copolymer system may be used in further applications that require nucleic acids to be dissolved in organic solvents.
| This journal is © The Royal Society of Chemistry 2025 |