Zhoulong
Fan‡
a,
Shuai
Zhao‡
a,
Tao
Liu
a,
Peng-Xiang
Shen
a,
Zi-Ning
Cui
a,
Zhe
Zhuang
a,
Qian
Shao
a,
Jason S.
Chen
b,
Anokha S.
Ratnayake
c,
Mark E.
Flanagan
c,
Dominik K.
Kölmel
c,
David W.
Piotrowski
c,
Paul
Richardson
d and
Jin-Quan
Yu
*a
aDepartment of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA. E-mail: yu200@scripps.edu
bAutomated Synthesis Facility, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
cPfizer Medicinal Chemistry, Eastern Point Road, Groton, Connecticut 06340, USA
dPfizer Medicinal Chemistry, 10578 Science Center Drive, San Diego, CA 09121, USA
First published on 7th September 2020
DNA-encoded library (DEL) technology has the potential to dramatically expedite hit identification in drug discovery owing to its ability to perform protein affinity selection with millions or billions of molecules in a few experiments. To expand the molecular diversity of DEL, it is critical to develop different types of DNA-encoded transformations that produce billions of molecules with distinct molecular scaffolds. Sequential functionalization of multiple C–H bonds provides a unique avenue for creating diversity and complexity from simple starting materials. However, the use of water as solvent, the presence of DNA, and the extremely low concentration of DNA-encoded coupling partners (0.001 M) have hampered the development of DNA-encoded C(sp3)–H activation reactions. Herein, we report the realization of palladium-catalyzed C(sp3)–H arylation of aliphatic carboxylic acids, amides and ketones with DNA-encoded aryl iodides in water. Notably, the present method enables the use of alternative sets of monofunctional building blocks, providing a linchpin to facilitate further setup for DELs. Furthermore, the C–H arylation chemistry enabled the on-DNA synthesis of structurally-diverse scaffolds containing enriched C(sp3) character, chiral centers, cyclopropane, cyclobutane, and heterocycles.
Our recent development of a wide range of transformations of β-C(sp3)–H bonds demonstrates the potential for creating unprecedented diversity from simplicity (Fig. 1A).11 The attractiveness of using C–H arylation of carboxylic acid derived substrates to build DNA-encoded libraries is evident from a recent study where C–H activation reactions were performed in organic solvent and the DNA tags were subsequently attached individually. However, this approach does not allow encoding each single reagent in every reaction step during the split-pool synthesis, thus limiting the number of available building blocks and thereby the size of the library (Fig. 1B).12 In contrast, using C–H activation as a coupling step on DNA would allow much larger libraries to be constructed. Notably, despite the use of a very powerful directing group, C(sp3)–H activation reactions have not been successful in the presence of DNA thus far.13 We envision the development of on-DNA C(sp3)–H activation of different classes of monofunctional building blocks to participate in the key cycle for DEL buildup with enriched C(sp3) character, chiral centers, small rings and heterocycles. The reserved functional group could also be directly employed for further setup in DEL synthesis (Fig. 1C).
A broad range of carboxylic acids adjacent to quaternary carbon atoms are suitable for this chemistry, including those containing ethers or fluorine (2 to 14). However, acids adjacent to a secondary or tertiary carbon react with 26–28% yields (15 to 16). Importantly, cyclopropane- and cyclobutanecarboxylic acids (desirable as alkene isosteres) are competent coupling partners (17 to 21). The mechanism of the Pd-catalyzed β-C–H arylation is well known to give the cis stereoisomer for cyclic carboxylic acid exclusively. Importantly, these cis products are resistant to epimerization in the presence of strong bases.14b Our LCMS analysis was unable to determine whether chiral ligand L1 gave any absolute stereoinduction. Though we screened different silvers for achieving best yields for all of carboxylic acid substrates (see ESI†), silver trifluoroacetate gave acceptable yields in most of cases and can be used in split-pool synthesis for a DEL buildup.
The aryl iodide substitution pattern is flexible (22 to 28, 39 to 41). Although our previously-published reaction conditions in organic solvent could not couple heteroaryl iodides,14 we were optimistic that conditions that tolerate DNA—the bases of which contain nitrogen heterocycles—should allow coupling of heteroaryl iodides (e.g., pyridines and pyrazoles). This was indeed the case; heteroaryl iodides successfully reacted with carboxylic acids under the same reaction conditions (29 to 38, 42 to 44). Since aryl and heteroaryl iodides react under the same conditions, they can both be present in a split-pool synthesis of a DEL. In addition, the β-C(sp3)–H arylated product attached to DNA can undergo the second carboxylate-directed C(sp3)–H arylation to obtain the DNA-tagged product 1′. Furthermore, we selected product 24 to proceed off-DNA synthesis and demonstrate the reliability of the present on-DNA C–H reaction.
The carboxylic acids in the products from the above chemistry can be further derivatized (vide infra). One interesting possibility is coupling to chiral amino acids in order to enhance the chiral recognition potential of the resultant DNA-encoded libraries. We recognized an opportunity in these cases to develop a ligandless Pd-catalyzed β-C–H arylation. Rather than couple the free carboxylic acid and then amide couple, one could pre-synthesize amides derived from carboxylic acids and amino acids and take advantage of the presence of a bidentate directing group to run a more facile Pd-catalyzed arylation.15 Re-evaluation of the silver source, base, and co-solvent in the absence of ligand led to optimized conditions that afforded compound 45 in 69% yield (entry 1, Fig. 3). Palladium is essential for this transformation (entry 2). Silver salts and bases are not strictly required but have a significant impact on yield (entries 3 to 9). Surprisingly, we found this reaction also worked well at room temperature (entry 10).
Amides derived from cyclopropane- or cyclobutanecarboxylic acid and a broad range of α-amino acids reacted smoothly both heated and at room temperatures (45 to 56). Pd-catalyzed arylation proceeds only at the β-C–H bond from the amide; attempted coupling of Ac-L-Val-OH gave none of arylation product 59 indicating that α-arylation did not occur. Two LC peaks having same mass were due to the generation of diastereomers after the C–H arylation. To demonstrate this, we ran a representative off-DNA reaction to synthesize product 51. As expected, a mixture of diastereomers was observed, although the ratio is slightly lower due to the different reaction temperature. Although we focused on amides derived from α-amino acids and containing cyclopropyl or cyclobutyl rings, the chemistry can be extended to other alkyl carboxylic acids (57 and 58) and to β-amino acids (60). Diverse arene substitution patterns on the DNA-tethered aryl iodide were tolerated (61 to 65). As was the case for β-C–H arylation of carboxylic acids, the DNA-tolerant conditions for amide arylation can also be employed for coupling DNA-tethered heteroaryl iodides such as pyridines and pyrazoles (66 to 72). If desired, the carboxylic acid of the product can be further modified (73).
Having developed DNA-compatible C(sp3)–H arylations for carboxylic acids and amides, we turned our attention to ketones. Ketones are useful monomers for building DELs since they can be further elaborated via reductive amination. Guided by our previous work using aminooxyacetic acids as removable directing groups to recruit palladium to activate the β-C–H bond of ketones,16 we optimized a DNA-tolerant version of this reaction. The optimized reaction gave arylation product 83 in 62% yield (entry 1, Fig. 4). This reaction requires palladium and heat (entries 2 and 3) and is strongly influenced by the silver salt (entries 4–6); ligand and base play a lesser role (entries 7 and 8). We also found L8 could decrease the degradation of DNA and provide clean LC traces. Although a large excess of palladium is often associated with degradation of the DNA tag, this chemistry gives higher yield at 40 equiv. Pd/L (entry 1) than at 30 equiv. (entry 9).
Fig. 4 DNA-compatible C–H arylation of ketones. Unless otherwise noted, condition of entry 1 was used as the standard condition. For 89 and 90: Pd(OAc)2, 30 equiv.; L8, 30 equiv.; H2O/DMA (2/1). |
The above conditions affect β-C–H arylation in diverse settings, including on acyclic ketone derivatives (74 to 77, 84 to 89), at positions next to simple or complex rings (78 to 82, 90, 91), and on simple or complex rings (83, 92). A broad range of functional groups are tolerated, including esters, ethers, acetals, and amides. Ketone derivatives bearing β-quaternary centers can be γ-arylated (93). The DNA-tethered aryl iodide accepts different substitution patterns (94 to 97) and heteroaryl iodides can react under the same conditions (98 to 102). We also noticed that cis and trans-isomers of directing group were separated in the LC traces.
The ability to convert the oxime ethers back into ketones is critical for implementing this chemistry in a DEL buildup. We discovered that these oxime ethers readily hydrolyze in the presence of aniline and acetone (103), likely through equilibrium transimination with aniline and trapping of the free aminooxyacetic acid with acetone.17
Each of these C(sp3)–H activation reactions, as new disconnections for DEL synthesis, can incorporate unique structural motifs. The combination of multiple C(sp3)–H activations in DEL synthesis can further enhance the diversity. Hence, we embarked on a multi-step synthesis on DNA consisting of β-C–H arylation of a carboxylic acid, amide formation, β-C–H arylation of a masked ketone, and ketone deprotection (Fig. 5A) in order to demonstrate how these C–H activation chemistries can be combined to develop large DELs of diverse, drug-like compounds. A representative analog synthesis is shown in Fig. 5B. Thus, pyridyl iodide S18 and pivalic acid were coupled to form intermediate 31, and amide coupling with p-iodobenzyl amine then set up a second C–H arylation event. Coupling with a masked cyclobutyl ketone gave oxime ether 105. The above is only one of many sequences that one could design by using sequential multiple C–H arylation steps in combination with common DNA-encoded library-building steps such as amide formation or reductive amination. The use of multiple C–H activation reactions connects commonly used building blocks at different carbon centers hence providing complementary diversity of chemical space.
In order to evaluate DNA compatibility of the C–H activation chemistry with DEL synthesis, a select set of chemically modified on-DNA analogs (C–H arylation products 1, 45 and 83), along with their starting aryl iodide analog S1a were enzymatically ligated to a 65-mer dsDNA, so that the resulting oligomers were approximately equal in length to a encoding tag of a 3-cycle DEL build. All four ligation reactions proceeded smoothly, indicating that chemistry had no significant impact on encodability. In order to determine the amount of amplifiable DNA remaining after exposure to C–H activation conditions, the ligation products from 1, 45 and 83 were amplified by PCR and compared with that of S1a (untreated control). All three reactions showed satisfactory PCR viability (60–80% amplifiable DNA remaining). Moreover, Sanger sequencing reads also confirmed the integrity of their nucleobase sequence structures (see ESI†).
As purification is an inherently difficult process in split-pool synthesis, the reactivity required for DNA-compatible reactions must be devoid of unidentified byproducts that can complicate analysis. In this case, the main byproducts generated through our reaction platform consists only of starting material or its protodehalogenated derivative. Finally, we were able to obtain all products through our on-DNA C–H arylation platform in moderate and synthetically useful yields; higher than the threshold of 25% deemed practical in DEL synthesis.10f Gratifyingly also, we were able to obtain 60–80% DNA recovery from qPCR experiments, greater than the acceptable 30% threshold deemed practical in these processes.4c Altogether, these promising results further demonstrate the practicality of our DEL-compatible C(sp3)–H activation platform, enabling practitioners to rapidly generate structural complexity and diversity in a modular manner.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc03935g |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2020 |