Léa
Bouton
ab,
Agathe
Ecoutin
ab,
Florian
Malard
*ab and
Sébastien
Campagne
*ab
aInserm U1212, CNRS UMR5320, ARNA Laboratory, University of Bordeaux, 146 rue Léo Saignat, 33076 Bordeaux Cedex, France. E-mail: florian.malard@inserm.fr; sebastien.campagne@inserm.fr
bInstitut Européen de Chimie et de Biologie, F-33600 Pessac, France
First published on 11th January 2024
In eukaryotic cells, RNA splicing is crucial for gene expression. Dysregulation of this process can result in incorrect mRNA processing, leading to aberrant gene expression patterns. Such abnormalities are implicated in many inherited diseases and cancers. Historically, antisense oligonucleotides, which bind to specific RNA targets, have been used to correct these splicing abnormalities. Despite their high specificity of action, these oligonucleotides have drawbacks, such as lack of oral bioavailability and the need for chemical modifications to enhance cellular uptake and stability. As a result, recent efforts focused on the development of small organic molecules that can correct abnormal RNA splicing event under disease conditions. This review discusses known and potential targets of these molecules, including RNA structures, trans-acting splicing factors, and the spliceosome – the macromolecular complex responsible for RNA splicing. We also rely on recent advances to discuss therapeutic applications of RNA-targeting small molecules in splicing correction. Overall, this review presents an update on strategies for RNA splicing modulation, emphasizing the therapeutic promise of small molecules.
Fig. 1 Principles of RNA splicing and correction with ASO splicing switches. (A) RNA splicing in eukaryotes. Non-coding introns (lines) are removed and coding exons (boxes) are connected. Regulation can produce distinct protein isoforms via alternative splicing. (B) Topology of snRNPs. An example is given with U1 snRNP which contains U1 snRNA (lines), three U1-specific proteins (U1-A, U1-C and U1-70K) and seven shared Sm proteins (Sm B/B', Sm D1, Sm D2, Sm D3, Sm E, Sm F and Sm G). In all spliceosomal snRNPs, the Sm proteins are assembled in a ring around the Sm site of snRNAs. In particular for U1 snRNP, the 5′ end of U1 snRNA hybridizes with the 5′ splice site (red line). (C) Splicing reaction. The pre-mRNA needs two exons (boxes) separated by an intron (line) with defined 5′ and 3′ splice sites (5′ss, 3′ss) and a branch point (BP) adenosine. The spliceosome includes five snRNPs (U1/2/4/5/6) and auxiliary factors like SF1 and U2AF.1,2 (D) ASO in splicing correction. ASOs can adjust the usage of 5′ splice sites by masking cis-regulatory sequences. Exon skipping occurs with enhancer-targeting ASOs, while silencer-targeting promotes exon inclusion.18,19 |
Correct gene expression is closely tied to physiological splicing patterns. These patterns can vary depending on the tissue type and on the developmental stages, as well as in other situations.5–7 Mutations in certain trans-acting splicing factors, specifically those that act in a tissue-specific manner, can disrupt these physiological patterns.8 In cancer, some of these splicing factors can function as either oncogenic proteins or as tumor suppressors. Mutations in these factors are often linked to the onset and progression of cancers.9–11 Moreover, both germline and somatic mutations can alter cis-regulatory sequences within pre-mRNA. This can lead to incorrect splicing patterns. Such patterns can underlie various diseases. For example, aberrant splicing is implicated in inherited conditions like Huntington's disease, a rare neurdegenative disorder.12 It is also involved in specific types of breast and colorectal cancers, among others.13–16
The therapeutic modulation of RNA splicing emerged as a promising avenue for the treatment of several diseases previously refractory to medical intervention.2,17 Historically, antisense oligonucleotides (ASOs) have been used to target cis-regulatory sequences in the pre-mRNA (Fig. 1D).18–20 In this paradigm, RNA analog sequences are designed to hybridize with complementary cis-acting regulatory elements on the target pre-mRNA. By doing so, ASO drugs can inhibit exon inclusion events by masking a splicing enhancer cis-acting regulatory sequence, hence preventing the recruitment of an activator trans-acting splicing factor (e.g., eteplirsen19). Complementary, ASOs can also anneal with splicing silencer cis-acting regulatory sequences, thereby promoting exon inclusion (e.g., nusinersen18). Although ASOs have manifested efficacy in clinical contexts, their deployment typically necessitates more intensive therapeutic protocols, including recurrent injections in the hospital setting.2,21
Over the past decade, the pharmacological modulation of RNA splicing using small organic molecules has emerged as a viable and innovative complement to traditional methodologies.2 A significant advantage of this approach lies in its therapeutic administration, because these molecules are orally bioavailable and do not necessitate specific clinical environments for administration.22,23 These small molecules can be designed to target an array of actors within the context of splicing, from spliceosome components to diverse trans-acting splicing factors, and even RNA targets themselves, including both regulatory and precursor messenger RNAs.24–28 In particular, the task of designing small molecules that specifically bind to RNA and modulate splicing is a substantial challenge. Indeed, RNA molecules have markedly fewer hydrophobic surfaces relative to proteins and they present more restricted binding pockets.29,30 Yet, they remain amenable to drug targeting by exploiting their distinct three-dimensional conformations and inherent sequence element specificity.31,32
In this comprehensive review, we examine small molecules that can influence RNA splicing. We group and discuss them based on their target types: the spliceosome, trans-acting splicing factors, and RNA targets. For each target type, we describe its specific role and highlight key small molecules that can adjust splicing by interacting with that target. We finally discuss future perspectives with respect to each target type. Overall, our main goal is to picture the landscape of innovative strategies aiming to modulate RNA splicing with small molecules in the context of diseases.
Fig. 2 Small molecules targeting the spliceosome to modulate splicing. (A) Crystal structure of the human SF3b core complex.35 The region for binding of inhibitors is highlighted (red dashed frame) and corresponds to the close-up view shown in (C). (B) Chemical structure of molecules targeting the SF3b complex. (C) Crystal structure of the SF3b complex bound to the splicing inhibitor pladienolide B.37 (D) UHM–ULM interactions in pre-spliceosomal complexes. ULM ligands (W-containing yellow circles) and UHM domains (red-contoured circles) are highlighted. (E) Chemical structure of molecules targeting UHM-containing proteins. (F) Crystal structure of PUF60-UHM in complex with 7,8-dimethoxyperphenazine.67 (G) Inhibition of PRMT5 during snRNP biogenesis. The 20S methylosome (PRMT5-containing complex) catalyzes the arginine-methylation of Sm proteins D1, B/B′ and D3 using S-adenosylmethionine (SAM) as methyl-donor. Upon nuclear export of snRNA, the Sm site is the location around which mature Sm proteins are assembled in a ring. |
Pladienolides originate from natural products that contain a large macrocyclic lactone ring, so they are referred to as macrolides (Fig. 2B). Pladienolides include molecules such as pladienolides A–G, H3B-8800,38 and E7107.39,40 Correspondingly, herboxidienes are also of natural origin and they are polyketide compounds containing a tetrahydropyran ring and an epoxy group that is important for splicing inhibition (Fig. 2B).41 Finally, the spliceostatins group comprises spliceostatin A, sudemycins C, D1, D6, E, F, meayamycin B, and other synthetic compounds.42–45 Spliceostatins contain two functionalized tetrahydropyran rings joined by a diene moiety and an acyclic side chain linked with an amide bond (Fig. 2B).46 Pladienolides, herboxidienes and spliceostatins are highly cytotoxic and they are reported to exhibit strong antiproliferative effects against multiple cancer cell lines.47–49 These splicing inhibitors target SF3b1 within the SF3b complex, but they also interact with PHF5A, another component of the SF3b complex.37,49–52 Interestingly, mutations of Y36 in PHF5A and of K1071, R1074, V1078 in SF3b1 can be associated with resistance to these inhibitors, suggesting a shared binding site.51,53 These resistance-related mutations are localized around the branch point adenosine binding pocket, suggesting a competitive mechanism of inhibition that locks the SF3b complex into an open, inactive conformation.37,51 Furthermore, all-atom simulations of SF3b in apo form and in complex with splicing modulators not only predicted these results but also revealed the impact of recurrent drug resistance and sensitising mutations in SF3B1 or PHF5A on the functional dynamics of SF3b1.53
In the crystal structure of the human SF3b core in complex with pladienolide B, the molecule is accommodated within an hourglass-shaped tunnel that matches the binding site of the branch point adenosine (Fig. 2C).37 Accordingly, the cryogenic electron microscopy (cryo-EM) structure of the SF3b complex bound to the pladienolide derivative E7107 showed that the drug also binds to the branch point adenosine binding pocket.52 Notably, E7107 closely interacts with two residues, R1074 in SF3b1 and Y36 in PHF5A. When these residues are altered, resistance occurs. Overall, pladienolide B and E7107 stall SF3b in an open conformation, preventing the formation of a stable complex with the duplex between the branch point and U2 snRNP.37,52 E7107 was the first spliceosome inhibitor to enter a clinical trial but was discontinued due to adverse effects, including vision loss for two participants.54 Since then, the pladienolide derivative H3B-8800 was discovered as an orally administered modulator of the SF3b complex, which had shown specificity and efficacy in targeting spliceosome-mutant cells in murine models of various cancers (Fig. 2B).38 Interestingly, H3B-8800 induced preferential lethality in cancer cells bearing the frequent K700E mutation in SF3b1 by triggering aberrant splicing.55,56 H3B-8800 entered a clinical trial but, despite a more favorable safety profile compared to E7107, failed to recapitulate the murine experiments with no complete or partial responses observed.57 Despite the chemical similarities between H3B-8800 and E7107, and the assumption that they act through a similar mode of action, the mechanisms underlying their distinct efficacy and safety profiles require further investigations.
In summary, the discovery and development of spliceosome inhibitors such as pladienolides, herboxidienes, and spliceostatins, have highlighted the therapeutic potential of targeting the SF3b complex for cancer treatments. The structural insights from both crystallography and cryo-EM studies have advanced our understanding of how these inhibitors bind and inhibit the SF3b complex, providing a structural basis for further developments in this field.37,52 Despite the initial setback in clinical trials with E7107 and H3B-8800,54,57 the development of safer and more effective spliceosomal inhibitors is underway and could hopefully lead to new options in cancer-therapy.
Using virtual screening on a large small molecule database and in vitro competition assays, a novel small molecule, UHMCP1, was identified (Fig. 2E).65 UHMCP1 efficiently blocks the SF3b1/U2AF2 interaction in the pre-spliceosome complex A, as supported by both NMR spectroscopy analyses and molecular dynamics simulations.65 The effect of UHMCP1 on RNA splicing modulation is correlated with antiproliferative properties, which makes the compound a potential novel anticancer drug candidate. UHMCP1 was shown to interact with the hydrophobic pocket of U2AF2-UHM domain.65 The molecule features a polycyclic group that binds to the tryptophan binding pocket of UHM, analogous to the indole group of the ULM tryptophan residue.61,65 In the structural model of the complex, the tail of UHMCP1 extends towards loops connecting β1 to α1 and α2 to β4 in U2AF2-UHM domain, closely resembling to the binding mode of the native ULM peptide.61,65 In addition to spliceosomal SF1, U2AF1 and U2AF2, the splicing factors RBM39, SPF45 and PUF60 also contain an UHM domain that can be targeted with small molecules. In a recent study, the binding activity profiles of UHMCP1 derivatives were determined using binding assays.66 Unmodified UHMCP1 showed IC50 values of 74.85 μM for SPF45-UHM, 159.35 μM for RBM39-UHM, 175.61 μM for PUF60-UHM and 239.9 μM for U2AF35-UHM, highlighting the limited selectivity of UHMCP1 to target UHM domains.66
Phenothiazines have been identified as broad inhibitors of UHM–ULM interactions (Fig. 2E).67 Specifically, derivatives of 7,8-dihydroxyperphenazine can prevent the formation of the pre-spliceosome complex A, leading to a stalled spliceosome. NMR spectroscopy studies confirmed that these compounds bind to the tryptophan binding pocket within the UHM domain of SPF45.67 This is further supported by a co-crystal structure that shows 7,8-dimethoxyperphenazine bound to the UHM domain of PUF60 (Fig. 2F).67 In this structure, the phenothiazine moiety fills the tryptophan binding pocket of PUF60-UHM, which is slightly extended compared to the recognition of ULM peptide in order to fit the three aromatic rings of the phenothiazine. The hexagonal ring at the 7,8-dimethoxy position engages in π-stacking with F534 in the PUF60-UHM domain. Additionally, the N14-amine group of the piperazine part interacts with the E483 side chain. This bond is further stabilized by an intramolecular electrostatic interaction with the side chain of K486. Compared to UHMCP1, 7,8-dimethoxyperphenazine showed lower IC50 values with 10.4 μM for SPF45-UHM and less specificity with 10.4 μM for PUF60-UHM and 9.2 μM for U2AF2-UHM.67 Other derivatives reported in the study showed reduced or increased binding activities but without convincingly improving drug specificity.
In summary, small molecule UHM inhibitors, together with synthetic ULM peptides,68 modulate UHM–ULM interactions. Despite challenges in target specificity, these inhibitors can block early spliceosome assembly by affecting interactions between components like SF1, U2AF1, U2AF2, and specific trans-acting splicing factors. They can be used as investigative tools to study the role of UHM-mediated interactions in splicing, awaiting the development of more selective UHM inhibitors.
Selective inhibitors of PRMT5 were developed, showing in vitro and in vivo potency in lymphoma-derived cells (Fig. 2G).72 Antiproliferative effects and cell death were observed with IC50 values in the nanomolar range for the reported GSK3235025 and GSK3203591 compounds.72,73 Importantly, PRMT5 inhibition was monitored through levels of Sm D3 methylation, which means that decreased methylation of Sm proteins was correlated with anti-tumor effect upon anti-PRMT5 treatment.72,73 Structural studies highlighted the importance of the tetrahydroisoquinoline core of the designed ligands for cation–π interactions, as shown in the crystal structure of PRMT5/MEP50 in complex with GSK3235025.72 Even though PRMT family members regulate the activity of many proteins, including splicing factors, it is known that inhibition may affect splicing activities.74,75 Moreover, decreased Sm protein methylation correlates with antiproliferative effects, which encourages to investigate how strong is the causal link between inhibition of snRNP biogenesis itself and antiproliferative effect of PRMT5 inhibitors.
Yet, RNA-binding proteins are the most enriched cellular substrates of PRMTs and inhibition of RNA splicing as a whole underlies the cytotoxic effects of PRMT inhibitors.76 Many splicing factors can be methylated by PRMT family members and mutations can result in non-physiological splicing patterns that contribute to diseases. In acute myeloid leukemia cells, mutations in splicing factors SRSF2, SF3b1 and U2AF35 can sensitize cells to PRMT1 inhibition with MS023,77 PRMT5 inhibition with GSK3203591,73 or both.76 A combined treatment using type I PRMT inhibitors (e.g., GSK3368715) with PRMT5 inhibitors has shown superior tumor growth suppression both in vitro and in vivo than using either inhibitor in isolation.76,78,79 Protein profiling reveals that separate treatments target different protein groups, while combined treatments impact a more extensive protein selection, potential substrates for both PRMT types. Transcriptomic studies consistently showed that combined treatments lead to an increase in aberrant splicing events compared to individual treatments.76,78
In summary, PRMT5 inhibition with small molecules result in splicing defects associated with antiproliferative and cytotoxic properties in cancer cells. Whether the inhibition of snRNP biogenesis is a sufficient route to achieve this effect remain an open question, as it remains highly intricate with changes in the methylation of splicing factors as a whole, and subsequent effect on splicing and cell viability.
SRPKs form a distinct subfamily of kinases, notable for phosphorylating serine residues within ser–arg and arg–ser dipeptide motifs. This phosphorylation is essential to both constitutive and alternative mRNA splicing.83 SRPK1, in particular, is a key modulator of pro-angiogenic VEGF splicing, achieving this through the phosphorylation of serine-rich splicing factor 1 (SRSF1), which then associates with VEGF pre-mRNA.84VEGF pre-mRNA undergoes alternative splicing to produce two major VEGF isoforms. The pro-angiogenic isoform is generated when exon 8a is included, a process promoted by the binding of phosphorylated SRSF1 to a specific regulatory element in exon 8a of the pre-mRNA (Fig. 3A).85 SRPK1 knockdown via a lentiviral approach has been shown to reduce VEGF-mediated angiogenesis in vivo. Pharmacological inhibition of SRPK1 selectively decreases the expression of pro-angiogenic VEGF isoforms, while sparing the anti-angiogenic variants.86 The kinase catalytic domain in SRPK family members features a unique domain insert within the kinase hinge region. Structural studies of SRPK kinases reveal a helix in this insertion domain, positioned parallel to the kinase hinge, offering a potential anchoring site for inhibitors.84 This structural feature has been used to develop inhibitors of SRPK family members. SRPIN340, an inhibitor characterized by a trifluoromethylphenyl group essential for kinase inhibition, has shown efficacy in inhibiting the kinase activity of SRPK1 (Fig. 3A and D).84In vivo studies in a melanoma mouse model demonstrated the antiproliferative effects of SRPIN340 on tumor cells upon peritumoral administration (IC50 = 0.96 μM).86 Furthermore, overexpression of SRPK1 in leukemia models and the subsequent treatment with SRPIN340 led to both early and late apoptotic events.87 Finally, the SRPIN340 derivative SPHINX, exhibiting more potent SRPK1 inhibition (IC50 = 0.58 μM) and improved selectivity over SRPIN340, underscores the importance of pursuing the development of highly selective SRPK1 inhibitors.84
Similarly, CLKs (Cdc2-like kinases) are essential in regulating mRNA splicing through the phosphorylation of SR-rich domains in splicing factors, which leads to the production of protein isoforms crucial for cell growth and survival.88 CLK targets include SR proteins, SPF45, and U1-70k. CLK-mediated phosphorylation results in increased alternative splicing of certain genes, such as the exclusion of exon 6 in FAS and skipping of exon 6.3 in CCNL2. Typically, this phosphorylation process is guided by the R-x-x-S/T consensus sequence.88 Chlorexidine, a member of the cationic bisbiguanide class with antimicrobial properties, has been identified as a potent modulator of SR protein-mediated alternative splicing. It achieves this by selectively inhibiting CLK family members that phosphorylate SR protein splicing factors.89,90 Exon array experiments revealed that chlorexidine induces significant alternative splicing changes, affecting 1 444 transcripts, including those of RON, caspase 9, JAK3, and HIV Tat2-3.90 In the context of HIV-1, CLK1 has been shown to enhance the expression of HIV-Gag genes, which are crucial for the assembly, release, and maturation of virus particles. The inhibitory effect of chlorexidine on CLK1 leads to altered HIV-1 RNA processing, significantly reducing HIV-1 Gag processing and virus replication.91 Other notable CLK inhibitors, such as TG-003 and TG693, have been effective in suppressing CLK1 kinase activity, resulting in splicing alterations both in vitro and in vivo. These compounds have been shown to facilitate the exclusion of the mutated exon 31 of dystrophin in Duchenne muscular dystrophy (DMD), while preserving the wild-type protein expression. This action promotes the production of functional dystrophin protein, offering potential therapeutic benefits.92,93
In summary, the exploration of small molecule inhibitors targeting kinases essential for RNA splicing, particularly SRPKs and CLKs, has opened new therapeutic avenues. The mechanism of SRPK1-mediated SRSF1 phosphorylation and its regulation of VEGF splicing, alongside the potential of inhibitors like SRPIN340 and SPHINX, offers a promising strategy for treating angiogenesis-related conditions and cancer. Concurrently, CLK inhibitors, including chlorexidine, TG-003, and TG693, are showing promise in modifying alternative splicing patterns, presenting potential treatments for conditions like HIV-1 and DMD.
In the context of splicing repressors, particularly the heterogeneous nuclear ribonucleoproteins (hnRNPs), several inhibitory small molecules have been identified. hnRNP A1, one of these targets, is known for its ability to bind with high affinity to the UAGGGA/U consensus sequence present within pre-mRNA exons.95 When it binds to the pre-mRNA, hnRNP A1 acts as an exportin, interacting with the nuclear pore complex to facilitate the export of the pre-mRNA into the cytoplasm.26 Quercetin, a phytochemical from the flavonoid class, has been found to inhibit hnRNPs.96,97 This compound has shown effectiveness in various cancers, including breast and pancreatic tumors. Under normal conditions, transportin 1 (Tnpo1) binds to the C-terminal region of hnRNP A1, enabling the nuclear re-import of hnRNP A1.98 However, quercetin competes with Tnpo1 for binding to the C-terminal domain of hnRNP A1, thus preventing nuclear re-import and causing cytoplasmic retention of hnRNP A1.26 This interaction triggers cellular changes such as recruitment of stress granules, cell cycle arrest, enhancement of apoptosis, and inhibition of angiogenesis (Fig. 3B).26,96 Additionally, the related hnRNP A18 can recognize a unique RNA signature motif in the 3′ UTR of transcripts.99 This motif is critical for cancer progression and is found in genes like VEGF, TRX, RPA, and the CTLA4 immune checkpoint.99 Chemical probes such as Chembridge 5224046 or 6823240, which bind to the RNA recognition motif (RRM) of hnRNPA18, have been discovered. Their binding disrupts the interaction between hnRNP A18 and the pre-mRNA, leading to reduced levels of Trx and CTLA4 proteins, among others. Such treatments inhibit the growth of various cancer cells while sparing normal epithelial cell viability.99
On the other hand, splicing activators such as the splicing factor SRSF6 are known to be overexpressed in various cancers, correlating with poor prognostic outcomes.100 Studies have demonstrated that SRSF6 promotes colorectal cancer (CRC) proliferation, migration, and invasion in vitro, as well as tumor development and metastasis in NOD/SCID mice.101 SRSF6 contributes to tumorigenesis by modulating the alternative splicing of key tumor-associated genes, including ZO-1 exon 23, a critical cell adhesion molecule.101 The overexpression of SRSF6, leading to cellular transformation, underscores its role as an oncogene and its potential as a therapeutic target. In the context of drug discovery, virtual screening methods have identified indacaterol, a β2 adrenergic receptor-agonist typically used for treating chronic obstructive pulmonary disease (COPD), as a potential pharmacological inhibitor of SRSF6.101,102 Indacaterol inhibits the function of the SRSF6 protein by binding to its RRM2 domain, forming two hydrogen bonds with the side chains of residues E180 and D167, and two more with the backbone of residue L166.101In vitro experiments have shown that this inhibition alters ZO-1 splicing patterns, leading to an increase in the inclusion of exon 23.101 Consequently, this resulted in reduced growth of colorectal cancer cells, as well as decreased tumor development and metastasis in mouse models.101
In summary, the modulation of splicing factors by small molecules, as exemplified by the inhibition of hnRNPs with quercetin and the targeting of SRSF6 with indacaterol, represents a significant advance in oncological therapeutics. These interventions, impacting pre-mRNA splicing and function, demonstrate the potential of precise molecular targeting in cancer treatment, offering pathways for innovative drug development and therapeutic strategies.
Alternatively, molecular glues that act as degraders of splicing factors have demonstrated significant efficacy, making them attractive candidates for drug development.107,108 In the context of acute myeloid leukemia (AML), RNA binding motif 39 (RBM39) has been identified as a promising target due to its overexpression and vital role in the maintenance and survival of AML cells. RBM39 is essential for the proper splicing of RNA-binding proteins (RBPs) preferentially required in AML, thus making its degradation a feasible strategy in AML treatment.109,110 Arylsulfonamides, such as indisulam, are particularly effective in inducing the proteasomal degradation of RBM39 (Fig. 3C).111 These compounds function as molecular glues, facilitating the interaction between the RRM2 domain of RBM39 and the DCAF15-CRL4 ubiquitin ligase complex.108,112,113 This interaction leads to the ubiquitination of RBM39 in its N-terminal region by the CRL4-U3 ubiquitin ligase, resulting in proteasome-mediated degradation. The depletion of RBM39 disrupts normal splicing patterns of transcriptional regulators crucial for AML cell survival, triggering an apoptotic cascade that leads to the death of these cells.109,113,114 Furthermore, the versatility of arylsulfonamides extends to the degradation of other RNA binding motif (RBM) proteins, including RBM23, a paralog of RBM39.108
Overall, the development of proteolysis targeting chimeras (PROTACs) and RNA–PROTACs marks a significant leap in RNA splicing regulation, offering targeted approaches for degrading specific splicing factors. These novel strategies have shown promise in treating diseases like myelogenous leukemia and acute myeloid leukemia (AML), highlighting a new frontier in therapeutic interventions and drug development.103,105–108
A prototypical disease associated with RNA splicing is spinal muscular atrophy (SMA), a leading cause of child mortality.118 Over 95% of SMA cases are caused by homozygous inactivation of the SMN1 gene, which encodes the survival motor neuron (SMN) protein.119 Humans also have a paralog gene, SMN2. A higher SMN2 gene copy number often correlates with milder SMA phenotypes.120 However, a silent mutation in SMN2 exon 7 weakens its 5′ splice site, resulting in about 85% of the transcripts excluding exon 7.121 This exclusion produces a truncated, non-functional SMN protein. SMA was historically the first condition for which RNA-targeting small molecule splicing modifiers were developed, showing potency in enhancing motor function and lifespan in animal models and humans with SMA.122–125
The molecular basis for splicing correction by related compounds was established using the selective SMN2 splicing modifier SMN-C5, a substituted pyrimidone, as model (Fig. 4A).27,126 SMN-C5 interacts with the RNA helix formed between the 5′ splice site and the 5′ end of U1 snRNA in the solution structure of the complex (Fig. 4B). The carbonyl group of SMN-C5 provides specificity by forming a direct hydrogen bond with the primary amine of the bulged adenine at position −1. The positively charged amine in the piperazine moiety forms a salt bridge with the negatively charged oxygen in the phosphate group of C9 in the 5′ end of U1 snRNA. By stabilizing an unpaired adenine at the exon-intron junction, SMN-C5 is reported to act through the bulge-repair mechanism.27 This interaction induces an allosteric shift, enhancing the attachment of the U1-C zinc finger, and subsequently U1 snRNP, to the 5′ splice site of SMN2 exon 7, and thus promoting exon inclusion and the production of a functional SMN protein from SMN2 gene (Fig. 4C).
Fig. 4 Small molecules targeting RNA and its metabolism. (A) Chemical structure of splicing modifiers SMN-C5, Risdiplam, and Branaplam for the treatment of spinal muscular atrophy and Huntington's disease. (B) The bulge-repair concept.27 A weak 5′ splice site may be strengthened by small-molecule splicing modifiers acting as a glue between the 5′ end of U1 snRNA and the 5′ splice site. (C) A close-up view of the SMN-C5 binding pocket.27 (D) Chemical structure of tau splicing regulatory element (SRE) stabilizers based on dibenzothiopene (left), quinoline (middle) or pyrimidoindole (right) scaffolds. (E) A close-up view of the quinoline derivative in complex with tau SRE.28 (F) Mechanism of small molecule that stabilize tau SRE to promote exon 10 exclusion. (G) Principle of RNA degraders. |
The related splicing modifier, risdiplam, is a substituted pyrimidone already in use for SMA treatment (Fig. 4A).23,125 This early development paved the way for branaplam, a substituted phenol, which targets the huntingtin (HTT) gene to induce the inclusion of a poison exon, leading to the degradation of mRNA instead of the production of a pathological HTT mRNA isoform (Fig. 4A).22,127 Branaplam is in advanced clinical trials for Huntington's disease (HD) treatment, another inherited neurodegenerative disorder. Both SMA and HD share a feature: an unpaired adenine at position −1 (A−1) weakens the 5′ splice site. Splicing modifiers, believed to act via the bulge-repair mechanism, can correct this anomaly, leading to therapeutic effects.22,127
However, there remains an urgent need to discover splicing modifiers that target other bulged bases in the 5′ splice site. Drugs that could correct bulged −1 uracil or cytosine seem especially important, as mutations here are linked to diseases like X-linked lymphoproliferative syndrome and erythropoietic protoporphyria.116 The prevalence of genetic disorders associated with bulged 5′ splice sites emphasizes the need for drugs that can target and rectify these specific mutations.
In neurodegenerative diseases, the pre-mRNA of tau, an essential microtubule-associated protein, emerges as a promising therapeutic target.128 Mutations in the related gene have been linked to frontotemporal dementia and parkinsonism related to chromosome 17 (FTDP-17), with potential implications for Alzheimer's disease.128 Specifically, a mutation characterized by a C-to-U alteration in intron 10 of tau pre-mRNA, located 14 nucleotides downstream from the 5′ splice site, is noteworthy. This mutation destabilizes the hairpin structure at the junction of exon 10 and intron 10, a sensitive splicing regulatory element (SRE).129 This structural alteration enhances U1 snRNP binding to the 5′ splice site, leading to increased exon 10 inclusion. Small molecules that strengthen this hairpin could be therapeutically advantageous, as they might promote exon 10 skipping and diminish the generation of pathogenic tau variant.28,130–132 Recently, an InfoRNA database search identified compounds that stabilizes the tau pre-mRNA SRE (Fig. 4D).28,133 These molecules interact with a pocket at the A-bulge within the SRE stem, echoing the binding mode of splicing modifiers that target A−1 bulged splice sites (Fig. 4E). Interestingly, while some of these compounds are derived from dibenzothiopene, others contain a quinoline or pyrimidoindole core. Of these, only the quinoline derivatives demonstrated interaction, yet unspecific, with the bulged base in the tau SRE (Fig. 4E). The stabilization provided by dibenzothiopene and pyrimidoindole derivatives arised mainly from π–π and Van der Waals interactions, not direct engagement with the bulged adenine. Yet, all these compounds effectively promoted tau exon 10 exclusion, thus mitigating the effect of C-to-U mutation in the SRE of intron 10 in tau pre-mRNA (Fig. 4F).28
In the context of inherited diseases, myotonic dystrophy type 1 (DM1) originates from an extended RNA repeat expansion, termed r(CUG), located in the 3′ untranslated region of the dystrophia myotonica protein kinase (DMPK) mRNA. This mutation causes the disease through a gain-of-function mechanism.134 The r(CUG) repeat serves as a binding site that captures the muscle-blind like protein 1 (MBNL1). MBNL1 is crucial for alternative splicing of multiple transcripts, especially the insulin receptor (IR) and muscle-specific chloride ion channel (Clnc1).135–137 When MBNL1 is trapped by the r(CUG) repeats, it results in improperly spliced products, central to DM1 pathology. The affinity of MBNL1 to these repeats ranges from 3–200 nM.138 Thus, discovering molecules that can bind to the r(CUG) repeats more strongly than MBNL1 could potentially prevent MBNL1 capture and restore its physiological activity on splicing. A particular study identified a cell-permeable compound with two triaminotriazine units connected by a bisamidinium segment that can compete effectively with MBNL1 for r(CUG) repeat binding.139 Consequently, researchers developed multivalent cell-penetrating ligands targeting r(CUG) repeats. Designed to mirror the repetitive target sequence, they alternate recognition sites with bisamidinium groove binders, yielding an amphiphilic, polycationic structure reminiscent of cell-penetrating peptides.140 These compounds significantly alleviated DM1 symptoms in DM1 cells and a DM1 mouse liver model.140 Additionally, improvements in climbing defects were noted in adult DM1 Drosophila models.
In summary, targeting RNA structures with small molecules opens new therapeutic avenues for diseases driven by RNA mutations and splicing anomalies. This approach represents a significant step forward in developing innovative therapies for RNA-related pathologies, indicating a promising direction for future medical research and drug design.
In RNA degradation research, ribonuclease-targeting chimeras (RIBOTACs) represent a significant advancement. These chimeras combine an RNA-binding molecule with a compound that recruits RNase L locally (Fig. 4G).141,142 RNase L preferentially cleaves RNAs with a UNN pattern, especially those with an unpaired uracil. In RIBOTACs, the RNA-binding section specifically recognizes the structured RNA segment, due to the broader binding surfaces offered by folded RNAs. Successful examples include RIBOTACs developed to selectively target the precursor to oncogenic microRNA-21 (pre-miR-21) for cleavage and degradation with high selectivity.142 One part of the compound specifically binds pre-miR-21 and the other contains a heterocycle that recruits and activates a ribonuclease to degrade pre-miR-21, leading to antiproliferative effect in cell models of breast cancer.142 Using high-throughput approaches, recent studies have also investigated the interaction dynamics of natural product-inspired small molecules with RNA structures.115 While many identified interactions were presumed biologically inactive, the RIBOTAC strategy enabled conversion into degraders, with some exhibiting bioactivity. Specific degraders were designed for disease-related pre-miR-155, JUN mRNA, and MYC mRNA. Applying this strategy to a miRNA precursor revealed that RIBOTAC degraders decreased miRNA concentrations in disease-associated cell lines and in vivo.115 Additionally, RIBOTAC degraders effectively targeted JUN and MYC mRNAs, inhibiting pathways regulated by these oncogenes.
In parallel, the proximity-induced nucleic acid degrader (PINAD) approach is an alternative method for RNA targeting and degradation.143 Designed in part to bypass the necessity for localized RNAse L expression, PINADs are composed of an RNA-binding small molecule, an elongated flexible linker, and an imidazole, the RNA-degrading component found in many ribonucleases. Previous studies indicated that attaching imidazole groups to RNA leads to their degradation.144 This suggests the PINAD work through spatial proximity of the imidazole to the cleavage site. Consistently, a study using the aminoglycoside antibiotic neomycin, coupled with an amino acid histidine, demonstrated RNA degradation capability under physiological conditions.145 Using the PINAD methodology, two RNA degrader classes targeting unique SARS-CoV-2 genome structures were formulated against G-quadruplexes and betacoronaviral pseudoknots.143 Their efficacy in degrading targets was validated through in vitro, in cellulo, and in vivo SARS-CoV-2 models.
In summary, small molecules that target RNA have emerged as innovative tools to manipulate RNA metabolism. These molecules can be used to modulate alternative splicing and address several diseases associated with aberrant RNA splicing. The development of these small molecules is based on specific interactions with the target RNA sequence and structure. Recent advancements, such as the discovery of small molecules that act through bulge-repair27 and RNA degraders like RIBOTACs,115,142 not only highlight the potential of RNA as a drug target but also underscore the therapeutic promise of these molecules.
Name | Target class | Molecular target | Clinical trials | Chemical formula |
---|---|---|---|---|
H3B-8800 (ref. 38) | Spliceosome (U2 snRNP) | SF3b1 | NCT02841540 | |
E7107 (ref. 39) | SF3b1 | NCT00499499, NCT00459823 | ||
Herboxidiene41 | SF3b1 | — | ||
FR901464 (ref. 50) | SF3b1 | — | ||
UHMCP1 (ref. 65) | Spliceosome (UHM/ULM) | UHM domains (U2AF1, U2AF2, PUF60,…) | — | |
7,8-dimethoxyperphenazine67 | — | |||
GSK3203591 (ref. 72 and 73) | Spliceosome (snRNP biogenesis) | PRMT5 | — | |
SRPIN340 (ref. 84) | Splicing factors (via kinases) | SRPK1 | — | |
Quercetin96,97 | Splicing factors | hnRNP A1 | No relevant trial | |
Indisulam111 | RBM39, RBM23 | NCT00014625, NCT00165594, see https://clinicaltrials.gov/ | ||
SMN-C5 (ref. 27 and 125) | RNA | A−1 bulged | — | |
5′ splice sites | ||||
Risdiplam23,125 | A−1 bulged | NCT02908685, NCT03779334, see https://clinicaltrials.gov/ | ||
5′ splice sites | ||||
Branaplam22,127 | A−1 bulged | NCT05111249, NCT05330286, NCT02268552 | ||
5′ splice sites | ||||
Cmp5 (ref. 28) | Tau SRE | — |
In this review, we highlighted how small molecules that target the spliceosome can function as potential anti-cancer agents and investigative tools.24,25 Several inhibitors have been identified to target SF3b1 within the SF3b complex of U2 snRNP, including pladienolides, herboxidienes, and spliceostatins. Notably, pladienolides have moved from laboratory investigations to clinical trials, though with mixed results regarding safety and efficacy.54,57 Recent research also explained how spliceostatin A can disrupt prespliceosome formation by forming a covalent link with the zinc finger of PHF5A, a component of the SF3b complex, as revealed by crystallographic studies.147 Alternatively, recent efforts demonstrated that proteolysis-targeting chimeras (PROTACs) can be developed to selectively target SF3b1, leading to proteasome-mediated degradation and cell death.105 Moreover, it was recently shown that the cyclin-dependent kinase 11 (CDK11) phosphorylates SF3b1, and that selective inhibition of CDK11 led to widespread intron retention and accumulation of non-functional spliceosomes on pre-mRNAs and chromatin.148 Therefore, the development of inovative approaches to target the SF3b complex is ongoing. In parallel, the U2AF homology motif (UHM) domains play a crucial role in spliceosome assembly. Recent molecules like UHMCP1 have shown potential to inhibit UHM-mediated interactions within pre-spliceosomal complexes.65 Yet, achieving specificity remains challenging, as demonstrated by limited selectivity of UHMCP1 and phenothiazine derivatives for target UHMs.66,67 Future studies aim to optimize these molecules for specificity and safety. Alternatively, it should be noted that UHM-containing proteins can also be targeted on distinct regions. For instance, the small molecule NSC 194308 interacts with the tandem RRMs of U2AF2, thereby promoting an early-stage U2AF2–RNA complex and triggering a stall in pre-mRNA splicing.149,150
Efforts to modulate tissue-specific splicing can involve targeting trans-acting splicing factors with small molecules.26,100,109 Among the most promising developments are PROTACs and molecular glues, both of which promoting proteasome-mediated degradation of the target.103,151 These drug agents can induce almost complete protein degradation even when present at substochiometric concentration with respect to the target splicing factor. In this review, we have explored PROTACs bearing an RNA component that target the splicing factor Lin28A.106 Subsequently, the RNA–PROTAC triggers proteasome-mediated degradation through its E3 ligase recruitment warhead. RNA–PROTACs leverage the RNA binding capabilities of splicing factors, and it is thus plausible to imagine developing PROTACs purely based on small molecules for targeting splicing factors. For instance, attaching an E3 ligase-recruiting component to a small molecule known to target UHM domain in splicing factors could enable directed degradation of UHM-containing splicing factors.65,103 On another front, the scientific community has shown increasing interest in expanding the list of molecular glues that target RNA-binding proteins (RBPs) beyond arylsulfonamides and their representative compound indisulam, with a potential therapeutic benefits for a range of conditions.109,151 Molecular glues have unlocked opportunities to target proteins previously considered undruggable.113,152 Indeed, molecular glues work by inducing a tripartite complex between the target and the E3 ligase adapter (e.g., DCAF15), without the need for targeting a specific functional site in the target splicing factor, and while being effective at substochiometric concentrations.153 This means that, in principle, it is not necessary for the drug to fully saturate its target to achieve the desired effect on splicing modulation, thus encouraging to focus on ligand specificity rather than affinity in the design of future molecular glues.
When it comes to target RNA in order to modulate its metabolism, a recent groundbreaking progress is the development of RNA degraders, either ribonuclease-targeting chimeras (RIBOTACs)141,142 or proximity-induced nucleic acid degraders (PINADs).143 This is primarily due to their potential to selectively diminish the levels of disease-associated, target RNAs.115 While such approaches were not yet applied to splicing modulation, one could easily imagine designing RNA degraders directed toward, for instance, the toxic mRNA responsible for Huntington's disease. More generally, RNA degraders could be used in principle to selectively degrade mRNA containing the toxic exon or motif, based on specific sequence and structure elements. In parallel, there is a growing interest directed toward splicing modifiers acting through the bulge-repair mechanism at the junction between U1 snRNP and the 5′ splice site. Recent research highlights that three distinct chemotypes consistently strengthening the 5′ splice site toward U1 snRNP.152 Consistently, these chemotypes appear to share a common mode of action recapitulated by the bulge-repair concept at the 5′ splice site.27,154 In the future, a notable challenge will be the diversification of these modifiers to target other bulged bases for any position in the splice site, as bulged sites are associated with various inherited diseases and several types of cancer.116 Given the systematic nature of the associated pharmacological targets, which are short RNA duplexes with single point differences, it is anticipated that leveraging high-throughput screening combined with both generative and deterministic artificial intelligence methods will speed-up the identification of new compounds. These compounds will target the spectrum of bulged bases in 5′ splice sites related to diseases.116
In conclusion, recent studies highlight the significant promise of small molecules for modulating RNA splicing in clinical settings. Their adaptability to engage with a spectrum of targets with diverse mode of action hopefully lays the groundwork for upcoming therapeutic breakthroughs.
This journal is © The Royal Society of Chemistry 2024 |