Strong sequence–dependence in RNA/DNA hybrid strand displacement kinetics

Francesca G. Smith; John P. Goertz; Križan Jurinović; Molly M. Stevens; Thomas E. Ouldridge

doi:10.1039/D4NR00542B

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/D4NR00542B (Paper) Nanoscale, 2024, 16, 17624-17637

Strong sequence–dependence in RNA/DNA hybrid strand displacement kinetics†

Francesca G. Smith ^ab, John P. Goertz ^a, Križan Jurinović ^bc, Molly M. Stevens *^ac and Thomas E. Ouldridge *^bc
^aDepartment of Materials, Imperial College London, London, SW7 2AZ, UK
^bImperial College Centre for Synthetic Biology, Imperial College London, London, SW7 2AZ, UK
^cDepartment of Bioengineering, Imperial College London, London, SW7 2AZ, UK. E-mail: t.ouldridge@imperial.ac.uk; molly.stevens@dpag.ox.ac.uk

Received 6th February 2024 , Accepted 19th August 2024

First published on 5th September 2024

Abstract

Strand displacement reactions underlie dynamic nucleic acid nanotechnology. The kinetic and thermodynamic features of DNA-based displacement reactions are well understood and well predicted by current computational models. By contrast, understanding of RNA/DNA hybrid strand displacement kinetics is limited, restricting the design of increasingly complex RNA/DNA hybrid reaction networks with more tightly regulated dynamics. Given the importance of RNA as a diagnostic biomarker, and its critical role in intracellular processes, this shortfall is particularly limiting for the development of strand displacement-based therapeutics and diagnostics. Herein, we characterise 22 RNA/DNA hybrid strand displacement systems, alongside 11 DNA/DNA systems, varying a range of common design parameters including toehold length and branch migration domain length. We observe that differences in stability between RNA–DNA hybrids and DNA–DNA duplexes have large effects on strand displacement rates, with rates for equivalent sequences differing by up to 3 orders of magnitude. Crucially, however, this effect is strongly sequence-dependent, with RNA invaders strongly favoured in a system with RNA strands of high purine content, and disfavoured in a system when the RNA strands have low purine content. These results lay the groundwork for more general design principles, allowing for creation of de novo reaction networks with novel complexity while maintaining predictable reaction kinetics.

Introduction

Strand displacement reactions form the basis of most dynamic nucleic acid reaction networks and are fundamental to the field of DNA nanotechnology.^1–3 Strand displacement is a nucleic acid-based process in which an incumbent strand (Inc) is replaced by an invader strand (Inv) within a complex (SInc) with a substrate strand (S). Specifically, toehold-mediated strand displacement (TMSD) reactions are a subset of strand displacement reactions that repeatedly appear in nucleic acid circuits due to their relatively high reaction rates. The presence of an overhang or ‘toehold’ motif on the substrate strand to which the invader can bind, known as the invader toehold (γ), provides the thermodynamic drive for forward strand displacement (Fig. 1A). Hybridisation of the invader strand to the invader toehold is followed by branch migration, in which displacement of the incumbent strand occurs by a random walk process. Completion of branch migration results in two nucleic acid products: a single-stranded incumbent strand (Inc) and fully complementary invader–substrate complex (SInv).


	Fig. 1 Mechanism of strand displacement reactions. (A) The mechanism of toehold-mediated strand displacement (TMSD). An invader (Inv) strand hybridises to a partially-complementary incumbent–substrate complex (SInc) via the invader toehold (γ) and displaces the incumbent strand within the displacement domain (β). (B) The mechanism of toehold exchange. The presence of a second toehold (ε) allows the incumbent strand to displace the invader from the invader–substrate complex (SInv).

The highly efficient and specific nature of TMSD reactions makes this mechanism a critical tool within dynamic nucleic acid reaction circuit designs. Moreover, the kinetic and thermodynamic properties of DNA-based (or DNA > DNA) TMSD reactions have been extensively and systemically characterised.^4,5 Zhang and Winfree (2009) pioneered systematic investigations into the effect of toehold length on the rate of TMSD. This foundational work revealed that the rate of strand displacement increases exponentially with a linear increase in toehold length up to approximately 6–7nt, above which the rate of strand displacement plateaus. The understanding obtained from this seminal work has enabled development of increasingly complex de novo reaction networks with predictable reaction kinetics, implemented across broad range of applications including digital nucleic acid computation,^1,2,6 molecular motors,⁷ and diagnostic biosensors.^3,8,9

Since the foundational work by Zhang and Winfree (2009), further studies have explored the kinetics of DNA > DNA strand displacement in more detail to harness more fine-tuned kinetic control within this reaction motif. The introduction of mismatched base pairs in the branch migration domain of the invader–substrate complex has been found to decrease the rate of strand displacement by up to 4 orders of magnitude.^10,11 On the other hand, elimination of an existing mismatch in the branch migration domain of the incumbent–substrate complex can increase in the rate of strand displacement by up to 2 orders of magnitude.^10,12 Fine-tuned programmable control has equally been realised through introduction of a spacer between the invader toehold and branch migration domain, known as a remote toehold.¹³ This kinetic variability offers improved system flexibility and has facilitated circuit design with more regulated dynamics.

Building on the experimental work towards understanding DNA > DNA TMSD, a number of predictive models have been developed to capture strand displacement kinetics. Within the second-order limit, TMSD kinetics can be effectively described as an instantaneous, bimolecular reaction, with two products formed from two reactants via a single reaction step. Zhang and Winfree employed a simple, two-intermediate model to describe TMSD reaction kinetics.⁵ While this phenomenological model was able to successfully explain the exponential increase in rate with toehold length as well as the presence of a rate plateau, it was unable to provide insight into why the rate plateaued for toehold lengths of approximately 6–7nt. Subsequently, Srinivas et al. (2013) developed an ‘Intuitive Energy Landscape’ (IEL) model for TMSD, which provides a nucleotide-level understanding of strand displacement kinetics and offers useful insights from a biophysical perspective. The IEL model has since been expanded to successfully capture the effect of introduction or elimination of mismatches on reaction kinetics, supporting the previous experimental data.¹⁰

While TMSD is frequently employed in nucleic acid reaction networks, it is limited in its use due to its irreversibility. When reversible displacement reactions are desired, the toehold exchange reaction motif is often used as an alternative. Toehold exchange involves a second toehold, known as the incumbent toehold (ε), that is initially sequestered in the incumbent–substrate complex but is exposed following incumbent displacement. This toehold facilitates hybridisation of the incumbent strand to the invader–substrate complex and as such allows effective reversibility⁵ (Fig. 1B). The second toehold enables construction of more flexible, reversible reaction networks by weakening the coupling between the thermodynamics and kinetics of such systems. There are numerous examples in which this property of toehold exchange has been exploited to achieve novel functionality including reversible logic gates^14,15 and regulated molecular switches.^16,17

The kinetics of toehold exchange have also been studied in depth. Systematic characterisation showed that the rate of strand displacement is effectively independent of the length of the incumbent toehold within the limit that the incumbent toehold (ε) is shorter than the invader toehold (γ). However, for ε > γ, the rate of toehold exchange decays sharply with increasing ε.⁵

While DNA remains a critical building block in nucleic acid nanotechnology, recent years have seen increased interest in RNA-based or RNA/DNA hybrid nucleic acid nanodevices and reaction circuits.^3,9,18–21 Despite being less stable and more costly than DNA, the structural and catalytic properties of RNA open up the field of nucleic acid nanotechnology to novel functionalities and design capabilities. Furthermore, RNA has vital roles within the cell, with direct functions in gene regulation, protein-coding, structural scaffolding and catalysis. The fundamental role for RNA in the correct functioning of cells makes it a highly informative biomarker for many diseases and developmental disorders including cancers, heart disease and neurological disorders.^22–27 As such, it is unsurprising that many novel nucleic acid reaction schemes have been geared towards RNA biosensing applications in recent years.^3,9,28,29 Additionally, RNA–DNA hybrids have been shown to hold important roles in chromosome segregation, telomere regulation and replication regulation.^30–32 These observations emphasise the importance of effectively interfacing RNA within hybrid nucleic acid reaction circuits and networks with a view to diagnostic and therapeutic applications. Moreover, in vivo studies have also revealed that RNA/DNA strand displacement reactions (in which RNA displaces DNA or equally DNA displaces RNA) appear to play a critical role within numerous intracellular processes including RNA transcription; genome repair, and the operation of the CRISPR/Cas machinery, highlighting the importance of gaining a clear understanding of the kinetics of this reaction in particular.^33–35

At a fundamental level, RNA > DNA (RNA displacement of DNA from a DNA–DNA duplex) strand displacement involves the separation of DNA–DNA complexes and the production of RNA–DNA hybrid complexes and vice versa in the case of DNA > RNA (DNA displacement of RNA from an RNA–DNA hybrid complex) strand displacement. The kinetics of RNA > DNA and DNA > RNA strand displacement are therefore likely to be highly dependent on the stability of RNA–DNA complexes relative to DNA–DNA complexes, a consideration that does not arise in DNA > DNA and RNA > RNA strand displacement. Moreover, it is likely that the properties of the displacement domain will play a much more important role in hybrid strand displacement than the DNA > DNA context, in which branch migration along the displacement domain replace base pairs in one duplex with identical base pairs in another.

Previous hybridisation studies of RNA–DNA hybrids point to a dependence on the purine (G/A) content within the RNA sequence. The consensus suggests that, for low purine content in the RNA strand of the hybrid, RNA–DNA hybrids are less stable than the corresponding DNA–DNA complexes. In contrast, RNA–DNA hybrids show greater stability than the equivalent DNA–DNA complex when there is a high purine content in the RNA sequence of the hybrid.^36–39 Notably, many of these studies only tested short oligonucleotides and a relatively limited number of sequences were considered.^36,37 TMSD reactions allow us to effectively probe this purine dependence and quantify its effect on the thermodynamics and reaction kinetics.

Despite the importance of RNA/DNA hybrid (RNA > DNA and DNA > RNA) TMSD, and its potential complexity with respect to the DNA-based analog, understanding of RNA/DNA hybrid strand displacement kinetics remains limited. Some initial studies into RNA/DNA hybrid strand displacement have been performed,⁴⁰ however thus far there is insufficient experimental data to draw consistent conclusions about the kinetics of these systems and to exploit for further computational modelling. Moreover, no studies have explicitly addressed the role of purine content on the kinetics of RNA/DNA hybrid strand displacement.

In this work we characterise RNA/DNA hybrid strand displacement reactions across a range of common design parameters including invader toehold length and branch migration domain length. We extract rate constants for each system and reveal distinct differences in strand displacement kinetics between DNA > DNA and RNA/DNA hybrid systems. We highlight the importance of the sequence composition and specifically purine content of the branch migration domain in determining the rate of hybrid strand displacement reactions, which lies in contrast to DNA > DNA strand displacement. Finally, we interpret this experimental rate data by parameterising a continuous-time Markov chain model of RNA/DNA hybrid strand displacement. This model can describe strand displacement rates for RNA/DNA hybrid systems in terms of toehold length, branch migration domain length and, critically, includes a sequence-dependent parameter for the relative stability of RNA–DNA and DNA–DNA complexes as dictated by sequence composition.

Materials and methods

Reagents and sequence design

All sequences used in the low RNA purine content experiments (DNA_py > DNA_py, RNA_py > DNA_py, DNA_py > RNA_py) were designed in NUPACK (https://www.nupack.org).⁴¹ Sequences were designed with minimal secondary structure. All sequences used in the high RNA purine content experiments (DNA_pu > DNA_pu, RNA_pu > DNA_pu, DNA_pu > RNA_pu) were adapted from Yao et al. (2015). All DNA and RNA strands were ordered from Integrated DNA Technologies (IDT, Coralville, Iowa) with HPLC purification, and normalised to 100 μM in LabReady 1× IDTE buffer (pH 8.0). For labelled strands a TTT spacer was introduced between the fluorophore/quencher and the sequence of interest. All sequences were designed to have between 40% and 60% GC content. All sequences used in this work are given in Tables S1 & S2.† Table S3† lists all strands required to recreate each figure in this work.

Annealing complexes

Strands were combined and heated to 95 °C for 5 minutes and then cooled to 20 °C at a constant rate of 1 °C min⁻¹ to form complexes. For reporter (FQ) complexes, F and Q strands were combined for a final concentration of 300 nM and 360 nM, respectively. For SInc complexes, S and Inc strands were combined for a final concentration of 200 nM and 240 nM, respectively. We used a 20% excess of Inc and Q strands in order to ensure that all S and F strands were bound into complexes. All annealing was performed at 1 M NaCl in 1× TAE (Tris–Acetate–EDTA) buffer.

Fluorescence spectroscopy

All fluorescence measurements were performed using the BMG CLARIOstar® microplate reader. Reactions were performed in clear, flat bottom 96-well plates from Greiner Bio-One. All measurements were taken from the bottom of the wells. For all measurements, each read was an average of 20 flashes. Flashes were taken in a spiral configuration in the well with a 4 mm diameter, to account for any heterogeneity within each well. For fast (well mode) reactions measurements were not taken in a spiral but at a single point at the centre of each well. Plates were maintained at the reaction temperature (25 °C) for 30 minutes prior to the first measurement to ensure that the reaction mixtures were at the correct temperature. Injection of the relevant trigger strand for each reaction was performed using the automatic injection feature of the CLARIOstar® microplate reader at a pump speed of 430 μL s⁻¹. After each injection, samples were shaken for 6 s at a speed of 400 rpm in a double-orbital configuration. The injectors were passivated with 5% BSA for 20 minutes prior to being loaded with DNA or RNA samples to minimise loss of the nucleic acid and maximise reproducibility. Plates were sealed using Thermo Fisher Adhesive PCR Plate Seals and kept for up to a month in a shaking incubator at the reaction temperature and 70 rpm to allow the reaction to reach equilibrium. F was labelled with AlexaFluor488 (excitation: 488 nm; emission: 496 nm) and Q was labelled with IowaBlack FQ®. An excitation window of 488-14 and emission window of 535-30 were used. The same focal height (5.9 mm) was used across all experiments to allow for comparison between experiments. The gain was set to the same value as that used for calibration curve measurements to ensure comparability (the gains used for each calibration curve are given in Fig. S2†).

General procedure for experimental strand displacement characterisation

Experimental protocols were adapted from ref. 42. All experiments had a final reaction volume of 200 μL, and used a monovalent salt concentration of 1 M NaCl unless otherwise stated. All experiments were composed of multiple measurements that each captured either the reaction kinetics or allowed for estimation of reactant or product concentrations. All experiments included a positive control with 15 nM FQ and an excess (20 nM) Inc; and a buffer-only negative control. All reactions were measured in at least triplicate to obtain an estimate of the error within our measurements. Of note, the concentrations given are intended concentrations of reactants, however for fitting purposes we determined the exact concentrations for each reactant based on fluorescence output after calibration.

Fluorescence conversion calibration. Calibration curves were generated to allow for conversion of fluorescence in arbitrary fluorescence units (afu) to concentration of fluorescent product (FInc) in nM to quantify the concentration of reacted FQ in each assay. Final reaction volumes of 200 μL were used for all calibration experiments. Calibration curves were generated by reacting 15 nM of FQ with varying concentrations of Inc (2–20 nM for the low RNA purine content system and 4–30 nM for the high RNA purine content system). An additional negative control with 0 nM Inc was also included for all experiments. Further details of the calibration experiment protocols are given in ESI Note 6.1† (full protocol in Tables S4 & S5†). Details of the results of calibration experiments are given in Fig. S2 and ESI Note 6.3.†

Reporter characterisation reactions. Reporter characterisation reactions were used to estimate the reporter rate constant, k_rep, for each FQ and Inc combination used in this work. Reporter reactions are described as


	(1)

where FInc represents the fluorescent product. We can fit a simple ordinary differential equation (ODE) model to the normalised reaction data in order to extract estimates for the reporter rate constant (k_rep). Using conservation laws, the reporter reaction can be reduced to a single ODE


	(2)

For these experiments we used [FQ] = 15 nM and [Inc] = 4–12 nM. The majority of FQ complexes used herein were designed with a 6nt toehold (RNA > DNA and DNA > DNA strand displacement reactions for the high RNA purine content system used an FQ reporter toehold of 8nt). All reactions were saturated with an excess concentration of Inc after reaction kinetics were completed to determine [FQ] at time t = 0. An additional negative control was included with [FQ] = 15 nM and [Inc] = 0 nM. Further details of experimental protocol for reporter characterisation are given in ESI Note 7.1† (full protocol in Table S6,† illustrative fluorescent data in Fig. S3†). Further details of the normalisation and estimation of k_rep are given in ESI Notes 7.2 and 7.3,† respectively. All fluorescent traces for reporter characterisation are given in Fig. S4,† fits of the ODE model to the fluorescent traces are given in Fig. S5 & S6† and estimated k_rep values are given in Tables S7 & S8.†

Characterisation of full toehold exchange reactions. We estimate the effective rate constant, k_eff, for each toehold exchange system that was tested. We model toehold exchange reactions as a second-order reaction


	(3)

We combine both the toehold exchange and reporter displacement reactions to generate a system of ODEs. Using conservation laws this series of reactions can be well described by 3 ODEs


	(4)


	(5)


	(6)

Note that eqn (6) is equivalent to eqn (2) above.

For each design we estimated k_eff for DNA > DNA, RNA > DNA and DNA > RNA displacement reactions. These experiments were performed by combining [FQ] = 15 nM, [SInc] = 10 nM and [Inv] = 4–8 nM (for DNA_pu > RNA_pu strand displacement reactions we used [Inv] = 40–80 nM). We used an excess of [FQ] and [SInc], unless specified otherwise, to ensure that all [Inv] reacted and we were able to effectively estimate [Inv] at time, t = 0, [Inv]₀. For these experiments we used 3 different [Inv] concentrations to confirm the robustness of the second-order assumption and each reaction was performed in triplicate. All reactions were saturated with an excess concentration of Inv after reactions kinetics were complete to determine [SInc] at time t = 0. Reactions were next saturated with an excess concentration of Inc to determine [FQ] at time t = 0. An additional negative control was included with [FQ] = 15 nM, [SInc] = 10 nM and [Inv] = 0 nM. Further details of the experimental protocol for toehold exchange characterisation are given in ESI Note 8.1† (full protocol in Tables S9 & S10,† illustrative fluorescent data in Fig. S7 & S8†). Further details of the normalisation and estimation of k_eff are given in ESI Notes 8.2 and 8.3,† respectively. All fluorescent traces for toehold exchange characterisation are given in Fig. S10–S12,† fits of the ODE model to the fluorescent traces are given in Fig. S13–S18† and fitted k_eff values are given in Tables S12 & S13.†

Characterisation of leak reactions. We observed an undesired leak reaction between SInc and FQ for DNA_pu > RNA_pu strand displacement reactions. We describe the undesired leak reaction as


	(7)

For these strand displacement reactions we adapted the experimental protocol to effectively estimate k_eff despite the presence of the leak reaction. These experiments were performed by combining [FQ] = 15 nM, [SInc] = 10 nM and [Inv] = 40–80 nM. An excess of Inc is also present from formation of the SInc complexes. All reactions were performed in triplicate. All reactions were subsequently saturated with a further large excess concentration of Inv to ensure all SInc complexes were dissociated and therefore determine [SInc] at time t = 0. Reactions were next saturated with an excess concentration of Inc to determine [FQ] at time t = 0. An additional negative control was included with [FQ] = 15 nM, [SInc] = 10 nM and [Inv] = 0 nM. We first estimated the leak reaction rate constant, k_leak, from this negative control reaction using the following ODEs


	(8)


	(9)


	(10)

Note that the right hand side of eqn (9) is equivalent to eqn (2) and (6) above. We include the ODE describing the reporter displacement reaction due to the presence of excess Inc.

For the toehold exchange reactions of interest we then estimated k_eff while keeping the corresponding k_leak estimate fixed. We adapted the ODE model in eqn (4) and (5) to account for the undesired leak reaction observed for DNA > RNA strand displacement


	(11)


	(12)


	(13)


	(14)

Note that the right hand side of eqn (11) is equivalent to eqn (4) above and the right hand side of eqn (14) is equivalent to eqn (10) above.

Further details of experimental protocol for toehold exchange characterisation for these leak reactions are given in ESI Note 8.1† (full protocol in Table S11,† illustrative fluorescent data in Fig. S9†). Further details of the normalisation are given in ESI Note 8.3.† Further details of k_leak and k_eff estimation are given in ESI Note 8.4.† All fluorescent traces for toehold exchange characterisation for systems with a leak are given in Fig. S12,† fits of the ODE model to the fluorescent traces are given in Fig. S19 & S20† and fitted k_leak and k_eff values are given in Tables S14 and S15,† respectively.

General fitting procedure

All fitting was performed in Python version 3.8.8. First, kinetic traces for fluorescence in afu were converted to [FInc] in nM using the calibration results, then least-squares fitting of the ODE model was used to estimate relevant rate constants. Initial reactant concentration parameters, which may deviate from intended values due to experimental imperfections, were calculated prior to fitting using the calibration results.

For reporter characterisation data we estimated the rate constant k_rep. For each experiment we derived a single, global estimate of k_rep across all replicates and each initial concentration of Inc at time t = 0 ([Inc]₀). [Inc]₀ and [FQ]₀ were fixed values in the fitting protocol. The k_rep values estimated in the reporter characterisation experiments were fixed for fitting the corresponding toehold exchange reactions. Similarly, the values of k_leak obtained from preliminary experiments (see ESI Note 8.4†) were fixed for fitting the corresponding DNA_pu > RNA_pu strand displacement reactions.

We estimated k_eff for each toehold exchange system that we tested in the standard 1 M NaCl buffer. [Inv]₀, [SInc]₀ and [FQ]₀ were fixed values in the fitting protocol. Unless otherwise specified, for each experiment we estimated a single, global estimate for k_eff across all replicates and [Inv]₀.

For DNA_pu > RNA_pu strand displacement reactions we were unable to estimate global k_eff values. In this case, for each experiment we assumed that [Inv]₀ matched the intended concentration and calculated the individual estimates of k_eff for each experimental trace. We report the mean of these k_eff values instead of a global fit of k_eff. Fitting individual estimates of k_eff for each experimental trace for all other sets of experiments produced results that agreed well with the global fits. ESI Note 7.3† contains a detailed description of the fitting procedures for reporter characterisation experiments. ESI Notes 8.3 and 8.4† contain a detailed description of the fitting procedures for full strand displacement experiments.

Initial rate predictions using IEL model

Initial predictions from the IEL model were based on previous parameters for DNA > DNA strand displacement taken from ref. 10: ΔG_assoc = 2.5k_BT, ΔG_bp = –2.52k_BT, ΔG_p = 3.5k_BT, ΔG_bm = 7.4k_BT and k_bp = 5.9 × 10⁷ s⁻¹. For initial perdicitions ΔG_rd, the relative stability of RNA/DNA and DNA/DNA duplexes, is a flexible parameter. For these predictions we used an initial invader concentration of 6 nM, with γ = 4nt, β = 26nt and ε = 4nt (Fig. S1†). We assumed a reaction temperature of 25 °C (298 K). Full details of the IEL model are given in ESI Notes 1–3.† Detailed explanation of these initial predictions are given in ESI Note 4.†

Parameterisation of IEL model for RNA/DNA hybrid strand displacement

Parameterisation of the free-energy landscape model was performed in Python version 3.8.8. For parameterisation based on the low RNA purine content (DNA_py > DNA_py, RNA_py > DNA_py, DNA_py > RNA_py) kinetic data ΔG_assoc, ΔG_p and ΔG_bp were fixed at the previously estimated values given in Table 1, while ΔG_bm, k_bp and ΔG_rd were estimated. For parameterisation using the high RNA purine content (DNA_pu > DNA_pu, RNA_pu > DNA_pu, DNA_pu > RNA_pu) kinetic data ΔG_assoc, ΔG_p, ΔG_bp and ΔG_bm were fixed at the values predicted from the low RNA purine content kinetic data, while k_bp and ΔG_rd were flexible parameters. For parameterisation the reaction temperature was set to 25 °C (298 K). Parameterisation was performed by minimising the squared residuals between the model-predicted and the experimentally-derived values of k_eff. The details of the parameterisation protocol are given in ESI Note 9.†

Table 1 Estimated parameter sets to describe RNA/DNA hybrid strand displacement reactions for low RNA purine content design. Estimated parameters for the IEL model obtained from fits of RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py toehold exchange experimental data. The bottom two rows are a comparison for the equivalent parameters from previous studies of DNA > DNA strand displacement systems.^10,11 Parameter values in the final row of the table were derived by Irmisch et al. (2020) from data produced by Machinek et al. (2014). For k_bp we provide lower and upper confidence intervals (CI) as this value was estimated as log₁₀(k_bp). The available data doesn't strongly constrain all parameters, with similar fits arising when subsets of parameters were varied together therefore parameters marked with an asterisk (*) were fixed at the values from Irmisch et al. (2020) to allow for appropriate parameter estimation

Data set	ΔG_bp (k_BT)	ΔG_assoc (k_BT)	ΔG_p (k_BT)	ΔG_bm (k_BT)	k _bp (s⁻¹)	ΔG_rd (k_BT)
This study	−2.52*	2.5*	3.5*	9.3 ± 1.2	6.4 (CI: 2.3; 18.3) × 10⁷	0.16 ± 0.04
Irmisch	−2.52	2.5 ± 0.2	3.5 ± 0.2	7.4 ± 0.2	(5.9 ± 1.1) × 10⁷	—
Machinek	−2.51	4.0 ± 0.2	5.4 ± 0.4	8.5 ± 0.3	(20.6 ± 0.6) × 10⁷	—

Results and discussion

RNA/DNA hybrid free-energy landscape model

In this work we have developed a continuous-time, Markov chain model to describe the rate of strand displacement for RNA/DNA hybrid systems. This model has been adapted from the Intuitive Energy Landscape (IEL) model developed by Srinivas et al. (2013). The IEL model was initially introduced to explain DNA > DNA strand displacement reaction kinetics.⁴³ More recently it has been expanded to capture the effect of mismatches in strand displacement systems.¹⁰ We further expand on the IEL model as described by Irmisch et al. (2020) to allow a description of RNA > DNA and DNA > RNA strand displacement kinetics. For all systems referred to here the substrate strand (S) was composed of DNA.

Our free-energy landscape model is shown in Fig. 2A. This model contains a number of distinct states which represent each step in the strand displacement reaction, from state −γ through to state N. State −γ refers to the stage in which the invader is unbound from the incumbent–substrate complex. Completion of toehold hybridisation is defined as state 0, in which in the invader is bound to the incumbent–substrate complex via the toehold but branch migration has not been initiated. Branch migration proceeds through to state β in which the incumbent has been successfully displaced by the invader strand. For systems with no incumbent toehold state β, is equivalent to state N. For systems in which the incumbent toehold length is non-zero (ε > 0 as in Fig. 1B), state N refers to the complete dissociation of the incumbent strand from the incumbent toehold of the invader–substrate complex.


	Fig. 2 Free-energy landscape model for toehold exchange. (A) Simplified free-energy landscape of a toehold exchange reaction, based on the IEL.¹⁰ The substrate has length N + γ, with invader toehold length γ, branch migration domain length β and incumbent toehold length ε. Within the diagram, the states represented by blue dots are defined by the number of invader base pairs formed; large barriers inhibit transitions between these states during branch migration. (I) Unbound invader and incumbent–substrate complex; (II) single invader nucleotide hybridised to the invader toehold; (III) invader strand fully hybridised to the invader toehold; (IV) invader strand fully displaced incumbent strand in branch migration domain; (V) single incumbent nucleotide bound to the incumbent toehold; (VI) unbound incumbent and invader–substrate complex. Curved, red arrows represent spontaneous invader and incumbent dissociation, and straight, double-headed red arrows reflect free-energy changes. (B) ΔG_rd determines the gradient of the landscape within the branch migration domain. For RNA > DNA strand displacement, negative ΔG_rd values produce a downhill landscape and positive ΔG_rd values produce an uphill landscape. (C) For RNA > DNA toehold exchange, we predict the rate constant (k_eff) for an increase in β = 11nt to β = 26nt, across a range of ΔG_rd = −0.4k_BT to ΔG_rd = 0.4k_BT for γ = 4nt and ε = 4nt. Increasing ΔG_rd increases the dependence of rate on branch migration domain length.

The IEL model as defined by Irmisch et al. (2020) fully describes non-mismatched DNA > DNA strand displacement reactions with 4 free-energy parameters and a single rate parameter: ΔG_assoc, ΔG_bp, ΔG_p, ΔG_bm and k_bp.¹⁰ ΔG_assoc is the free-energy penalty caused by the initial association of the invader to incumbent–substrate complex within the toehold resulting in reduced orientational and translational freedom of the single strand at the standard 1 M reference concentration. ΔG_bp corresponds to the free-energy change resulting from formation of a single additional DNA–DNA base pair within a duplex. Therefore, toehold hybridisation can be expressed as a free-energy reduction by ΔG_bp for each base pair formed in this process. For simplicity, within this model we ignore the individual contribution of the different nucleotides and assume a sequence-average value for ΔG_bp. ΔG_p defines the free-energy penalty associated with the initiation of branch migration due to the presence of two single-stranded overhangs as the incumbent strand starts to be displaced. ΔG_bm is the free-energy barrier associated with each branch migration step. This parameter can be thought of as the transition barrier to breaking a single incumbent–substrate base pair and formation of a single invader–substrate base pair, due to the rearrangement of invader and incumbent strands involved in displacement.⁴³ We assume that for DNA > DNA strand displacement reactions there is no net free-energy change between adjacent states during branch migration. These 4 free-energy parameters describe a one-dimensional free-energy landscape for a non-mismatched DNA > DNA strand displacement reaction. Notably, the IEL model as described by Irmisch et al. (2020) also accounts for spontaneous incumbent dissociation before state N is reached. The probability of spontaneous detachment of the incumbent strand is assumed to drop off exponentially with an increase in the number of remaining base pairs with the substrate strand. Expanding the one-dimensional landscape model to include spontaneous incumbent dissociation has been shown to improve predictions of experimental strand displacement kinetic data.^10,11

In this study we introduce an additional free-energy parameter, ΔG_rd, which describes the free-energy difference between DNA–DNA and RNA–DNA base pairs. Using these 5 free-energy parameters the free-energy change between each state was calculated in order to construct complete free-energy landscapes for DNA > DNA, RNA > DNA and DNA > RNA strand displacement reactions. We stress that using a single parameter to describe the asymmetry between DNA/DNA and DNA/RNA duplexes is the simplest possible way to adjust the landscape, and the effective ΔG_rd is expected to be sequence dependent. We will use fits of ΔG_rd to experimental data as a useful way to quantify the relative advantage of RNA or DNA strands in a given context.

Using an analytical solution of our model we calculated the predicted rate of reaction from the free-energy landscape for each system (ESI Note 3†). We first calculated the first passage time t, which is defined as the time taken to pass from state −γ through to state N for each system. Within the second order limit the first passage time is related to the effective second-order rate constant (k_eff) according to


	(15)

where c is the initial free invader concentration.

We made preliminary k_eff predictions for RNA > DNA and DNA > RNA toehold exchange reactions across a range of toehold lengths and branch migration domain lengths. For initial predictions we made use of free-energy parameter values extracted from previous work of DNA > DNA strand displacement kinetics and allowed ΔG_rd to remain a flexible parameter.¹⁰ The IEL model suggests that for strand displacement reactions with no net free-energy change between states during branch migration (ΔG_rd = 0k_BT) the rate of strand displacement decreases with an increase in the branch migration domain length, β, according to


	(16)

Previous studies of hybridisation within the literature suggest that RNA–DNA hybrids are less stable than the corresponding DNA–DNA complexes for systems with a low purine content in the RNA strand of the hybrid, while a high RNA purine content produces the opposite thermodynamic trend.^36–39 Under this assumption, we would expect RNA displacing DNA (RNA > DNA) and DNA displacing RNA (DNA > RNA) to exhibit an overall uphill free-energy landscape for low and high RNA purine content systems, respectively (Fig. 2B). Within these limits, our initial predictions suggest an approximately exponential decrease in effective rate constant with a linear increase in β (Fig. 2C). By contrast, we anticipate RNA > DNA and DNA > RNA strand displacement reactions to exhibit an overall downhill free-energy landscape for high and low RNA purine content systems, respectively. In this case preliminary predictions indicate a very weak dependence of the effective rate constant on β for branch migration domain lengths greater than 10nt. These initial predictions suggest that the branch migration domain length is a critical parameter in effectively extracting ΔG_rd, displaying significantly different trends between uphill and downhill landscapes (Fig. S1†). Therefore, we focussed on systems with differing branch migration domain lengths to probe the relative stability of RNA–DNA and DNA–DNA base pairs, and extract a useful estimate for ΔG_rd for RNA > DNA and DNA > RNA strand displacement reactions across systems of purine content extremes.

Experimental characterisation of RNA/DNA hybrid strand displacement reactions

We designed a toehold exchange reaction network to probe the reaction rates for RNA > DNA and DNA > RNA strand displacement systems. These toehold exchange reactions were monitored over time by assessing the concentration of released incumbent strand over the course of the reaction. We made use of a distinct fluorescent reporter detection system (Fig. 3A) to assess the concentration of released incumbent to avoid directly labelling the strands within the reaction of interest, which may otherwise influence the kinetics of strand displacement.⁴⁴ The reporter complex was composed of a fluorescently (AlexaFluor488)-labelled strand (F) which was partially complementary to a quencher (IowaBlack-FQ)-associated strand (Q). The reporter complex was designed such that F had a 5′ overhang of 6nt or 8nt, which acted as the reporter toehold (δ). The released incumbent strand (Inc) possessed a domain complementary to the toehold δ, allowing the incumbent to displace the quencher-associated strand, resulting in a detectable fluorescent product (FInc). Across all the systems, we designed the reporter reaction to be effectively instantaneous compared to the toehold exchange reaction of interest such that the reporter acted as an effective readout for the toehold exchange kinetics.


	Fig. 3 Reporter detection system for experimental characterisation of toehold exchange reactions. (A) Toehold exchange reaction network. γ is the invader toehold, β is the branch migration domain, ε is the incumbent toehold and δ is the reporter toehold. Barred letters are complementary to their unbarred counterparts. Toehold exchange occurs between the invader strand and the incumbent–substrate complex with second-order rate constant k_eff. The released incumbent induces fluorescence by displacing the quencher-associated strand (Q) from a complex with a fluorescently-labelled strand (F), with second-order rate constant k_rep. (B) Fluorescence traces of FInc for a range of Inc concentrations (4–12 nM). Coloured, solid lines show the normalised fluorescent traces. Dashed, black are the output of fitting an ODE model to this fluorescent data.

For each design we assessed RNA and DNA counterparts of invader and incumbent strands, however for all systems herein only DNA substrate strand and DNA–DNA reporter complexes were used. Importantly, all sequences were designed to minimise secondary structure which may otherwise have affected the strand displacement rate, and confounded estimation of k_eff. In this work, we designed one system with low purine content in the invader strand (17% G/A) and designed a second system with high purine content in the invader strand (78% G/A). For these two independently-designed systems we aimed to extract the effective rate constant for DNA > DNA, RNA > DNA and DNA > RNA strand displacement reactions across a range of invader toehold lengths (4–10nt) and branch migration domain lengths (11–26nt) or (11–21nt for the high RNA purine content design). We refer to RNA > DNA strand displacement for the design with high purine content in the RNA invader strand (and thus high purine content in the incumbent strand) as RNA_pu > DNA_pu and equivalently for DNA > RNA strand displacement as DNA_pu > RNA_pu. For systems with low purine content in the invader (and thus high pyrimidine content), we use RNA_py > DNA_py and DNA_py > RNA_py. In order to minimise interaction and limit potential sequestration of the reporter toehold by the invader we ensured that only 4nt of the invader strand overlapped with the reporter toehold. We also introduced a GC-rich clamp in both the incumbent–substrate and reporter complexes to minimise undesired leak reactions. This design resulted in an incumbent toehold (ε) of 4nt for all systems.

To minimise cost we designed our system such that only two RNA or DNA invaders were required across all toehold length and branch migration domain length combinations: one for the low RNA purine content design and one for the high RNA purine content design. For the low RNA purine content design an invader with γ = 10nt and β = 26nt was used, while the toehold length and branch migration domain length of the substrate and incumbent strands were adjusted. For the high RNA purine content design an invader with γ = 6nt and β = 21nt was used, while the branch migration domain length of the substrate and incumbent strands were adjusted to achieve the desired domain lengths.

We initially performed a series of reporter characterisation reactions to estimate the rate of TMSD for the reporter reaction (k_rep). For these experiments we combined the 15 nM of FQ with Inc at a range of concentrations (Fig. 3B). Across all the reporters that were tested we showed that the reporter strand displacement reaction was sufficiently rapid to act as an effective readout for the reaction of interest, confirming the suitability of these designs. We estimated k_rep for each reporter reaction by fitting an ODE model to the fluorescence data (ESI Note 7.3 and Fig. S6†). Final estimates of k_rep for all designs are given in Table S8.†

We next monitored the fluorescence for the complete strand displacement network for the low purine content design (Fig. 4A). We initially assessed the effect on k_eff of changing the invader toehold length (410nt) for a fixed branch migration domain length of 26nt. Reactions were initiated by combining 4 nM, 6 nM or 8 nM of Inv with 10 nM of SInc and 15 nM of FQ. We then estimated k_eff by fitting a simple ODE model to the fluorescent traces using a non-linear least-squares approach (ESI Note 8.3, Fig. S16–S18†) (Fig. 4B). We found that the estimated k_eff values (Fig. 4C) for DNA_py > DNA_py toehold exchange were in good agreement with experimental data from previous studies, with the rate appearing to plateau at an invader toehold length between 6nt and 7nt.^5,10 Both DNA_py > RNA_py and RNA_py > DNA_py toehold exchange systems appeared to follow the same general trend as DNA_py > DNA_py toehold exchange. However, RNA_py > DNA_py and DNA_py > RNA_py strand displacement rates plateaued at toehold lengths closer to 7nt and 6nt, respectively. These results also reveal that RNA_py > DNA_py strand displacement is up to an order of magnitude slower than the corresponding DNA_py > DNA_py strand displacement reaction. In contrast, DNA_py > RNA_py strand displacement is up to an order of magnitude faster than DNA_py > DNA_py strand displacement and up to two orders of magnitude faster than RNA_py > DNA_py strand displacement. Numerical estimates of k_eff for RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py strand displacement reactions are given in Table S13.†


	Fig. 4 RNA > DNA and DNA > RNA toehold exchange kinetics for low RNA purine content design. (A) Schematic of RNA_py > DNA_py and DNA_py > RNA_py toehold exchange. (B) Fluorescence traces for RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py with γ = 5nt, β = 26nt and ε = 4nt for an Inv concentration of 6 nM, across 3 replicates. Solid, coloured lines are normalised fluorescent curves. Dashed, black lines are fits of an ODE model to this data. (C) Experimentally-derived k_eff values for RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py for toehold lengths 4–10nt at a fixed β = 26nt and ε = 4nt. Error bars show 95% confidence intervals based on the standard error of k_eff estimates from individual curves. (D) Experimentally-derived k_eff values for RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py for branch migration domain lengths of 11–26nt for a fixed γ = 4nt and ε = 4nt. Error bars show 95% confidence intervals based on the standard error of k_eff estimates from individual curves. (E) Predicted k_eff from the IEL model (dashed lines) compared to experimental values (solid lines) for toehold lengths (4–10nt). Error bars show 95% confidence intervals of model-predicted k_eff values. (F) Predicted k_eff from the IEL model (dashed lines) compared to experimental values (solid lines) for branch migration domain lengths of 11–26nt. Error bars show 95% confidence intervals of model-predicted k_eff values.

Subsequently, we investigated the effect of branch migration domain length on the rate of strand displacement. A 4nt invader toehold was used and a total of 4 branch migration domain lengths (11nt, 16nt, 21nt and 26nt) were explored (Fig. 4D). A fixed invader toehold of 4nt was selected as our initial model predictions suggested this toehold length would provide the most obvious rate differences between systems with uphill and downhill free-energy landscapes (ESI Note 4 and Fig. S1†). We found that for DNA_py > DNA_py strand displacement reactions, k_eff decreased by a factor of 15 for an approximate 2.5-fold increase in branch migration domain length. This fold-change is slightly larger than we would predict by eqn (16), although this is likely explained by variability between sequences. More importantly, k_eff for DNA_py > RNA_py strand displacement only decreased by a factor of 5.7 between 11nt and 26nt, suggesting that k_eff is less dependent on the length of the branch migration domain as compared to DNA_py > DNA_py strand displacement. Finally, the rate of RNA_py > DNA_py strand displacement decreased by a factor of 54 for an increase in the branch migration domain length between 11nt and 26nt, which is significantly larger than for DNA_py > DNA_py strand displacement.

Using the experimental data for RNA_py > DNA_py, DNA_py > DNA_py and DNA_py > RNA_py strand displacement systems we were able to parameterise our adapted IEL model and estimate ΔG_rd. The available kinetic data did not strongly constrain all IEL parameters (Table S16†), as such we performed a 3-parameter fit to 24 rate constants. The predicted k_eff values obtained from the fully parameterised model showed good fit to the experimental kinetic data across both toehold length and branch migration domain length (Fig. 4E and F). The estimated free-energy parameters and rate parameter are given in Table 1. These parameters are in good agreement with previous values of DNA > DNA strand displacement reactions.^10,11 For the low RNA purine content system we extracted a ΔG_rd value of 0.16k_BT. This value of ΔG_rd for RNA_py > DNA_py strand displacement predicts an overall uphill (although notably shallow) free-energy landscape during branch migration. In contrast, our model predicts an overall downhill free-energy landscape for DNA_py > RNA_py branch migration with a ΔG_rd value of the same magnitude.

Next we probed the rate of strand displacement for an independent design with alternative sequence composition. Specifically, we investigated the effect of high purine content (78%) within the RNA invader strand (Fig. 5A). We studied 3 different branch migration domain lengths (11nt, 16nt and 21nt) for a fixed invader toehold length of 4nt. As above, we estimated k_eff for each system by fitting an ODE model to the fluorescence traces (Fig. 5B). In complete contrast to the low RNA purine content design, we found that RNA_pu > DNA_pu strand displacement was up to an order of magnitude faster than DNA_pu > DNA_pu strand displacement, while DNA_pu > RNA_pu strand displacement was over an order of magnitude slower than DNA_pu > DNA_pu strand displacement (Fig. 5C). For DNA_pu > DNA_pu strand displacement we see a reduction in k_eff by a factor of 1.7 between branch migration domain lengths of 11nt and 21nt, in accordance with eqn (16). For an approximate doubling in the branch migration domain length, we observe no decrease or even a slight increase in k_eff for RNA_pu > DNA_pu strand displacement (k_eff = 9.670 × 10⁴ M⁻¹ s⁻¹ for β = 11nt and k_eff = 1.640 × 10⁵ M⁻¹ s⁻¹ for β = 21nt), indicating that k_eff is effectively independent of branch migration domain length across this range. Finally, we observe a reduction in k_eff by a factor of 4.8 for DNA_pu > RNA_pu strand displacement for an increase in β between 11nt and 16nt. Notably, we were unable to explicitly estimate k_eff for β = 21nt because the reaction speed was slow enough for an undesired leak reaction to obfuscate the reaction kinetics of interest. Numerical estimates of k_eff for RNA_pu > DNA_pu, DNA_pu > DNA_pu and DNA_pu > RNA_pu strand displacement reactions are given in Tables S13 and S15.†


	Fig. 5 RNA > DNA and DNA > RNA toehold exchange kinetics for high RNA purine content design. (A) Schematic of RNA_pu > DNA_pu and DNA_pu > RNA_pu toehold exchange. (B) Example fluorescent traces for RNA_pu > DNA_pu, DNA_pu > DNA_pu, DNA_pu > RNA_pu toehold exchange system with γ = 4nt, β = 11nt and ε = 4nt for an Inv concentration of 6 nM (for RNA_pu > DNA_pu, DNA_pu > DNA_pu) or 60 nM (DNA_pu > RNA_pu) across 3 replicates. Solid, coloured lines represent normalised fluorescent curves. Dashed, black lines are fits of an ODE model to this data. (C) Summary of experimentally-derived k_eff values for RNA_pu > DNA_pu, DNA_pu > DNA_pu and DNA_pu > RNA_pu toehold exchange reactions across a range of branch migration domain lengths (11–21nt) for fixed γ = 4nt and ε = 4nt. For the DNA_pu > RNA_pu toehold exchange reaction with β = 21nt, k_eff could not be reliably estimated. Error bars show 95% confidence intervals based on the standard error of k_eff estimates from individual curves. (D) Predicted k_eff from parameterised IEL model (dashed lines) compared to experimentally-derived values (solid lines) across branch migration domain lengths (11–21nt). Error bars show 95% confidence intervals of model predicted k_eff values.

Using the experimental data from the high purine content design, we reparameterised the IEL model using the same approach as above. We fixed values for ΔG_bp, ΔG_assoc and ΔG_p as above. We also fixed ΔG_bm at the value predicted from experimental rate data of the low RNA purine content design (Table 1). We allowed both k_bp and ΔG_rd to be flexible parameters and extracted estimates of 3.0 (CI: 1.9; 4.5) × 10⁷ s⁻¹ and −0.25 ± 0.07k_BT for these parameters, respectively. This parameterisation provided k_eff predictions that were in line with the experimental kinetic data for RNA_pu > DNA_pu and DNA_pu > DNA_pu strand displacement (Fig. 5D). However, for DNA_pu > RNA_pu strand displacement the model predicts higher k_eff values, although the trend across branch migration domain length is correctly predicted. Notably, the estimated value of |ΔG_rd| is slightly larger than that predicted from the low RNA purine content experimental data, but in this case ΔG_rd is negative. We identified alternative fits and ΔG_rd estimates by fixing different free-energy parameters but importantly ΔG_rd is estimated to be negative across all parameterisations (Fig. S21 and Table S17†). In contrast to the low RNA purine content design, these estimates of ΔG_rd predict an overall downhill (although again notably shallow) free-energy landscape during branch migration for RNA_pu > DNA_pu strand displacement. Meanwhile, this model parameterisation predicts an overall uphill free-energy landscape for DNA_pu > RNA_pu branch migration.

It is interesting to ask whether the observed behaviour is robust to changes in experimental conditions. In particular, while high monovalent salt is common in nanotechnological settings, lower salt concentrations may be relevant to biological or biologically-derived systems, and high divalent salt concentrations are typically employed if large nanostructures such as DNA origami are present. In ESI Note 10, Fig. S22 and S23,† we show results for DNA_py > DNA_py, DNA_py > RNA_pu, RNA_py > DNA_py, DNA_pu > DNA_pu, DNA_pu > RNA_pu and RNA_pu > DNA_pu reactions for systems with γ = 4nt, ε = 4nt and β = 11nt at both 100 mM NaCl and 10 mM MgCl₂. Although we have not performed a complete set of carefully calibrated experiments under these conditions, the results in ESI Note 10† are consistent with a picture in which RNA > DNA is relatively favoured for low purine content in the RNA strand, and relatively disfavoured for high purine content. At 10 mM MgCl₂, however, we observe that DNA > RNA reactions are suppressed in general, and that all reactions are slowed in 100 mM NaCl relative to 1 M NaCl.

Conclusions

In this work we experimentally characterised RNA > DNA and DNA > RNA strand displacement systems, which revealed notable differences in kinetics between DNA > DNA and RNA/DNA hybrid strand displacement systems, with important implications for the rational design of hybrid reaction networks. We have developed a parameterised free-energy landscape model which accurately predicts strand displacement kinetics for the test hybrid systems employed herein, given a fixed branch migration domain purine content. Most importantly, we demonstrate a strong sequence–dependence for the RNA > DNA and DNA > RNA strand displacement reactions that we tested. We highlight the importance of purine content within the RNA strand in determining the relative stability of RNA–DNA hybrids compared to DNA–DNA duplexes. We found that between high (78%) and low (17%) purine content within the RNA strand, estimated ΔG_rd values shifted from −0.25k_BT to 0.16k_BT. These results support conclusions from previous thermodynamic studies into RNA–DNA hybridisation, which suggest that a high purine content in the RNA strand of an RNA–DNA complex results in increased stability compared to DNA–DNA duplexes while low RNA purine content results in reduced stability of RNA–DNA complexes.^36–39

Banerjee et al.³⁶ report RNA–DNA hybrid nearest neighbour (NN) parameters at 1 M NaCl. These can be compared to the DNA–DNA NN parameters of SantaLucia.⁴⁵ To get a rough idea of the relative stability of duplexes, one can average over the contribution of each base pair step at 298 K, weighted so as to produce the correct proportion of purine/pyrimidine bases in each strand. Doing so predicts ΔG_rd values of +0.24k_BT for 17% purine content in the RNA strand and −0.51k_BT for 78% purine content in the RNA strand. These values are larger than those predicted in this work and would therefore predict more extreme kinetic behaviour for hybrid systems. It is worth noting that only a limited number of data points were used to derive these hybrid NN parameters and as such extreme behaviour is likely not well accounted for. Importantly, these NN parameters indicate that high purine content within the RNA strand should give a larger effect than low purine content in the RNA strand,³⁶ which is also predicted from our experiments. Notably, there is some suggestion that the relative stability of RNA–DNA hybrids as compared to DNA–DNA complexes is also dependent on the G/C content of the displacement domain, particularly for sequences with high G/C content.³⁹ Our study was limited to systems with approximately 50% GC content within the branch migration domain so future work should investigate how these sequence composition factors interact to determine the kinetics of strand displacement reactions.

Although the estimated ΔG_rd values of −0.25k_BT and 0.16k_BT are relatively small, our results suggest these free-energy differences per base pair can compound over the whole landscape giving orders of magnitude differences in the rate of RNA > DNA strand displacement compared to DNA > RNA strand displacement. Moreover, changing the sequence composition of the RNA from 17% purine content to 78% purine content resulted in a difference in rate of up to 3 orders of magnitude, suggesting that this factor could be a critical parameter in the rational design of RNA/DNA hybrid strand displacement systems and facilitate tuneable reaction kinetics. Indeed, requesting minimal secondary structure within NUPACK typically yields sequence designs excluding G or C nucleotides,⁴¹ and therefore by default these sequences exhibit extremes in purine content. As such our work is particularly relevant to synthetic designs within the low secondary structure constraint.

With knowledge of the underlying free-energy landscapes of hybrid strand displacement, other tools, including use of mismatches, could be exploited for further fine-tuned kinetic control.^11,12 Given that the predicted ΔG_rd values are relatively small, elimination of mismatches within the incumbent–substrate complex could be employed to counter the observed uphill free-energy landscapes. We also suggest that future studies could explicitly investigate the effect of mismatches within hybrid systems to gain a better understanding how this might differ from DNA > DNA strand displacement reactions.

The vital role of RNA in the correct functioning of intracellular processes makes RNA a critical target for both diagnostics and therapeutics. Putative detection or therapeutic systems based on strand displacement by RNA may be compromised by the sequence-dependent biases for and against RNA invasion of DNA duplexes observed here. On the one hand, if an RNA strand struggles to complete displacement, it will be difficult to detect. On the other, if the RNA strand completes displacement too easily, it will be more difficult to distinguish perfectly matching sequences from single nucleotide mutants. While real, clinically relevant RNA sequences may not fall into the purine content extremes, we propose that our work may provide informative limits on the possible behaviours of these systems.

Future work should focus on exploring the sequence dependence of RNA/DNA strand displacement, further looking to confirm whether the behaviour revealed here applies across all sequences and experimental conditions. We note that, consistent with our preliminary data, nearest neighbour models predict a qualitatively similar sequence dependence of ΔG_rd at 100 mM and 1 M NaCl.^36,45 Indeed, the key question is whether nearest neighbour models of DNA/DNA and DNA/RNA duplexes, in their current parameterisation, are sufficient to provide either quantitative or semi-quantitative predictions for the sequence dependence of hybrid strand displacement rates. If so, relatively few experiments would be needed to establish a useful predictive tool, based on the free-energy landscape model with a slope determined by the nearest neighbour models. If not, a far larger set of experiments will be necessary to characterise the range of possible behaviour.

Author contributions

F. G. S., J. P. G., M. M. S. and T. E. O. devised the project. F. G. S. performed the experiments (except experiments performed at 100 mM NaCl and 10 mM MgCl₂, which were performed by K. J.). F. G. S. performed the data fitting, and with support from J. P. G. and T. E. O., analysed the data. F. G. S., J. P. G., K. J., M. M. S. and T. E. O. wrote the manuscript.

Data availability

All raw data, fitting and parameterisation code are freely available at https://doi.org/10.5281/zenodo.10090783.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

This work is supported by UK Engineering and Physical Sciences Research Council (EPSRC) [EP/S022856/1 to F. G. S.]; the Royal Academy of Engineering Chair in Emerging Technologies award [CiET2021\94 to M. M. S.]; a Royal Society PhD studentship to K. J., and a Royal Society University Research Fellowship to T. E. O. The authors would like to acknowledge Dr Rakesh Mukherjee for his assistance in checking the data and code associated with this work.

References

K. M. Cherry and L. Qian, Nature, 2018, 559, 370–376 CrossRef CAS PubMed.
H. Su, J. Xu, Q. Wang, F. Wang and X. Zhou, Nat. Commun., 2019, 10, 5390 CrossRef PubMed.
J. K. Jung, C. M. Archuleta, K. K. Alam and J. B. Lucks, Nat. Chem. Biol., 2022, 18, 385–393 CrossRef CAS PubMed.
D. Y. Zhang, A. J. Turberfield, B. Yurke and E. Winfree, Science, 2007, 318, 1121–1125 CrossRef CAS PubMed.
D. Y. Zhang and E. Winfree, J. Am. Chem. Soc., 2009, 131, 17303–17314 CrossRef CAS PubMed.
L. Oesinghaus and F. C. Simmel, Nat. Commun., 2019, 10, 2092 CrossRef PubMed.
S. F. J. Wickham, J. Bath, Y. Katsuda, M. Endo, K. Hidaka, H. Sugiyama and A. J. Turberfield, Nat. Nanotechnol., 2012, 7, 169–173 CrossRef CAS PubMed.
Y. Guo, M. Wang, F. Shen, Z. Hu, H. Ding, W. Yao and H. Qian, Anal. Methods, 2021, 13, 447–452 RSC.
M. Mohammadniaei, M. Zhang, J. Ashley, U. B. Christensen, L. J. Friis-Hansen, R. Gregersen, J. G. Lisby, T. L. Benfield, F. E. Nielsen, J. Henning Rasmussen, E. B. Pedersen, A. C. R. Olinger, L. T. Kolding, M. Naseri, T. Zheng, W. Wang, J. Gorodkin and Y. Sun, Nat. Commun., 2021, 12, 5089 CrossRef CAS PubMed.
P. Irmisch, T. E. Ouldridge and R. Seidel, J. Am. Chem. Soc., 2020, 142, 11451–11463 CrossRef CAS PubMed.
R. R. F. Machinek, T. E. Ouldridge, N. E. C. Haley, J. Bath and A. J. Turberfield, Nat. Commun., 2014, 5, 1–9 Search PubMed.
N. E. C. Haley, T. E. Ouldridge, I. Mullor Ruiz, A. Geraldini, A. A. Louis, J. Bath and A. J. Turberfield, Nat. Commun., 2020, 11, 2562 CrossRef CAS PubMed.
A. J. Genot, D. Y. Zhang, J. Bath and A. J. Turberfield, J. Am. Chem. Soc., 2011, 133, 2177–2182 CrossRef CAS PubMed.
W. Deng, J. Y. Xu, H. Peng, C. Z. Huang, X. C. Le and H. Zhang, Biosens. Bioelectron., 2022, 217, 114704 CrossRef CAS PubMed.
A. J. Genot, J. Bath and A. J. Turberfield, J. Am. Chem. Soc., 2011, 133, 20080–20083 CrossRef CAS PubMed.
G. A. Wang, X. Wu, F. Chen, C. Shen, Q. Yang and F. Li, J. Am. Chem. Soc., 2023, 145, 2750–2753 CrossRef CAS PubMed.
F. Wang, H. Lv, Q. Li, J. Li, X. Zhang, J. Shi, L. Wang and C. Fan, Nat. Commun., 2020, 11, 121 CrossRef CAS PubMed.
G. Chatterjee, Y.-J. Chen and G. Seelig, ACS Synth. Biol., 2018, 7, 2737–2741 CrossRef CAS PubMed.
S. Chen and T. Hermann, J. Am. Chem. Soc., 2021, 143, 20356–20362 CrossRef CAS PubMed.
W.-J. Wang, J. Lin, C.-Q. Wu, A.-L. Luo, X. Xing and L. Xu, Nucleic Acids Res., 2023, 51, 7691–7703 CrossRef CAS PubMed.
R. Lopez, R. Wang and G. Seelig, Nat. Chem., 2018, 10, 746–754 CrossRef CAS PubMed.
A. Bayés-Genis, D. E. Lanfear, M. W. J. de Ronde, J. Lupón, J. J. Leenders, Z. Liu, N. P. A. Zuithoff, M. J. C. Eijkemans, E. Zamora, M. De Antonio, A. H. Zwinderman, S.-J. Pinto-Sietsma and Y. M. Pinto, Eur. J. Heart Failure, 2018, 20, 67–75 CrossRef PubMed.
H. Fang, J. Xie, M. Zhang, Z. Zhao, Y. Wan and Y. Yao, Am. J. Transl. Res., 2017, 9, 953–961 CAS.
R. Hernández, E. Sánchez-Jiménez, C. Melguizo, J. Prados and A. R. Rama, BMB Rep., 2018, 51, 563–571 CrossRef PubMed.
N. van Boven, I. Kardys, L. C. van Vark, K. M. Akkerhuis, M. W. J. de Ronde, M. A. F. Khan, D. Merkus, Z. Liu, A. A. Voors, F. W. Asselbergs, E.-J. van den Bos, E. Boersma, H. Hillege, D. J. Duncker, Y. M. Pinto and D. Postmus, Eur. J. Heart Failure, 2018, 20, 89–96 CrossRef CAS PubMed.
Z. Zhou, H. Xiong, F. Xie, Z. Wu and Y. Feng, Front. Neurol., 2020, 11, 132 CrossRef PubMed.
S. Paik, S. Shak, G. Tang, C. Kim, J. Baker, M. Cronin, F. L. Baehner, M. G. Walker, D. Watson, T. Park, W. Hiller, E. R. Fisher, D. L. Wickerham, J. Bryant and N. Wolmark, N. Engl. J. Med., 2004, 351, 2817–2826 CrossRef CAS PubMed.
X. Tang, Y. Wang, L. Zhou, W. Zhang, S. Yang, L. Yu, S. Zhao, K. Chang and M. Chen, Microchim. Acta, 2020, 187, 172 CrossRef CAS PubMed.
Y.-Y. Zhou, G.-F. Li, R.-X. Ma, Y. Lin, J.-W. Wu, Y.-Y. Wu, J. Yan, S.-G. Liu, X.-C. Tan and K.-J. Huang, Anal. Chem., 2023, 95, 14052–14060 CrossRef CAS PubMed.
P. B. Chen, H. V. Chen, D. Acharya, O. J. Rando and T. G. Fazzio, Nat. Struct. Mol. Biol., 2015, 22, 999–1007 CrossRef CAS PubMed.
T. G. Fazzio, Transcription, 2016, 7, 121–126 CrossRef CAS PubMed.
S. Toubiana and S. Selig, FEBS J., 2018, 285, 2552–2566 CrossRef CAS PubMed.
A. Brambati, L. Zardoni, E. Nardini, A. Pellicioli and G. Liberi, Mutat. Res., Rev. Mutat. Res., 2020, 784, 108300 CrossRef CAS PubMed.
F. Hong and P. Šulc, J. Struct. Biol., 2019, 207, 241–249 CrossRef CAS PubMed.
M. Klein, B. Eslami-Mossallam, D. G. Arroyo and M. Depken, Cell Rep., 2018, 22, 1413–1423 CrossRef CAS PubMed.
D. Banerjee, H. Tateishi-Karimata, T. Ohyama, S. Ghosh, T. Endoh, S. Takahashi and N. Sugimoto, Nucleic Acids Res., 2020, 48, 12042–12054 CrossRef CAS PubMed.
J. I. Gyi, A. N. Lane, G. L. Conn and T. Brown, Biochemistry, 1998, 37, 73–80 CrossRef CAS PubMed.
E. Hantz, V. Larue, P. Ladam, L. Le Moyec, C. Gouyette and T. Huynh Dinh, Int. J. Biol. Macromol., 2001, 28, 273–284 CrossRef CAS PubMed.
E. A. Lesnik and S. M. Freier, Biochemistry, 1995, 34, 10807–10815 CrossRef CAS PubMed.
H. Liu, F. Hong, F. Smith, J. Goertz, T. Ouldridge, M. M. Stevens, H. Yan and P. Šulc, ACS Synth. Biol., 2021, 10, 3066–3073 CrossRef CAS PubMed.
J. N. Zadeh, C. D. Steenberg, J. S. Bois, B. R. Wolfe, M. B. Pierce, A. R. Khan, R. M. Dirks and N. A. Pierce, J. Comput. Chem., 2011, 32, 170–173 CrossRef CAS PubMed.
J. Cabello-Garcia, W. Bae, G.-B. V. Stan and T. E. Ouldridge, ACS Nano, 2021, 15, 3272–3283 CrossRef CAS PubMed.
N. Srinivas, T. E. Ouldridge, P. Šulc, J. M. Schaeffer, B. Yurke, A. A. Louis, J. P. K. Doye and E. Winfree, Nucleic Acids Res., 2013, 41, 10641–10658 CrossRef CAS PubMed.
C. Li, Z. Li, W. Han, X. Yin, X. Liu, S. Xiao and H. Liang, Chem. Commun., 2022, 58, 5849–5852 RSC.
J. SantaLucia Jr and D. Hicks, Annu. Rev. Biophys. Biomol. Struct., 2004, 33, 415–440 CrossRef PubMed.

Footnote

† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4nr00542b

Click here to see how this site uses Cookies. View our privacy policy here.