Open Access Article
Jory Lietard*a, Dominik Ameur a and Mark M. Somoza*abc
a Institute of Inorganic Chemistry, University of Vienna, Althanstraße 14, 1090 Vienna, Austria. E-mail: jory.lietard@univie.ac.at; mark.somoza@univie.ac.at
b Chair of Food Chemistry and Molecular Sensory Science, Technical University of Munich, Lise-Meitner-Straße 34, 85354 Freising, Germany
c Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Lise-Meitner-Straße 34, 85354 Freising, Germany
First published on 16th February 2022
Fluorescein is commonly used to label macromolecules, particularly proteins and nucleic acids, but its fluorescence is known to be strongly dependent on its direct chemical environment. In the case of fluorescein-labeled nucleic acids, nucleobase-specific quenching originating in photoinduced charge transfer interactions results in sequence-dependent chemical environments. The resulting sequence specificity of fluorescence intensities can be used as a proximity detection tool, but can also lead to biases when the abundance of labeled nucleic acids is quantified by fluorescence intensity. Here we comprehensively survey how DNA sequences affect fluorescence intensity by preparing permutational libraries containing all possible 5mer contexts of both single-stranded and double-stranded DNA 3′ or 5′ end labeled with fluorescein (6-carboxyfluorescein, FAM). We observe the expected large quenching of fluorescence with guanine proximity but also find more complex fluorescence intensity changes depending on sequence contexts involving proximity to all four nucleobases. A terminal T (T > A ≈ C ≫ G) results in the strongest fluorescence signal in both 3′ and 5′ labeled single strands, whereas in double-stranded DNA the strongest signal comes from a terminal C (C ≫ T > A ≫ G). Therefore, in dsDNA, the terminal G·C base pair largely controls the intensity of fluorescence emission depending on which of these two nucleotides the dye is attached to. Our data confirm the importance of guanine in fluorescence quenching while pointing towards an additional mechanism beyond the redox potential of DNA bases in modulating fluorescein intensity in both single- and double-stranded DNA. This study should help in designing better nucleic acid probes that can take sequence-dependent quenching effects into account.
000 M−1 cm−1), very large quantum yield (ϕ = 0.92),1 good photostability and solubility in aqueous media. Fluorescein was used in general staining approaches before becoming a macromolecular labeling method, allowing the tracking and quantification of proteins and nucleic acids. Structural derivatives of fluorescein are commonly used as reversible fluorophore tags on nucleoside triphosphates, a pivotal aspect of next-generation sequencing and in vitro DNA polymerization.2–4 Fluorescein labeling can be conveniently carried out with an isothiocyanate functional group (FITC) or, in DNA and RNA labeling, during solid-phase synthesis by coupling a phosphoramidite version of fluorescein (6-carboxyfluorescein, 6-FAM). The fluorescence properties of fluorescein vary according to changes in the environment. The dye is most notably sensitive to pH variations, with the highest absorption of 490 nm light at pH > 7 and a progressive decrease in fluorescence response with decreasing pH, which can be understood by the opening of the spirolactone function attached to the xanthene moiety.5 Because of its pH-dependent behavior, fluorescein has also found applications in the monitoring of very fine intracellular pH changes.6 While the fluorescence properties of free fluorescein have been extensively studied, much less work has been devoted to studying fluorescein in the context of dye-labeled molecules, specifically how the local chemical environment can affect the fluorescence response. This potential modulation of the absorption and emission properties of fluorescein is important to consider and is particularly relevant when emission can be correlated to concentration, as in nucleic acid quantification and sequencing, but also in more complex photophysical systems based on Förster resonance energy transfer (FRET).7
Indeed, fluorescein is a common fluorophore in FRET pairs, but its donor and acceptor properties are sensitive to the nucleic acid environment. The change from single-stranded to double-stranded DNA was found to result in a ∼1/3 decrease in fluorescence intensity8,9 and quantum yield is reduced when labeling occurs at the 3′ end.10–12 Sequence-dependent fluorescence intensity in oligonucleotides is a well-documented phenomenon affecting most chromophores, although the specific mechanisms are varied. In the case of quenching via photoinduced electron transfer from nucleobases, proximity to guanine, as the most oxidizable nucleobase, is the largest contributor. Such quenching is also observed in many commonly used chromophores such as coumarin,13 porphyrin14 and rhodamine15 derivatives, as well as others.16,17 Further distal guanine bases also contribute to quenching but to a lesser extent.18 Guanosine-mediated fluorescence quenching proceeds via photoinduced electron transfer (PET) between the fluorophore and a proximal electron-donor guanine. Since PET efficiency correlates with redox potential at the donor/acceptor level, PET-quenching of fluorescence should follow the order of increasing nucleobase redox potential,19 dG ≪ dA < dT ≈ dC, but there is limited data availability on the sequence-specific modulation of fluorescence and any study would likely need to distinguish between 5′ and 3′ labeling, and between single-stranded and double-stranded systems.
We previously investigated the sequence dependence of cyanine dyes in all possible 5mer ss- and dsDNA contexts and revealed how nucleobase identity further away from the dye also affects Cy3/Cy5 fluorescence intensity, as well as that of the structurally similar DyLight DY547 and DyLight DY647.20–22 For fluorescein, the modulation of fluorescence by neighboring bases is expected to significantly differ from cyanine dyes, as a π-stacking contribution to fluorescein's interaction with DNA has not been previously documented. Indeed, fluorescence anisotropy measurements show that rotational motion of the fluorescein dye is decoupled from that of the labeled DNA, indicating that fluorescein rotates independently from the nucleic acid molecule.23 This effect could be seen as the consequence of the opening of the spirolactone ring at physiological pH, creating not only freedom of rotation about the xanthene–phenolic system but a negatively charged carboxylate as well, which creates a source of electrostatic repulsion with nearby phosphodiester groups,24 in clear contrast to positively-charged cyanine fluorophores.
Herein, we explore how the fluorescence properties of fluorescein can be affected by five consecutive DNA nucleotides beginning immediately proximal to the dye, by synthesizing all possible sequence permutations (4⁵, or 1024 unique pentanucleotides) in all terminal labeling conditions, that is, 3′, 5′, ssDNA and dsDNA formats. To do so, we synthesized the DNA oligonucleotides by nucleic acid photolithography in both the 3′ → 5′ and 5′ → 3′ directions, with a final, terminal fluorescein coupling using the 6-fluorescein phosphoramidite (6-FAM).25 Changes in fluorescence across the surface of the nucleic acid array indicate which nucleobase or ordered combination of nucleobases has the strongest effect on fluorescein emission. As expected, we found that proximal G and G-rich sequences at both the 5′ and 3′ ends of oligonucleotides strongly predict fluorescence quenching. However, the sequence-dependent fluorescence cannot be fully explained by the nucleobase oxidation potential dG ≪ dA < dT ≈ dC. Instead, we measure FAM fluorescence quenching following the order G ≫ C ≈ A ≫ T for 5′ labeled single-stranded DNA, G ≫ C ≈ A > T for 3′ labeled single-stranded DNA, and G ≫ A ≈ T ≫ C for double-stranded DNA. Our data suggest that a redox mechanism alone is insufficient to explain fluorescein fluorescence quenching in DNA. These results should provide comprehensive guidance for better fluorescein nucleic acid probes that can take sequence-specific variations into account.
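The size of the permutational library described above can be checked with a short enumeration. The following is only an illustrative sketch in Python: the function name `all_kmers` and the variable `library` are hypothetical and are not taken from the authors' array design software.

```python
from itertools import product

BASES = "ACGT"  # the four DNA nucleobases


def all_kmers(k=5, alphabet=BASES):
    """Return every sequence of length k over the alphabet, in lexicographic order."""
    return ["".join(p) for p in product(alphabet, repeat=k)]


# Hypothetical name for the full 5mer library; each entry would be
# synthesized once per labeling context on the array.
library = all_kmers(5)

print(len(library))             # 4**5 unique pentanucleotides
print(library[0], library[-1])  # first and last entries of the enumeration
```

The same list can serve as the key set for any table of per-sequence intensities, one table per labeling context.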
:
1 ethylenediamine/ethanol for 2 hours at rt (12 h for 5′ → 3′ synthesized DNA), washed with distilled water twice and then with 50 mM phosphate buffer at pH 7.6 (PBS) before being spun dry. The dsDNA libraries were self-annealed by heating the microarray in PBS buffer at 50 °C and slowly cooling it down to rt. After 30 min, the array was briefly washed in 1× sodium citrate buffer then spun dry.
We first looked at how the fluorescence intensities of fluorescein are modulated by the sequence context immediately adjacent to the dye, initially in single-stranded format, and then when the dye is attached to double-stranded DNA. We find that sequences interact differently with fluorescein, creating a large range of fluorescence intensities across the 1024 different combinations. In both the 5′- and 3′-labeled oligonucleotide series, the distribution of fluorescence intensities adopts a sigmoidal shape, with a difference of almost 55% between the brightest and darkest 5′-fluorescein-labeled sequence combinations and a somewhat smaller difference in the case of 3′-fluorescein-labeled oligonucleotides, a maximum of 45% quenching relative to the brightest sequence (Fig. 2B). This dynamic range of fluorescence is in line with our previous observations on Cy3 and Cy5 dyes on similarly complex DNA libraries,20–22 indicating that the extent of fluorescence quenching in xanthene-like structures is comparable to that in cyanine derivatives. At the top end of fluorescence intensity, we identify the 5mers 5′-TTTTT and 3′-CTTTC (3′-TTTTT being a close third). At the lower end of the intensity range, we find 5′-GGGGG and 3′-GGGGC. Clearly, T-proximal single-stranded DNA sequences minimally quench fluorescence while G-rich elements near fluorescein lead to the greatest loss of fluorescence, largely as expected from the known mechanism of photoinduced electron transfer between the fluorophore and a proximal guanine as electron donor.
To assess the sequence dependence derived from these very large datasets, we divided the span of fluorescence intensities into octiles of equal intensity range and looked for sequence motifs. Sequence logos were generated for each octile and arranged by intensity (Fig. 2A and C). In both 5′ and 3′ labeling, T-rich sequence combinations populate the high fluorescence intensities while their G-rich counterparts are very likely to be found in the low-fluorescence data. The top and bottom 1% of fluorescence intensity very clearly show the predominance of T and G nucleotides at the extremes of the intensity range. There is a loss of consensus in the middle range of fluorescence, indicating that the sigmoidal curves can be interpreted as cumulative distribution functions with relative fluorescence as the variable. More intuitively, this pattern arises because most sequences from the full permutational library are composed of a mix of nucleobases associated with both high and low fluorescence, whereas only a few sequences are composed primarily of T or of G. Unsurprisingly, and consistent with the electron transfer mechanism, the nucleotide immediately adjacent to the dye is the most important with regard to modulating fluorescence properties in both 5′ and 3′ labeling, and the identity of the nucleobases further away from the terminal nucleotide quickly becomes less relevant.
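The octile analysis described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' analysis pipeline: the function names are hypothetical, the intensity values in `toy` are invented placeholders rather than measured data, and the per-position base fractions stand in for the input that a sequence-logo tool would take.

```python
from collections import Counter


def octile_index(value, lo, hi, n_bins=8):
    """Equal-width bin number: 1 = brightest range, n_bins = darkest range."""
    if hi == lo:
        return 1
    frac = (hi - value) / (hi - lo)  # 0.0 for the brightest, 1.0 for the darkest
    return min(int(frac * n_bins) + 1, n_bins)


def bin_by_octile(intensities):
    """Group sequences into eight bins of equal intensity range."""
    lo, hi = min(intensities.values()), max(intensities.values())
    bins = {i: [] for i in range(1, 9)}
    for seq, val in intensities.items():
        bins[octile_index(val, lo, hi)].append(seq)
    return bins


def position_frequencies(seqs):
    """Per-position base fractions for a group of equal-length sequences."""
    if not seqs:
        return []
    freqs = []
    for pos in range(len(seqs[0])):
        counts = Counter(s[pos] for s in seqs)
        freqs.append({b: counts.get(b, 0) / len(seqs) for b in "ACGT"})
    return freqs


# Invented placeholder intensities (NOT measured values), for illustration only.
toy = {"TTTTT": 1.00, "TTTTG": 0.95, "CACCA": 0.70, "GGGGT": 0.47, "GGGGG": 0.45}
bins = bin_by_octile(toy)
darkest = position_frequencies(bins[8])
print(bins[1], bins[8])
print(darkest[0])  # base fractions at the position written first in each sequence
```

Because the bins have equal intensity width rather than equal population, the middle octiles of a real dataset would hold many more sequences than the extremes, which is consistent with the loss of consensus noted in the text.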
The sequence dependence of fluorescein in single-stranded DNA correlates fairly well with our initial observations with terminal nucleotides alone and generally agrees with the expectation that nucleobase redox potential is the most important physicochemical parameter to consider when studying fluorescence quenching in fluorescein. This mechanism, however, would predict that both pyrimidines affect the fluorescence response the least, while our data demonstrate a strong preference for thymine only. Cytosine-rich combinations (≥4 dC) result in similar fluorescence to dA-rich combinations, in both cases much lower than dT-rich and significantly higher than dG-rich sequence combinations. In terms of nucleobase abundance alone, quenching follows the order dG ≫ dC ≈ dA ≫ dT for 5′ labeling and dG ≫ dC ≈ dA > dT for 3′ labeling, the former reflecting the clear dominance of dT in the set of highly fluorescent 5′-labeled fluorescein DNA conjugates. These patterns agree with the dG ≪ dA < dT ≈ dC order expected from the redox potentials mainly with respect to dG and its associated quenching.
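The grouping by nucleobase abundance used above (for example, combinations with ≥4 dC) can be sketched as follows. The helper names `rich_in` and `mean_intensity` are hypothetical, and the intensity mapping is left to the reader's own data; this is an illustration of the selection rule, not the authors' code.

```python
from itertools import product
from statistics import mean

BASES = "ACGT"
LIBRARY = ["".join(p) for p in product(BASES, repeat=5)]  # all 1024 5mers


def rich_in(seqs, base, min_count=4):
    """Select sequences containing at least min_count copies of one base."""
    return [s for s in seqs if s.count(base) >= min_count]


def mean_intensity(seqs, intensities):
    """Mean intensity over the sequences present in the mapping, or None if none are."""
    vals = [intensities[s] for s in seqs if s in intensities]
    return mean(vals) if vals else None


c_rich = rich_in(LIBRARY, "C")
print(len(c_rich))  # 15 sequences with exactly four C, plus CCCCC
```

Comparing `mean_intensity` across the four base-rich groups would give one simple way to rank the bases by abundance, independent of position.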
While the identity of distal nucleotides is less conserved in the brightest and darkest ranges of fluorescence intensity, they do affect the recorded fluorescence signal. The 5′-TGGGG permutation is amongst the bottom 2% of the fluorescence intensity range and 5′-GTAAA is part of the first octile of fluorescence. However, a single T or G placed five nucleotides away from the terminal dye has little influence on quenching, with 5′-GGGGT one of the darkest sequence variants and 5′-TTTTG in the first octile of fluorescence. These observations indicate that, on top of the photoinduced electron transfer taking place at the fluorophore–nucleotide level, the neighboring bases can affect fluorescence intensity. Single-stranded DNA is a flexible molecule with a persistence length on the order of a few nanometers43 (longer than a 5mer), which therefore presupposes that on the length scale of the permuted sequences in our experiments, there exists partial order and base stacking. Such base stacking in ssDNA has been observed experimentally44 and could facilitate charge transport mechanisms between adjacent guanines and through adenine tracts.45,46 The redox potential of cytosine and thymine is too large to allow participation in any charge transfer mechanism. Conversely, the flexibility of ssDNA coupled with the very flexible six-carbon linker to the fluorescein should also permit direct contact between the fluorescein and any of the nucleobases of the permuted 5mer. Since guanosine in each of the five positions quenches fluorescein fluorescence, it is clear that some such charge transfer mechanism to distal guanosines is available. Since quenching by distal guanosines is almost entirely absent in the double-stranded DNA data (see below), we can hypothesize that direct contact between fluorescein and guanosine in any of the five positions, enabled by molecular flexibility, can result in quenching in ssDNA only.
We next looked at double-stranded DNA with 5′-fluorescein adjacent to the 5-basepair-long permutation region (Fig. 3B). Here too, the intensity of fluorescence varies with sequence, with more than 50% variation between the brightest and darkest sequence combinations (Fig. 3). The brightest sequence is 5′-CTACG and the least fluorescent is 5′-GGGCC. As for single-stranded systems, a dG nucleotide next to the dye almost always decreases the fluorescence of fluorescein and can be found in more than 80% of all sequences in the 8th octile of fluorescence. Unlike single-stranded oligonucleotides, however, bright sequence combinations frequently present dC at the 5′ end instead of dT (>2/3 of all sequences in the 1st octile of fluorescence). This observation is more in line with the fairly similar redox potentials of the pyrimidine bases which, based on this metric alone, should indeed predict that neither dC nor dT quenches fluorescence. But it is interesting to note that in this context, the nucleotides are base-paired and a dG·dC base pair can drive the fluorescence of the labeled hairpin towards the bright or the dark region depending on which heterocycle is in direct proximity to the dye. The oxidation potential of a G·C base pair was calculated to be lower than the oxidation potential of dG alone,47,48 suggesting that photoinduced electron transfer via oxidation of the neighboring G base would be facilitated in base-paired systems, which might explain the slightly more dominant presence of G in the most quenching dsDNA combinations. Similarly, the oxidation potential of an A·T base pair was also found to be lower than that of A or T alone, but an A·T base pair at the very end of a dsDNA molecule is more likely to exist as loose nucleobases (“frayed ends”). The fact that a 5′-C in a hairpin system can be assumed to be correctly base-paired, contrary to a 5′-T, could be the reason why C appears at the bright end of the intensity range.
Furthermore, the importance of the nature of the final 5′ nucleotide suggests that the fluorescein molecule mostly interacts with the closest covalently bound nucleotide and does not reach over to the complementary base, nor does it appear to intercalate between base pairs. The identity of the nucleotides further away from the dye does not substantially affect fluorescence intensity, but looking at the ranked list of sequences, the top 5% of fluorescence is very C/T rich, while the bottom 5% is mostly G rich. With pyrimidines consistently found in the top section of the intensity range, it appears that stacking energies, greatest for purines, do not contribute significantly to quenching, even in very rigid double helical structures. The sequence that is brightest in fluorescein-labeled ssDNA falls here in the 2nd octile, 20% darker than the top dsDNA sequence combination.
We also looked at how the sequence context, in the absence of guanine, affects the fluorescence intensity of fluorescein. The results are shown in Fig. 4. With or without G, the strong fluorescence response in single-stranded DNA remains largely dominated by T in proximity to the dye. Low-fluorescence G-free sequences are usually populated with A in 3′-labeled strands (Fig. 4B), in contrast with 5′-labeled strands, where low fluorescence is equally distributed between C- and A-rich DNA (Fig. 4A). The fluorescence intensity falls by ∼40% for 5′-fluorescein and by ∼30% for 3′-fluorescein, indicating that some sequences entirely devoid of guanines can still significantly quench fluorescence, with 5′-CACCA and 3′-AAATT producing the weakest fluorescence of all G-free combinations. Even in the absence of guanine, a clear T → C → A transition is difficult to identify when ranking fluorescence intensities from high to low, as the appearance of C in the low fluorescence regime is concomitant with the appearance of A. A-rich sequences can therefore tune the fluorescence properties of fluorescein in single-stranded formats; indeed, A accounts for >50% of all nucleotides in the 5mers that are at least 20% less fluorescent than the brightest sequence combination. Since T is prominent in the most fluorescent ssDNA sequences, the T linker to the surface may contribute to higher fluorescence; nevertheless, this would not affect our measured sequence dependence as all ssDNA permutations share this same linker.
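The guanine-free analysis above amounts to filtering the library and then counting base composition among the dimmer sequences. A minimal Python sketch follows; the helper names are hypothetical and the intensities in `toy` are invented placeholders, not values from the measured dataset.

```python
from itertools import product

LIBRARY = ["".join(p) for p in product("ACGT", repeat=5)]  # all 1024 5mers


def g_free(seqs):
    """Keep only sequences that contain no guanine."""
    return [s for s in seqs if "G" not in s]


def dim_sequences(intensities, threshold=0.20):
    """Sequences at least `threshold` (as a fraction) darker than the brightest one."""
    top = max(intensities.values())
    return [s for s, v in intensities.items() if v <= (1 - threshold) * top]


def base_fraction(seqs, base):
    """Fraction of all nucleotides in seqs that are the given base."""
    total = sum(len(s) for s in seqs)
    return sum(s.count(base) for s in seqs) / total if total else 0.0


no_g = g_free(LIBRARY)
print(len(no_g))  # 3**5 guanine-free 5mers

# Invented placeholder intensities (NOT measured values), for illustration only.
toy = {"TTTTT": 1.00, "CTTTC": 0.97, "AAATT": 0.70, "CACCA": 0.62}
dim = dim_sequences(toy)
print(dim, base_fraction(dim, "A"))
```

Applied to a real intensity table, `base_fraction(dim, "A")` is one way to express the statement that A dominates the composition of the less fluorescent G-free 5mers.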
In double-stranded DNA, excluding G from the 5mer immediately adjacent to the dye reveals a slightly different picture (Fig. 4C). As was observed in Fig. 3, the final 5′-nucleotide leading to the strongest fluorescence response continues to be C, as opposed to the T seen in single-stranded DNA. Interestingly, low-fluorescence sequence combinations are populated with C as well, but only in the second and third nucleotide positions. Along with A, these sequences at the low end of fluorescein fluorescence produce A/C-rich motifs comparable to those seen in 5′-fluorescein-labeled ssDNA. The fifth nucleotide position, furthest from the dye, here appears to prefer a T for a strong fluorescence response. Such a clear nucleotide preference at a 5 nt distance from terminal labeling is striking, but has been observed before.20 The importance of a T at the 3′-end of the permuted region for high fluorescence intensity may be due to the fluorescein dye not stacking on the terminal 5′ base pair, but rather intercalating further down along the double strand, an effect which could not take place in single-stranded oligonucleotides. The intercalation of the xanthene moiety 5 bp downstream of the 5′ end is conceivable given the flexibility of the C6 aliphatic chain linking the dye to the terminal nucleotide. As with ssDNA, the fluorescein in the dsDNA can also interact with the T linker as illustrated in Fig. 1B, but this interaction is shared among all sequences and therefore does not affect the consensus sequence.
Fig. 5 illustrates how, even in the absence of all guanines, a diminished but still large span of sequence-dependent fluorescence of the fluorescein is retained. The range of intensities comprises a 30% drop for 3′ FAM ssDNA (vs. ∼50% for such sequences including G), and almost 40% for both 5′ FAM ssDNA and 5′ FAM dsDNA (vs. ∼60% and ∼50%, respectively, for the equivalent sequences including G). These numbers, along with the discrepancy between the nucleobase redox potentials of A, C and T and their relative prominence in all of the consensus sequences, suggest that one or more additional mechanisms, superimposed on the photoinduced electron transfer mechanism, are needed to fully explain the sequence dependence of fluorescein end labeling in single- and double-stranded DNA. Alternatively, since experimental values for the oxidation potentials have only been determined for free nucleobases in acetonitrile,13 significant shifts in more natural contexts cannot be excluded. Within DNA, protonation equilibria,49 as well as nucleobase pairing and stacking interactions48 may significantly change these potentials, and these changes themselves are likely to be sequence dependent. Even in single-stranded DNA, which is far less structurally defined than double-stranded DNA, base stacking has been observed experimentally and shown to contribute to its electrostatics and elasticity, two factors which can also contribute to charge transfer efficiency.44
Fig. 5 Sequence-dependent variations in the fluorescence intensity of guanine-free 3′ and 5′ fluorescein-labeled ssDNA as well as 5′ fluorescein-labeled dsDNA. The data and normalization are the same as those in Fig. 3B and 4A, but include only the 243 (3⁵) sequences without G in each DNA context. The 5mers are ranked from most to least intense, with fluorescence falling by ∼30% for 3′-fluorescein-labeled ssDNA, and by ∼40% for 5′-fluorescein-labeled ssDNA and dsDNA.
Footnote
† Electronic supplementary information (ESI) available: Relative fluorescence intensity data for all the experimental sequences in spreadsheet format. See DOI: 10.1039/d2ra00534d
This journal is © The Royal Society of Chemistry 2022