Ida C.
Elle
a,
Kasper K.
Karlsen
a,
Mikkel G.
Terp
b,
Niels
Larsen
c,
Ronni
Nielsen
d,
Nicola
Derbyshire
a,
Susanne
Mandrup
d,
Henrik J.
Ditzel
be and
Jesper
Wengel
*a
aNucleic Acid Center, Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark. E-mail: jwe@sdu.dk; Tel: +45-65502510
bInstitute of Molecular Medicine, University of Southern Denmark, J. B. Winsløwsvej 25, 5000 Odense C, Denmark
cDanish Genome Institute, Skt. Lucas Kirkeplads 8, 8000 Aarhus C, Denmark
dDepartment of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
eDepartment of Oncology, Odense University Hospital, Sdr. Boulevard 29, 5000 Odense C, Denmark
First published on 17th February 2015
LNA-containing DNA aptamers against CD73 (human ecto-5′-nucleotidase), a protein frequently overexpressed in solid tumours, were isolated by SELEX. A pre-defined stem-loop library, containing LNA in the forward primer region, was enriched with CD73 binding sequences through six rounds of SELEX with recombinant his-tagged CD73 immobilised on anti-his plates. Enriched pools isolated from rounds one, three and six were subjected to next-generation sequencing and analysed for enrichment using custom bioinformatics software. The software identified aptamer sequences via the primers and then performed several steps including sequence unification, clustering and alignment to identify enriched sequences. Three enriched sequences were synthesised for further analysis, two of which showed sequence similarities. These sequences exhibited binding to the recombinant CD73 with KD values of 10 nM and 3.5 nM when tested by surface plasmon resonance. Truncated variants of these aptamers and variants where the LNA nucleotides were substituted for the DNA equivalent also exhibited affinity for the recombinant CD73 in the low nanomolar range. In enzyme inhibition assays with recombinant CD73 the aptamer sequences were able to decrease the activity of the protein. However, the aptamers exhibited no binding to cellular CD73 by flow cytometry analysis likely since the epitope recognised by the aptamer was not available for binding on the cellular protein.
LNA (Locked Nucleic Acid) is an RNA-mimic containing a 2′-O,4′-C-methylene bridge, which locks the furanose ring in a C3′-endo conformation.13–15 LNA nucleotides have been proven to improve thermal stability of nucleic acid duplexes and confer nuclease-resistance when incorporated into nucleic acid oligomers.16,17 Aptamers containing LNA nucleotides by post-SELEX modification have been reported (reviewed in16 and18), but the effects of these modifications on aptamer function can be difficult to predict. Therefore in investigating the properties of LNA nucleotides in aptamers, we and others, have focused on selecting aptamers using SELEX methods, front-loaded with LNA and other modified nucleotides.19–23
Commercially available polymerases are capable of reading and incorporating LNA nucleotides,22,24,25 but the efficacy of such reactions is often low. Despite this, Kasahara et al.19 recently reported the selection of aptamers containing LNA nucleotides in the primer and the random region. They used capillary electrophoresis and a modified SELEX protocol, which incorporated a primer extension step in each round. Following eleven rounds of selection, two groups of aptamer candidates were identified by cloning and sequencing of the enriched pool. The aptamer with the highest target affinity was investigated further for the effects of the LNA-nucleotides on its affinity and specificity. For this particular aptamer, the LNA-nucleotides were shown to be necessary for target binding,20 but previous work from the same group has demonstrated that this is not always the case.19
We have attempted to exploit the potential benefits of LNA inclusion in aptamer sequences whilst maintaining a simple SELEX protocol with good pool regeneration at the end of each round. A library was designed containing six LNA nucleotides in the forward primer region (Fig. 1). The design requires the polymerase to read the modified nucleotides but not to incorporate them during library amplification. The primer regions were partially complementary forcing formation of a 7 bp stem containing two of the LNAs, while the remaining four LNAs were free to participate in formation of other secondary structures or in target binding (also Fig. 1). This type of library design promotes the involvement of the primer regions in the secondary structures and potentially in target binding. This means that post-selection removal of primer sequences for truncation would be expected to have detrimental consequences on the folding stability and target binding of the aptamer sequences.
![]() | ||
Fig. 1 Library design used for the selection of CD73 aptamers. Primers were designed to form a pre-determined 7 bp stem containing two LNA-T nucleotides. Complementary regions forming this stem are underlined and LNA nucleotides are in red. The structure of the library was predicted by Mfold26 and is shown above. LNA nucleotides are marked by red circles. |
The specificity of aptamers makes them promising candidates for targeting sub-populations of cells displaying abnormal expression levels of certain proteins; e.g. cancer cells. The ecto-5′-nucleotidase, CD73, is a GPI-anchored, homodimeric, cell surface protein overexpressed in many solid tumours. CD73 hydrolyses extracellular AMP to adenosine, a potent immuno-suppressor. Therefore it has been suggested that tumour-derived CD73 is a mediator of tumour immune-escape.27 Antibody therapy against CD73 expressing cells inhibits breast tumour growth and metastasis,28,29 while RNAi-mediated suppression of CD73 expression induces cell-cycle arrest and apoptosis in the breast-cancer cell line MB-MDA-231.30 An aptamer binding to cellular CD73 could be useful as a direct inhibitor of enzymatic activity or as part of a delivery vehicle for cytotoxic drugs to cancer cells.
Here, LNA-containing aptamers recognising recombinant human CD73 were isolated via six rounds of SELEX coupled to screening for sequence enrichment with next-generation sequencing (NGS) and a customised bioinformatics platform. The isolated sequences exhibited low nanomolar affinity for the recombinant protein by surface plasmon resonance (SPR) analysis, and some also inhibited the recombinant protein's enzyme activity. The same was true of the truncated and modified sequences derived from the full length aptamers. However, none of the sequences displayed binding to cellular CD73 by flow cytometry analysis. SPRi was utilized to partially map the aptamer–protein interaction through a series of control and competition injections and suggests that the binding site of the aptamer sequences is sterically unavailable in the cellular protein.
Round | Target![]() ![]() |
Quick washes (μL) | 5 min washes (μL) | Negative selection |
---|---|---|---|---|
1 | 1![]() ![]() |
5 × 200 | None | Empty plate |
2 | 1![]() ![]() |
8 × 200 | 2 × 200 | Empty plate |
3 | 1![]() ![]() |
4 × 200 | 1 × 200 | Empty plate and His-tagged sortilin |
4 | 1![]() ![]() |
8 × 200 | 2 × 200 | Empty plate |
5 | 1![]() ![]() |
4 × 200 | 1 × 200 | Empty plate |
6 | 1![]() ![]() |
8 × 200 | 2 × 200 | Empty plate and His-tagged sortilin |
Competition elutions were performed in selection buffer with a 10 fold excess of free CD73 compared to the immobilised protein and incubation at 37 °C/30 min/700 rpm. Eluted pools were amplified using KOD XL (Novagen, Merck Millipore, Darmstadt, Germany), FAM-labelled forward primer and biotin-labelled reverse primer in 25 cycles [98 °C/2 min followed by 25 cycles of 98 °C/15 s, 63 °C/15 s, 72 °C/1 min with a final extension step of 72 °C/2 min]. PCR products were verified on 10% denaturing acrylamide gels. Enriched pools were regenerated by strand separation using Dynabeads M-280 streptavidin (Invitrogen, Life Technologies) according to the manufacturer's protocol, and quantified by UV spectroscopy using a NanoDrop 2000 (Thermo Scientific).
We developed custom software for the analysis of sequencing data from aptamer pools. This software cleaved off primer sequences, filtered out sequences of low quality and clustered unique sequences. Information from the resulting sequence tables for each pool analysed, such as sequences and read number were used to calculate enrichment. Using the number of reads for a unique sequence in each round and the total number of reads in that round it was possible to calculate the enrichment of a sequence using ordinary spreadsheet software. We analysed selection pools 1, 3 and 6 from SELEX of a library with a pre-defined stem-loop against the recombinant his-tagged protein CD73. Table 2 shows the ten, overall most abundant sequences and their enrichment factors over the course of selection. Since sequence read length was limited by reagent kits and fell short of a full sequence read, it was necessary to identify pairs of sequences (the complementary sequences to each other converted to the same orientation) in which the central 20 nucleotides were identical and the read counts comparable (Table 3). These pairs of sequences were merged into a single, full length sequence read (Table 4). Whilst this merging of shorter sequencing reads could have been avoided by use of an extra, costly reagent kit in a single sequencing run, the process also served as an additional quality control step.
Read count | Enrichment factora | |||||||
---|---|---|---|---|---|---|---|---|
ID | Round 1 | Round 3 | Round 6 | Total | 1–3 | 3–6 | 1–6 | Sequence of random region |
a Since it is not possible to calculate an enrichment factor from a zero read count, all zero values was substituted with one in the enrichment calculation. | ||||||||
Total | 1![]() ![]() |
125![]() |
53![]() |
1![]() ![]() |
||||
1709 | 371 | 52 | 3 | 426 | 1 | 0 | 0 | CGTGGGTGGGGAGGGTGGCTAGTTA |
1157 | 222 | 21 | 0 | 243 | 1 | 0 | 0 | AGCGTCGTGGGTGGGGAGGGTGGCT |
1156 | 26 | 14 | 32 | 72 | 5 | 5 | 25 | GAGAGCGCATGGACCAACGCTAGTG |
1155 | 0 | 1 | 51 | 52 | 9 | 121 | 1044 | GGGTGGGTAGGGTGGGCAGGAACGT |
1154 | 14 | 17 | 16 | 47 | 11 | 2 | 23 | CGCATGGACCAACGCTAGTGCAAGA |
1153 | 0 | 1 | 44 | 45 | 9 | 104 | 900 | CGCACCGGCATTTGTTACAGGAAGC |
1152 | 0 | 0 | 39 | 39 | 9 | 92 | 798 | CGGCATTTGTTACAGGAAGCGCGGT |
1151 | 15 | 4 | 3 | 22 | 2 | 2 | 4 | ATGCAGGGAGAATGCAGCTACCCAG |
1150 | 0 | 0 | 18 | 18 | 9 | 43 | 368 | GGTGGGTGGGCGTCTCTGTCCGGGT |
1149 | 8 | 7 | 2 | 17 | 8 | 1 | 5 | AAGCCGACTGCAGTGTGGAATGCAG |
This analysis yielded three potentially interesting aptamer candidates for qualitative analysis of target binding by BLItz bio-layer interferometry. Pre-folded aptamer candidates were immobilised on streptavidin-coated biosensors, while recombinant his-tagged CD73 and his-tagged sortilin were in solution. Both proteins exhibited no binding to empty streptavidin-coated biosensors. One of these candidates, NAC6552, bound specifically to the recombinant his-tagged CD73. NAC6551 exhibited no binding, and NAC6604 showed weak binding to both CD73 and his-tagged sortilin indicating that it might bind to the his-tag of both proteins (data not shown). Despite successful immobilisation of his-tagged CD73 on NTA biosensors, the reverse experiment, protein ligand and aptamer analyte, failed to identify binders. This is probably due to the relatively small size of the aptamer candidates vs. CD73, a phenomenon also seen in SPR experiments.36
The sequence of the CD73 binding aptamer, NAC6552, was used to further filter the sequencing data from these selection pools for sequences where the central 16 nucleotide segment was either identical or contained a maximum of three mismatches. This filtering process yielded an additional aptamer candidate (NAC6772) that resembled NAC6552 in the central region (also shown in Tables 3 and 4). Analysis by bio-layer interferometry showed that NAC6772 was also able to bind CD73 specifically.
The identification of candidate aptamer sequences after selection is clearly shifting from traditional cloning and Sanger sequencing to NGS-based approaches, and several groups have presented methods and software for analysis of NGS-data in the context of aptamer selection.11,32,35,37–40 Through the modification of the files defining the primers and library our software is applicable to all aptamer libraries. Several of the program steps are similar to steps in the program Aptaligner published by Lu et al.37 We believe that tandem use of NGS and well-designed analysis software allows a much more complete evaluation of the progress of a selection. It also enables the identification of genuinely enriched sequences from sequences resulting from PCR bias. The software described here is available free of charge at http://www2.sdu.dk/multi/bion-apta. Use of NGS and the associated bioinformatics to identify sequences of interest is limited only in its inability to calculate functionality of said sequences. As such it is necessary to have effective means of testing the sequences of interest for functionality. Bio-layer interferometry on the BLItz system appears to be a very effective means of screening the binding capabilities of individual aptamer candidates. The experimental demand on regents and time for a single aptamer is extremely low vs. methods such as SPR. With just a few μL of low μM aptamers and targets it is possible to discern the potential of an aptamer candidate in less than 15 minutes. However, with no automation, testing of batches of aptamer candidates can become tedious and time-consuming compared to more high throughput methods such as SPRi described below.
![]() | ||
Fig. 2 Secondary structure predictions and KD values for CD73 binding aptamers and derived truncations. NAC6552 and NAC6772 were truncated step-by-step as indicated by the arrows and secondary structures predicted by Mfold.26 LNA nucleotides are marked by red circles. All-DNA versions of NAC6552, NAC6749, NAC6772, and NAC6853 are not shown but indicated by sequence number. “*” indicates that KD values could not be determined for this aptamer. |
Despite such sequence similarity the most favourable predicted fold for NAC6552, the originally identified sequence, was somewhat different to that of NAC6772 (Fig. 3). However, it was not surprising to see that it was possible for NAC6772 to also form a secondary structure that produced a similar stem-bulge-stem-loop, with only three bases in the loop section differing. Fig. 2 shows this more similar secondary structure in an attempt to illustrate the rationale for the specific truncations tested. NAC6746, NAC6747, NAC6749 and NAC6748 were composed of the random region of NAC6552 and gradually reduced primer regions. Likewise, NAC6853 was a truncation of the primers from NAC6772. The remaining candidates depicted were attempts to alter the loop of the secondary structures to partially define bases essential for target binding.
![]() | ||
Fig. 3 Most favorable secondary structure predictions for NAC6772/NAC6750 (with and without LNA bases, respectively) and its truncated version NAC6853/NAC6970 suggested by Mfold.26 LNA nucleotides are marked by red circles. |
The conformational changes in an aptamer's secondary and tertiary structures that may be induced on target binding make it difficult to predict which bases and structures formed by a particular sequence are necessary for functionality. Despite this, secondary structure predictions often provide a useful starting point when considering truncation and mutation sites. Here, with the help of these predictions, we were able to make strategic modifications (truncations and LNA to DNA nucleotide substitutions) to the two CD73 binding sequences.
Library | Sequence (5′–3′) | ||
---|---|---|---|
a Kinetic analyses were performed in at least duplicates and mean ± standard deviation is shown. Enzyme inhibition activity of recombinant CD73 was determined colorimetrically and normalized to background levels (no AMP present) for each candidate. A560 values of CD73 alone and with each aptamer were used to derive percentage inhibition values. | |||
NAC5613 |
![]() |
— | — |
The KD value of the parent aptamer candidate NAC6552 was 9.9 ± 0.04 nM. A minor truncation to NAC6552 of the 13 nucleotide 3′ “tail” of the reverse primer region (resulting sequence NAC6746) improved the KD slightly to 6.5 ± 0.79 nM. NAC6747, further reduced by five nucleotides in the 5′ and 3′ regions performed equally well to the parent aptamer with a KD value of 9.3 ± 0.04 nM. However, further truncation, removing all of the reverse primer and leaving only the last six nucleotides of the forward primer (sequence NAC6749) had a slightly detrimental effect of the KD 15.4 ± 0.35 nM. The final truncation, where NAC6552 was reduced to 26 nucleotides of the random region, lead to a molecule (NAC6748) which was unable to bind CD73 in SPR assays.
The KD value of parent sequence NAC6772 was 3.54 ± 0.59 nM. A truncated variant of this (NAC6853), similar in design to NAC6749 was determined to have a KD of 12.5 ± 1.14 nM suggesting that truncation of this particular sequence was tolerated to approximately the same extent as for NAC6552 and therefore further truncations were not tested.
Reduction of the loop region of the shortest functional variant of NAC6552, in sequences NAC6854 and NAC6789 eradicated CD73 binding. Whilst substitution of the LNA nucleotides with the DNA equivalent in both the parent strands (NAC6552 and NAC6772) and the shortest functional variants of these (NAC6749 and NAC6853, respectively) retained comparable CD73 binding. Data showed the KD of the all DNA version of NAC6552 to be 5.48 ± 0.04 nM vs. 9.94 ± 0.07 nM and the KD of the all DNA version of NAC6772 to be 4.04 ± 0.28 nM vs. 3.54 ± 0.59 nM. Data also showed the all DNA version of NAC6749 to have a KD of 13.20 ± 1.84 nM vs. 15.40 ± 0.34 nM and finally the all DNA version of NAC6853s KD to be 19.00 ± 0.94 nM vs. 12.50 ± 1.14 nM. This final variant did behave somewhat differently to the other sequences during SPR requiring higher aptamer concentrations (0.1 μM compared to 0.025 μM) to achieve the appropriate response levels. This indicated that the actual capture level may have deviated from the calculated one. It is possible that the secondary structure of this aptamer required for CD73 binding is less stable than for other candidates and therefore higher immobilisation concentrations are required to produce the sufficient response level.
Collectively, these affinity experiments suggest that only small parts of the primer regions adjacent to the random region and the loop seen in secondary structure predictions are requirements for CD73 binding. We hypothesise that whilst the loop forms the majority of aptamer–target interactions, the primer region stabilises the formed structure. The results also demonstrated that despite the design of the library to promote LNA involvement in the binding regions, LNA were not fundamental for target binding with very little difference in affinity seen when substituting LNA for DNA. They also show that only a minimal part of the 3′-end of the forward primer was required to keep the functionality intact. This finding is in line with the findings of Kasahara et al. who saw that target affinity was improved for certain structural motifs and reduced for others when a DNA-primer was replaced with an LNA-containing version for aptamer production.19,20 Similarly, we have found that single LNA nucleotide substitutions in an avidin-binding DNA aptamer can significantly impact aptamer binding.41
NAC6748 was an interesting case. This was the shortest aptamer candidate synthesised and CD73 binding was not detectable by bio-layer interferometry or SPR. The fact that it was able to inhibit the activity of CD73 to a similar level to sequences seen to bind by SPR suggests that it forms a similar interaction with CD73 to its longer counter parts. This postulates the question of why binding was not observed in earlier assays. It is possible that in ligand immobilised assays with such short aptamers the aptamer–target interaction could be sterically hindered by the proximity of the binding region to the capture surface. This could be confirmed by synthesis of the short sequence with a spacer prior to the biotin but this was not performed here.
In light of this finding the enzymatic assay was repeated replacing the recombinant his-tagged CD73 with MDA-MB-231 cells. Unfortunately, no inhibition of the cellular protein's activity was seen with any of the aptamer candidates (data not shown), including NAC6748.
When trying to isolate aptamers against cellular proteins, compromises in the SELEX process are sometimes necessary. Direct selection against cellular proteins is possible42–45 but is challenging due to the complexity of cell surfaces and requires access to cell culture facilities. Using a purified recombinant protein as the target is a compromise. It simplifies the selection process but creates potential for evolution of aptamers that bind to parts of the protein that may be sterically unavailable in the cellular form. In this case, cellular CD73 is a GPI-anchored, cell-surface protein that forms a homodimer through non-covalent bonds,46 whilst the recombinant protein is a his-tagged protein expressed as a monomer without the GPI-anchor. This recombinant CD73 may or may not remain as a monomer after purification. Areas of CD73, such as the region around the GPI-anchor, near the cellular surface, may be inaccessible in the cellular vs. recombinant form. This may be the reason why we observed no binding of the aptamers to cellular CD73 in flow cytometry or inhibition of cellular CD73 activity. Though in this work a focus was on the technical aspects of the SELEX process and therefore only recombinant CD73 was used as the target for selection, we were interested to partially map the binding of the aptamer candidates to CD73 and utilised a series of analyte competition injections and SPRi to do so.
No aptamer binding of AMP was observed and co-injection of CD73 with AMP showed no reduction in affinity of the aptamer candidates (example shown in Fig. 5). This would indicate that the aptamers do not bind directly to CD73′s active site, which is in the C-terminal portion of the protein, but instead inhibit the enzymatic activity by changing the conformation of the protein or limiting the conformational change required for activity.
It was expected that when performing co-injections of CD73 and anti-CD73 antibodies (AD2 and IE9) and negative control antibodies (human IgG, IgG1 and anti-his tag), that competitive binding would knock out binding. However, non-competitive binding would lead to an increased response owning to the larger analyte (protein plus antibody) but the KD would remain unchanged. None of the aptamers bound competitively to the AD2 binding site but it was not possible to comment on the interaction with IE9 owning to a buffer response obscuring the picture (Fig. 5). The negative controls also had no effect on binding of CD73. These data indicate that the aptamers bind recombinant his-tagged CD73 in regions distal to the antibodies which can bind cellular or recombinant CD73.
Finally, none of the aptamer candidates bound to a recombinant N-terminal CD73 fragment indicating that the aptamer candidates bound CD73 in the C-terminal domain of the protein. In cellular CD73, the GPI-anchor is at the C-terminal meaning that part of the C-terminal domain will be in contact with the cell surface and unavailable for binding. Further to this it is the C-terminal that is involved in formation of the cellular homodimer protein possibly restricting aptamer access.
Without complicated structural analysis of the recombinant his-tagged CD73 complexed with a binding aptamer candidate by crystallography or NMR it is impossible to discern the actual binding site. Doing so could determine the reason why the CD73 aptamer candidates were able to bind and inhibit the recombinant but not the cellular form of the protein. However, by using SPRi it has been determined that the aptamers bind to the C-terminal domain and within this domain we know that significant portions of the protein are occupied by interactions with the cell surface or protein dimerization which could hinder aptamer–target interactions.
3′-Biotin-TEG CPG | 1-Dimethoxytrityloxy-3-O-(N-biotinyl-3-aminopropyl)-triethyleneglycolglyceryl-2-O-succinyl long chain alkylamino-controlled pore glass |
APCP | α,β-Methylene ADP |
bp | Base pairs |
CD73 | Ecto-5′-nucleotidase |
GPI | Glycosylphosphatidylinositol |
K D | Equilibrium dissociation constant |
LNA | Locked nucleic acid |
NGS | Next-generation sequencing |
SELEX | Systematic evolution of ligands by exponential enrichment |
SPR | Surface plasmon resonance |
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5mb00045a |
This journal is © The Royal Society of Chemistry 2015 |