 Open Access Article
Trang Vu (a), Shanna-Leigh Davidson (a), Julia Borgesi (a), Mowla Maksudul (a), Tae-Joon Jeon (b) and Jiwook Shim* (a)
(a) Department of Biomedical Engineering, Henry M. Rowan College of Engineering, Rowan University, 201 Mullica Hill Road, Glassboro, New Jersey 08028, USA. E-mail: shimj@rowan.edu; Tel: +1 856 256 5393
(b) Department of Biological Engineering, Inha University, Incheon 22212, Republic of Korea
First published on 4th September 2017
Cancer is the result of a multistep process, including various genetic and epigenetic alterations, such as structural variants, transcriptional factors, telomere length, DNA methylation, histone–DNA modification, and aberrant expression of miRNAs. These changes cause gene defects in one of two ways: (1) gain in function which shows enhanced expression or activation of oncogenes, or (2) loss of function which shows repression or inactivation of tumor-suppressor genes. However, most conventional methods for screening and diagnosing cancers require highly trained experts, intensive labor, large counter space (footprint) and extensive capital costs. Consequently, current approaches for cancer detection are still considered highly novel and are not yet practically applicable for clinical usage. Nanopore-based technology has grown rapidly in recent years, which have seen the wide application of biosensing research to a number of life sciences. In this review paper, we present a comprehensive outline of various genetic and epigenetic causal factors of cancer at the molecular level, as well as the use of nanopore technology in the detection and study of those specific factors. With the ability to detect both genetic and epigenetic alterations, nanopore technology would offer a cost-efficient, labor-free and highly practical approach to diagnosing pre-cancerous stages and early-staged tumors in both clinical and laboratory settings.
Fig. 1 Schematic view of the NP-based experimental setup. The two chambers (cis and trans) are separated by a membrane, which is usually either biological or solid-state. A nanopore, which was embedded/fabricated into the membrane, acts as the single channel connecting the two chambers. DNA is added to the cis side. Under the electrophoretic force exerted by applied voltage, DNA strands translocate through the NP to the trans chamber, creating characteristic current blockages.2
Both biological and solid-state NPs, which can be obtained or fabricated in numerous ways,3–8 can detect a wide range of biomolecules. Biological NPs are derived from bacterial proteins, the two most popular being α-hemolysin and the MspA porin. These biological NPs are usually inserted into biological substrates, such as a phospholipid bilayer, liposomes, or polymer films. Biological NPs are structurally well-defined and easily reproducible, and they are mostly used for the detection of single-stranded DNA (ssDNA) and microRNA (miRNA), and for disease diagnostics.2 Most solid-state NPs are fabricated in membranes made of silicon oxide (SiO2), silicon nitride (SiNx), hafnium oxide (HfO2), graphene, aluminum oxide (Al2O3) and hybrid materials.9–11 With controllable pore size and membrane thickness, solid-state NPs have proven useful for RNA sequencing, single-stranded and double-stranded DNA sequencing, DNA–protein complex detection, and other biomolecule detection.
Since its development and publication in 1996, the NP has become an emergent and powerful technology, offering a direct and inexpensive method for DNA sequencing, biosensing, and detecting biological or chemical modifications on single molecules, as well as for studying the kinetics of DNA and protein folding. NP technology offers many advantages that NGS devices cannot. For example, NP has demonstrated the ability to detect CpG methylation (one of the earliest epigenetic biomarkers among cancer hallmarks) without the need for PCR amplification or bisulfite conversion.26,27 Thus, NP technology has the potential to be a genomic tool that is label-free and has high throughput, a small sample volume requirement, flexible runtime, and minimal footprint.2 However, despite twenty years of significant progress in single-molecule sequencing and analysis, NP technologies have not yet been translated into even distantly comparable advances in clinical settings. Two general, synergistic goals have been pursued to increase the efficacy of single-molecule analysis using NP: to decrease the translocation time of biomolecules through the pore and to increase the base-calling accuracy. An ideal single-molecule analysis system would be highly accurate, have a high throughput, and be sensitive to both genetic and epigenetic changes of the cancer genome. Many previous articles have extensively discussed the potential and development of NP technology, especially in DNA sequencing.2,3,28–32 However, to the best of our knowledge, an article focusing solely on the application of NP technology in the early detection of various types of cancer biomarkers and causal factors has yet to be published. We believe this review will contribute to the further understanding of the potential and challenges of applying NP technology in cancer research.
Herein, we provide a brief overview of the six main cancer-causing factors, along with the methods conventionally used to detect cancer at the molecular level. We then review NP technology, focusing on its development as a method for specific molecular detection, as well as its future potential and challenges in the clinical domain. The studies presented here are not intended to form an exhaustive list, but rather to illustrate the major achievements and challenges of applying NP technology in early cancer detection.
SVs are important indicators of human cancers.33,41–43 Complex SVs have been found to cause approximately half of nucleotide deletions in pancreatic ductal adenocarcinoma (PDAC).44–46 Furthermore, CDKN2A/p16 and SMAD4/DPC4 have been identified as two of the most commonly deleted tumor suppressor genes. The ability to detect these mutations is critically important to the healthcare industry, as it allows cancer patients to be monitored for early detection of possible relapse.33,44–47 In mammalian cells with highly repetitive genomes, studies of SVs frequently use a resequencing approach, in which reads from the target genome are independently aligned to the reference genome to search for SVs.48 In general, besides specificity and sensitivity, a method for detecting SVs is further judged by its ability to accurately predict breakpoint locations, the size of variants, and changes in copy count.33,49
Norris et al. demonstrated the value of detecting long SVs with the Oxford MinION™ by detecting a series of well-characterized SVs, including large deletions, inversions, and translocations that inactivate the CDKN2A/p16 and SMAD4/DPC4 tumor suppressor genes in pancreatic cancer.33 Using Oxford Nanopore barcodes, Norris et al. produced libraries for all 12 PCR amplicons in one run, yielding reads with PHRED scores of 10.9–11.50. PHRED, introduced in 1998 by Ewing and Green, was originally a base-calling program for automated sequencer traces. In later research, the term “PHRED score” has come to denote a measure of the quality and accuracy of base calls and consensus sequences. The higher the PHRED score, the higher the accuracy. For example, a PHRED score of 10 corresponds to 90% base call accuracy, and a PHRED score of 20 to 99% base call accuracy.50 In this study, the reads averaged 640 bps in length with a PHRED score of 11.50. These reads were also found to be consistent over their entire length. The amplicons mapped with an overall percentage of 99.6% to regions of hg19, while 79% of aligned reads accurately matched to bases. Notably, accuracy did not change with the complexity of the amplicon sequence. Additionally, the researchers wanted to test their method with low-frequency SVs. In a 1:100 dilution, the run produced 4058 2D reads from 270 of 512 channels. The average read length was 650 bps, with a PHRED score of 10.9. Overall, the researchers showed that their method can be carried out in a timely manner. For the two sequences (CDKN2A/p16 and SMAD4/DPC4) in this study, it took 15 minutes and 33 minutes, respectively, to generate 450 reads.33 In comparison, 2nd generation sequencers can generate millions of reads simultaneously, but take hours to days to complete a run. The experiment indicated the ability of NPs to serve as a reliable and efficient method of sequencing, allowing rapid detection of tumor-associated structural variants. The two limitations of MinION™, as noted by the researchers, were (1) a relatively high mismatch and index error rate and (2) a limited yield (on the scale of Mb or Gb) (Fig. 2).
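The relation between a PHRED score and base-call accuracy quoted above follows the standard definition Q = −10·log10(P), where P is the probability that a base call is wrong. The short Python sketch below is only an illustration of that definition; the function names are ours and are not taken from Norris et al. or from any sequencing software.

```python
import math


def phred_to_error(q: float) -> float:
    """Probability that a base call is wrong for PHRED score q (Q = -10*log10(P))."""
    return 10 ** (-q / 10)


def phred_to_accuracy(q: float) -> float:
    """Per-base call accuracy (1 - error probability) for PHRED score q."""
    return 1.0 - phred_to_error(q)


def error_to_phred(p: float) -> float:
    """PHRED score for a per-base error probability p (0 < p <= 1)."""
    return -10 * math.log10(p)


if __name__ == "__main__":
    # Q10 and Q20 are the examples given in the text; 10.9 and 11.5 are the
    # average scores reported for the MinION reads.
    for q in (10, 20, 10.9, 11.5):
        print(f"PHRED {q}: accuracy {phred_to_accuracy(q):.1%}")
```

Under this relation, the reported average scores of 10.9 and 11.50 correspond to per-base accuracies of roughly 91.9% and 92.9%, respectively.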
Fig. 2 Detecting structural variants with nanopore. (A–D) Schematic of the Oxford MinION Nanopore Library Prep workflow. Oxford Nanopore barcodes were pooled into amplicons by PCR. After NEB End Repair and dA-tailing modules, hairpin and leader adapters were ligated on, each containing a motor protein (orange). Then, tether attachment allowed DNA molecules to attach to the membrane of the MinION flowcell. Within the flowcell, molecules, each with attached motor proteins, were pulled through a nanopore, producing a 2D consensus read. (E) Size comparison between an Oxford MinION and a quarter coin.33
Compared with conventional genome-based methods, such as fluorescence in situ hybridization (FISH), fiber-FISH, array comparative genomic hybridization (aCGH) and paired-end mapping (PEM), which have a read length of approximately 35–400 base pairs (bps),40,49 NP sequencing allows much more flexible read lengths (from a few bps to kbps). However, the average PHRED score of reads generated by MinION is still relatively low compared with other sequencers (e.g. Illumina, 454, Ion Torrent, and PacBio). At the moment, Illumina is the most popular DNA sequencer on the market. Still, depending on the equipment model and sample size, sequencing with Illumina can take 3–12 days to complete. Additionally, the current market price of Illumina sequencers ranges from $50 000 (MiniSeq) to over $6M (Illumina HiSeq X Five), far more than NP-based sequencers.
Various direct and indirect techniques have been used to characterize TFs, along with other sequence-specific DNA binding proteins, including electrophoresis, electrophoretic mobility shift assay, nuclear magnetic resonance, X-ray crystallography, atomic force microscopy, optical tweezers, and direct fluorescent visualization, among others.58–63 However, most of these methods require some combination of chemical cross-linking between TFs and DNA, modification or tagging of the TF and DNA, and amplification assays. Furthermore, because of these complicated requirements, the methods lack the ability to resolve fine details of the TF–DNA complex (i.e. partial versus full binding of the TF domains to DNA).63 The specific mechanism by which TFs bind to DNA sequences is still under intensive study and is a major area of interest in molecular biology.63,64
Squires et al. used solid-state NPs as biosensors for the characterization of DNA, RNA, and proteins. Using an electric field, the researchers could guide the polymers through a NP and identify individual molecules. The current-blockage patterns generated during translocation of charged molecules provide an abundance of information about local TF properties, as well as TF–DNA interactions.63 As previously noted, the regulation of TFs has not been well investigated; hence, solid-state NPs could offer a novel technique for describing these molecular interactions. As a proof of principle, Squires et al. showed that their NPs can distinguish between specific and nonspecific binding of a TF by analyzing the ion current of the canonical zinc-finger DNA-binding domain of early growth response 1 (zif268). The zif268 domain was characterized using the distinct blockage patterns of the current within the nanopore.65 On analyzing the data, the researchers found three main types of blockages, occurring mostly in five distinct patterns rather than randomly. These patterns correlate directly with preexisting data. Hence, the NP presents great potential in characterizing DNA complexes because of its ability to detect complex structures and protein conformations, with the possibility of removing TFs as needed. Squires et al. note that their NP sensor can identify small TFs on DNA as well as distinguish between specific and nonspecific binding. The technique thus provides a means of gathering information on TF–DNA interactions (Fig. 3).
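To make the idea of a current-blockage event concrete, the following Python sketch extracts blockade events from a sampled ionic current trace with a simple threshold. It is a generic illustration only: the analysis pipeline used by Squires et al. is not described here, and the function name, the 80% threshold, and the toy trace are all assumptions made for this example.

```python
from statistics import mean


def detect_blockades(trace, baseline, threshold_fraction=0.8):
    """Find runs of samples below threshold_fraction * baseline.

    Returns a list of (start_index, end_index, mean_current) tuples, where
    end_index is exclusive. Assumes a positive open-pore current `baseline`
    in the same (arbitrary) units as `trace`. The default threshold is a
    hypothetical choice, not a published value.
    """
    threshold = baseline * threshold_fraction
    events = []
    start = None
    for i, value in enumerate(trace):
        if value < threshold and start is None:
            start = i  # a blockade begins
        elif value >= threshold and start is not None:
            events.append((start, i, mean(trace[start:i])))
            start = None  # the pore is open again
    if start is not None:  # trace ended while the pore was still blocked
        events.append((start, len(trace), mean(trace[start:])))
    return events


# Toy trace (arbitrary units): open-pore level near 100 with a shallow
# blockade followed by a deeper one.
trace = [100, 101, 99, 60, 58, 61, 100, 99, 30, 31, 29, 32, 100]
events = detect_blockades(trace, baseline=100)

for start, end, level in events:
    depth = 1 - level / 100  # fractional blockade depth
    print(f"samples {start}-{end - 1}: dwell {end - start}, depth {depth:.2f}")
```

In practice, the dwell time and depth of each event, and the sequence of levels within it, are the features from which blockage types such as those reported for zif268 would be classified.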
Fig. 3 Distinguishing between specific and non-specific binding of TF–DNA with solid-state nanopore. Translocation event traces and proposed mechanisms for (A) specific binding, and (B) non-specific binding of TF to DNA.63
[…] 102 individuals from the general population, who were followed for up to 20 years to determine the relationship between telomere length and cancer. Although short telomere length is not an indication of cancer,69 it was observed that cancer patients with shorter telomeres had an increased risk of early death. This result was observed in patients with lung and esophageal cancer, malignant melanoma, and leukemia.69
Even though years have passed since the first studies, the kinetics of telomeres in cancer cells remain elusive. At present, measuring the length of telomeres and observing the kinetics of folding are still challenging, as there is no gold-standard technique.73 In order to fully understand the role of telomeres in cancer prediction or therapy, it is essential to understand the kinetics of telomere folding and other conformational changes in response to different living and environmental conditions.
Work is currently underway to apply NP sensors to track the folding and unfolding of telomeric DNA G-quadruplexes. Several research groups have used biological NPs to capture some or all of the four folded structures of the G-quadruplex: hybrid (hybrid-1 and hybrid-2), basket, and propeller.10,74–76 These studies reported that even though the four G-quadruplex structures all fold from the same DNA sequence, they produce very different electrical signatures.76 This was attributed to the overall shape and volume of each secondary structure. The hybrid-1 and hybrid-2 forms were observed to have a diameter of 2.7 nm, and the basket form a diameter of 2.4 nm. Since the cis opening of the α-hemolysin pore has a diameter of 3.0 nm, these three folds can enter the large vestibule. However, the propeller fold, with a disk-shaped structure and a diameter of 4.0 nm, exceeds the diameter of the NP cis opening and was unable to enter the vestibule.77
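The size argument above can be written out as a simple comparison. The Python sketch below uses only the dimensions quoted in this review (fold diameters, the 3.0 nm cis opening, and the 1.4 nm constriction given in the Fig. 4 caption); treating each fold as a rigid object of a single diameter is our simplifying assumption, and the function and label names are hypothetical.

```python
# α-Hemolysin dimensions quoted in the text and in the Fig. 4 caption (nm).
CIS_OPENING_NM = 3.0
CONSTRICTION_NM = 1.4

# G-quadruplex fold diameters quoted in the text (nm).
FOLD_DIAMETER_NM = {
    "hybrid-1": 2.7,
    "hybrid-2": 2.7,
    "basket": 2.4,
    "propeller": 4.0,
}


def classify_fold(diameter_nm: float) -> str:
    """Purely geometric classification; ignores flexibility, orientation,
    and electrostatics (a simplifying assumption of this sketch)."""
    if diameter_nm >= CIS_OPENING_NM:
        return "excluded from vestibule"
    if diameter_nm >= CONSTRICTION_NM:
        return "enters vestibule, must unfold to pass constriction"
    return "can pass constriction while folded"


outcomes = {name: classify_fold(d) for name, d in FOLD_DIAMETER_NM.items()}

for name, outcome in outcomes.items():
    print(f"{name}: {outcome}")
```

The result reproduces the observation that the hybrid and basket folds are captured in the vestibule while the propeller fold is excluded.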
Another inventive solution to capture and unravel G-quadruplexes is to add a 25-mer poly-2′-deoxyadenosine tail (d25A-tail) to the 5′ end of the telomeric DNA. Applying this method, the Burrows group reported the analysis of various folding motifs of the telomere sequence, with and without the 5′-d25A-tails.76 Among the four loop topologies, only the basket fold was able to translocate through the NP without the addition of the homopolymer tail to the 5′ end. For the G-quadruplex to move through the NP, it needs to unravel into a single strand that can translocate through the narrow β-barrel, and the remaining G-triplex has to roll within the vestibule. This is likely a favorable process for the basket fold because of its nearly spherical shape.78 Even though the volume of the vestibule is large enough to accommodate all four G-quadruplexes within its cavity, the narrow entrance of the vestibule prevented the propeller fold from entering the NP. With the addition of the 5′ tail, however, the propeller fold circumvented the problem of entering the cavity and still showed a very fast translocation signature. This is attributed to the fact that the propeller fold was able to roll outside of the vestibule while an electric force was applied to the dA25-tail as it threaded through the ion channel, without any of the molecular interactions or steric hindrance that it would have experienced inside the vestibule.76
In light of those previous studies, the unfolding kinetics of human i-motifs were studied for the first time using the α-hemolysin NP. Under acidic conditions, cytosine (C)-rich DNA sequences can adopt i-motif folds, since the hemi-protonation of C-rich strands allows C+·C base pairs to form.79 Ding et al. conducted experiments on the human i-motif sequence at a constant ionic strength but varying pH (5.0–7.2). Since the dimension of an i-motif (2.0 nm × 2.0 nm) is smaller than the cis opening (∼3.0 nm) of the α-hemolysin pore, it can enter the pore without unfolding and be captured in the nanocavity.79 Hence, a d25A tail was attached to the sequence in order to increase the unfolding rate of the i-motif. Upon the attachment of d25A, it was observed that at pH 5.0, the folded structure entered the α-HL pore, yielding characteristic current patterns. However, when the pH was 6.8 and 7.2 (higher than the transition pH of 6.15), the percentage of strands still folded was 4% and 2%, respectively. Furthermore, the force applied in this study was analogous to the forces exerted on genomic DNA by RNA polymerase II (5–20 pN) and DNA helicase (6–16 pN).79 Hence, these studies show the potential of α-hemolysin in biosensor development, adding to our knowledge of the lifetimes of i-motifs in telomere sequences and of their biologically relevant structures, which can be used as drug delivery targets for cancer treatments.80
These findings are steps toward a better understanding of the folding and unfolding mechanisms of the telomere. When pre-detecting different cancer types, conventional methods, such as FISH, Southern blot, and quantitative PCR, require complicated meta-analyses, chemical crosslinking and intensive preparation; hence the results are inconsistent.69,81–83 NP analysis, by contrast, lacks all those complications and allows a better understanding of the kinetics and mechanisms, aiding in the analysis of how oxidation, stress and other factors affect the length of telomeres, as well as of the correlation between cancer development and telomere immortality (Fig. 4).
Fig. 4 Capturing the unfolding process of the four G-quadruplex structures with biological nanopore. (A) Schematic of the α-hemolysin NP, with a cis opening of 3.0 nm, a constriction of 1.4 nm, and a trans opening of 2.0 nm. (B) Folding structures and dimensions of G-quadruplex conformations: hybrid-1, hybrid-2, basket, and propeller. (C) A G-quadruplex fold enters and unfolds inside the nanocavity of the α-hemolysin NP, causing two distinct levels of blockade. (D) Except for the propeller fold, all G-quadruplexes can enter the cis opening of the α-hemolysin NP without unfolding, but cannot pass through the pore constriction.65 (E) Models of the three conformations with the additional 5′-dA25 tail unraveling through the α-hemolysin pore. Both the hybrid and basket folds were able to enter the cis opening of the α-hemolysin pore and thus unraveled inside the pore nanocavity. The propeller fold, on the other hand, could not enter the NP because of its size. This conformation unraveled outside of the pore with the help of the additional 5′-dA25 tail.65
The overall level of 5-methylcytosine contained in a cell sample can be quantified using high-performance liquid chromatography (HPLC), high-performance capillary electrophoresis (HPCE), bisulfite sequencing, and methylation-specific PCR, among many others.102–108 However, these methods have certain drawbacks. For example, although HPLC and HPCE can accurately quantify the total amount of methylated CpGs, they suffer from incomplete restriction enzyme cutting, offer a limited region of study, require substantial amounts of high molecular weight DNA, and are labor intensive. Similarly, with PCR-based methods, only the methylation status of CpG sites that are complementary to the primers can be interrogated. Thus, the results may not necessarily reflect the predominant methylation patterns in the sample (false positive results).
With NP analysis, current methods for detecting aberrant CpG methylation usually employ either a methylation-specific labeler or electro-optical tagging.26,27,109 The first method, as proposed by Shim et al., employs an engineered methyl-CpG-binding domain protein (i.e. MBD1x or Kaiso Zinc Finger proteins) as a selective labeler to detect and quantify hypermethylated CpG sites in double-stranded DNA (dsDNA).26,109 As the DNA translocated through the NP, the presence of 5-mC·labeler complexes caused a signature current blockage, allowing the detection and coarse quantification of 5-mC sites on a single molecule.109 This method established an initial application in screening for the presence of hyper- and hypomethylated DNA. Moreover, Shim et al. pointed out that, given the versatile binding affinity of KZF for various methylation patterns, the assay can allow various patterns to be screened.26 Since NP analysis requires low volumes of DNA for testing, the technique will be more applicable and practical for clinical use. Without the need for DNA replication and amplification, detecting CpG methylation using NPs requires much less labor than conventional methods.
The second method, as mentioned before, uses an electro-optical solid-state NP to detect and quantify hypomethylation in DNA.27 In this approach, DNA methyltransferases (MTases) were assisted by small molecular weight synthetic cofactors to catalyze a one-step enzymatic reaction. The enzyme–cofactor complex was directly conjugated onto fluorescent probes and attached to the unmethylated CpG sites. The Meller group was able to detect and differentiate between fully methylated, partially methylated and unmethylated dsDNA, using ultrasensitive electro-optical NP sensing as the tool for single-fluorophore multicolor quantification. Unlike MBPs, DNA MTase labeled only the unmethylated CpG sites of the target DNA. This allowed the direct targeting of hypomethylated CpG sites in the genome (i.e. promoter regions of oncogenes). Furthermore, this electro-optical solid-state NP showed a high potential for employing multiple DNA MTases and other epigenetic biomarkers. With the aid of those biomarkers, orthogonal labeling/sensing of 5-mC can be achieved in the future.27 Further research is needed to develop a calibrated scale for counting the number of unmethylated CpGs in the target sequence.
The presence of bulk 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) on ss- and dsDNA has been successfully detected and distinguished using both solid-state and biological NPs.121–123 For instance, the Drndic group proposed a method using a solid-state NP to discriminate between two different structures (5-mC and 5-hmC) that translocated through the pore. Upon the addition of 3 kbp dsDNA, a sequence of current blockages was generated, in which the magnitude of each spike was related to the excluded volume of the biopolymer occupying the pore. From the differences in ΔImax values, Wanunu et al. were able to discriminate between 5-mC and 5-hmC. The shorter end-to-end distance of the more polar 5-hmC indicated an increased flexibility of 5-hmC compared with cytosine and 5-mC. Moreover, it was shown that different proportions of 5-hmC in DNA fragments containing cytosine and 5-mC can be quantified using the ionic current signal.121 The second device used in the detection of CpG methylation variants employed both the wild-type phi29 DNA polymerase (phi29 DNAP) and MspA in the same assay.122,123 With this unique approach, Wescoe et al. reported direct detection of all five cytosine variants (C, mC, hmC, fC and caC). In this single-molecule tool, a phi29 DNA polymerase drew ssDNA through the pore in single-nucleotide steps and the ion current through the pore was recorded.122 Overall, the single-pass call accuracy ranged from approximately 91.6% to 98.3%, depending on neighboring nucleotides.122,123 Since knowledge of the five cytosine variants, especially fC and caC, is still very limited, the possibility that these variants have an impact on genome-wide demethylation or other modifications in cancer cells should not be excluded.
These studies have shown the potential of NP analysis as a robust and efficient tool for the study of DNA methylation. The technique can directly detect CpG methylation without the need for DNA amplification or complicated preparation processes. Due to its special characteristics, methylation of CpG is usually erased during replication and amplification. Bisulfite conversion, for example, requires large amplification, leading to false positive results. Hence, NP analysis could be a more practical and reliable method to screen for and detect aberrant DNA methylation in cancer patients. However, in order to apply NP technology to clinical trials and testing, genome-wide mapping of CpG methylation needs to be developed with a higher base-call accuracy (Fig. 5).
Fig. 5 Distinguishing variants of cytosine with biological and solid-state nanopores. (A) Chemical structures of cytosine and its variants. First row: mC (left) and fC (right). Second row: cytosine. Third row: hmC (left) and caC (right).122 (B) Schematic of the phi29 DNAP–MspA complex. The MspA pore constriction is shorter and narrower than that of α-hemolysin (as shown at the top), allowing subtle structural changes to be distinguished. (C) A typical trace of DNA translocation through the phi29 DNAP–MspA complex.121 (D) Detection of DNA methylation with methyl binding proteins (MBP) using a solid-state nanopore. MBPs bind to methylated CpGs on DNA, allowing the detection and differentiation of unmethylated, hypermethylated and locally methylated DNAs. (E) Detection of DNA methylation with optical tagging using a solid-state nanopore.27
Several studies have been conducted on the translocation or unravelling of a nucleosome and its subunit structure through NP.136,138,139 Generally, it was found that DNA–histone complexes require a higher applied voltage and overall longer time periods to translocate through the NP, most likely because (1) the bulky, disk-shaped nucleosome experiences a higher drag force than bare dsDNA, (2) the positively charged histone core lowers the total net charge density of nucleosomes, reducing the electrophoretic translocation speed, or (3) the histone–DNA complex must unwind.136,140
As mentioned earlier, epigenetic modifications have been known to affect the structural integrity and stability of nucleosomes. Given this fact, it was hypothesized that methylation of CpGs on dsDNA would affect the way nucleosomes fold and/or unravel. To test this hypothesis, Langecker et al. investigated the influence of DNA methylation on the stability of unlabeled mononucleosomes.139 Similar to the results reported in other studies, under the electrophoretic force, the nucleosomal DNA tail entered the pore and gradually unraveled under increasing voltage, which was much higher than that needed for free DNA capture.141 This experiment was repeated on nucleosomes with and without methylated DNA sequences, showing that methylation of CpGs did not affect nucleosome assembly, stability, or unraveling trajectories. This finding suggested that histone modifications (e.g. acetylation and phosphorylation) play a much more dominant role in nucleosomal maintenance than DNA methylation. The confirmation of methylation-independent nucleosome stability pointed to other possible mechanisms by which DNA methylation alters gene expression, for example, modulating the binding of transcription activators/repressors.139
The NP-based studies outlined herein lay the groundwork for understanding and predicting the influence of different histone core modifications on the nucleosome structure,139 about which our knowledge is still quite limited. Unlike conventional methods (e.g., single-gene chromatin immunoprecipitation (ChIP), ChIP with a DNA array (ChIP-on-chip),142,143 HPLC, HPCE, and many others), NP devices are more versatile, because they do not rely heavily on the quality of the polyclonal antibodies or on which antibodies are available.101 Although the study here indicated that DNA methylation does not affect nucleosome assembly, further studies are needed to confirm the role of DNA methylation in other processes (e.g. regulating the binding of transcription activators/repressors, or gene expression), as well as the effects of acetylation and phosphorylation on nucleosome assembly and chromatin stability.
Detection of miRNAs faces several challenges, mainly because miRNAs are so short. Some quantitative methods have been applied to miRNA detection with enhanced sensitivity and/or selectivity, including quantitative reverse transcription real-time polymerase chain reaction (qRT-PCR) assays, microarrays, colorimetry, bioluminescence, enzyme turnover, electrochemistry, molecular beacons, deep sequencing and single-molecule fluorescence.146–149 Unfortunately, these techniques suffer from DNA amplification errors, unavailable internal controls, and cross-hybridization. The short sequence of miRNAs also makes the design of probes and primers even more challenging.146,148
MiRNAs have been investigated as potential molecular biomarkers, because their expression levels are associated with various diseases.150 For instance, each year, lung cancer causes approximately 1.2 million deaths worldwide.151 Since there is no effective screening procedure available, more than 70% of lung cancer patients had a less than 15% chance of 5-year survival at diagnosis.151 More than 100 types of miRNAs have been identified as deregulating lung cancer progression.150 Notably, high levels of miR155 and low levels of let-7a-2 have been associated with a significantly poor prognosis and shorter survival times in lung cancer patients.152,153 Many research groups have used biological and solid-state NPs for the detection of miRNAs in different tissues. For example, a solid-state NP was used for rapid detection of probe-specific miRNAs (miRNA-122a and miRNA-153).154 Specifically, for every 1 fmol of miRNA duplex per mL of solution, the capture rate was 1 molecule per second. In this study, the p19 protein from the Carnation Italian ringspot virus was used to enrich miRNA-122a and miRNA-153. Since miRNA concentrations were 1% relative to other cellular RNAs, an enrichment step was required to detect a specific miRNA using a NP.154 P19 binds 21–23 bps dsRNA in a size-dependent, but sequence-independent manner. Additionally, the high-affinity and selective viral p19 protein does not bind ssRNA, tRNA or rRNA. This eliminates the possibility of false results from mismatched binding.155 Detection of 250 molecules in 4 minutes was sufficient to determine miRNA concentration with 93% confidence.154
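The capture-rate figure quoted above (about 1 molecule per second for every 1 fmol of miRNA duplex per mL) suggests how an event count could be turned into a concentration estimate. The Python sketch below is a minimal illustration under two assumptions of ours that are not stated in the source: that the capture rate scales linearly with concentration, and that capture events follow Poisson counting statistics. It does not reproduce how the 93% confidence figure in ref. 154 was obtained.

```python
import math

# Calibration quoted in the text: 1 event per second per (fmol/mL).
CAPTURE_RATE_PER_FMOL_PER_ML = 1.0


def estimate_concentration(n_events: int, duration_s: float,
                           k: float = CAPTURE_RATE_PER_FMOL_PER_ML):
    """Estimate concentration (fmol/mL) from a count of capture events.

    Assumes rate = k * concentration (linear calibration, our assumption)
    and Poisson statistics, so the relative standard uncertainty of the
    count, and hence of the estimate, is 1 / sqrt(n_events).
    """
    if n_events <= 0 or duration_s <= 0:
        raise ValueError("n_events and duration_s must be positive")
    rate = n_events / duration_s          # events per second
    concentration = rate / k              # fmol per mL
    relative_uncertainty = 1 / math.sqrt(n_events)
    return concentration, relative_uncertainty


# The example from the text: 250 molecules detected in 4 minutes.
conc, rel_unc = estimate_concentration(250, 4 * 60)
print(f"estimated concentration: {conc:.2f} fmol/mL (+/- {rel_unc:.1%})")
```

For 250 events in 240 s, this gives an estimate of about 1.04 fmol/mL with a relative counting uncertainty of about 6.3%.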
An alternative to using viral proteins for probe-specific miRNA detection is to employ an engineered probe with a programmable sequence that can distinguish single-nucleotide differences among miRNA family members.150 Wang et al. proposed a system that enabled sensitive, selective, and direct quantification of cancer-associated miRNAs in the blood. In this study, the group constructed a robust protein nanopore-based sensor that used an oligonucleotide probe (P155) to detect aberrant expression of miRNA-155 and let-7a-2 from lung cancer patients.150 The signature electrical signals generated provided direct, label-free detection of the target miRNA against a fluctuating background, such as plasma RNA extract.150 The P155 probe has a programmable sequence and can be optimized to achieve high sensitivity and selectivity. Additionally, through chemical modification, distinct probes can be engineered with specific barcodes, allowing multiple miRNAs to be detected simultaneously. Furthermore, as miRNA markers are developed, customizable NP arrays for miRNA profile detection could be constructed for noninvasive screening and early diagnosis of cancer.150
Compared with qRT-PCR assays, microarrays, colorimetry, bioluminescence, and other current methods,146–149 NP arrays offer a simpler, faster way to detect miRNAs in cancer patients. This approach avoids the complications of conventional methods, such as DNA amplification errors, unavailable internal controls, and cross-hybridization. Early detection is one of the most crucial contributors to a higher survival rate, especially for lung cancer patients (Fig. 6).151
|  | ||
| Fig. 6 Detection of miR-155 using solid-state and biological α-hemolysin nanopores. (A) Schematic of miRNA detection with viral proteins for probe-specific miRNA, using a solid-state nanopore. Protein from Carnation Italian ringspot virus was used to enrich miRNA from background fluid. (B) Detection of probe-specific miRNA using an α-hemolysin biological nanopore. MiRNA-155 (shown in red) was attached to a DNA P155 probe (shown in green). (C) At pH 8.0 and 100 mV, translocation of the miRNA-155·P155 complex resulted in various current blockage patterns. (D) A typical current blockade with three characteristic blocking levels, representing the mechanism of miRNA-155·P155 complex dissociation and translocation through the pore (as shown on the right-hand side).154 | ||
| Cancer biomarkers | Type of nanopore | Support needed | Advantages | Limitations | Ref. | |
|---|---|---|---|---|---|---|
| BL | SS | |||||
| a BL = biological nanopore; SS = solid-state nanopore; (*) = label-free, requiring no additional aid to detect biomolecules. | ||||||
| Structural variants | ✓ | * | Does not require multiple processing steps and file output formats. Low capital cost and short sequencing time | Low sensitivity and accuracy, high mismatch rate. Thus, the PHRED score is not yet high enough for cancer detection | 33 | |
| Transcriptional factors | ✓ | * | Label- and tether-free. Does not require chemical crosslinking or tagging. Hence, allows direct detection and distinction between full versus partial, and specific versus nonspecific, binding | Not able to predict the exact binding site of TFs on an unknown DNA sequence | 63 | |
| Telomere | ✓ | * | Does not require complicated sample preparation and labeling, thus allowing the kinetics and folding/unfolding mechanisms of the four different G-quadruplex structures to be studied | The correlation between telomere folding/unfolding and shortening has not been clarified | 10, 74–76, 78 and 79 | |
| ✓ | Poly-adenosine tail | |||||
| Aberrant methylation of CpGs | ✓ | ✓ | Methyl-specific labelers | Does not require PCR or complicated sample amplification. Hence, eliminates false-positive results and allows epigenetic changes to be captured. Real-time detection of all four variants of cytosine (mC, caC, fC, hmC) | Only provides coarse quantification of methylation on CpGs. Thus, it is still challenging to finely quantify and map the methylation profile of CpGs at the promoter region (essential for precancerous and tumorous detection) | 26, 27, 107 and 119–121 | 
| ✓ | Optical-tagging | |||||
| Histone–DNA modification | ✓ | * | Shows that DNA methylation does not affect the ability of the nucleosome to ravel and unravel. Moreover, the NP allows nucleosomes to unravel while the double-stranded DNA remains unzipped | Has not yet demonstrated the ability to detect and/or map global changes in post-translational modifications (i.e. methylation, acetylation) on nucleosomes | 135–138 | |
| Expression of miRNA | ✓ | ✓ | Engineered probe | Does not require amplification and cross-hybridization of samples | Multiple miRNAs can be up- or downregulated at the same time in cancerous cells. Hence, several miRNAs have to be tested and clustered together. However, since the sensitivity and accuracy of NPs are still low, this could lead to false results | 149 and 153 | 
| Viral proteins | Multiple miRNAs can potentially be detected using engineered probe. Data were consistent with good statistical distribution | |||||
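The PHRED score named in the structural-variants row of the table above is the standard log-scaled measure of base-calling quality, Q = −10·log10(P), where P is the estimated probability that a base call is wrong. As a minimal sketch (the function names are illustrative and not taken from the cited studies), the conversion in both directions can be written as:

```python
import math


def phred_from_error(p_error):
    """Convert a base-call error probability (0 < p <= 1) to a PHRED score."""
    if not 0.0 < p_error <= 1.0:
        raise ValueError("p_error must be in the interval (0, 1]")
    return -10.0 * math.log10(p_error)


def error_from_phred(q):
    """Convert a PHRED quality score back to an error probability."""
    return 10.0 ** (-q / 10.0)


# A 1-in-100 error rate corresponds to Q20; Q30 corresponds to 1 in 1000.
print(f"{phred_from_error(0.01):.1f}")   # prints "20.0"
print(f"{error_from_phred(30):.4f}")     # prints "0.0010"
```

Each 10-point increase in Q corresponds to a tenfold drop in error probability, which is why a high mismatch rate translates directly into a low PHRED score.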
Although the concept of NP analysis in early cancer detection is exceptionally promising, several key technological challenges must be addressed before this method can be implemented in clinical use. First and foremost, the biggest drawback of NP-based methods is their high mismatch and error rates. Because NP membranes, especially biological ones, are thick relative to a single nucleotide, NP sensitivity is still low at the single-nucleotide level. Furthermore, even though different DNA conformations and folds yield distinct, characteristic current blockades, information about the molecular structure cannot be determined by the NP membrane alone. To confirm the exact structure that causes a signature blockade in an NP, researchers need the aid of other techniques, such as circular dichroism (CD), FRET and FISH, among many others. This limits the use of NP membranes as an independent, stand-alone tool for molecular studies in general, and for early cancer detection specifically. Moreover, since a single biological molecule can quickly adopt multiple, complex conformations under different environments, many research groups choose to use short or simplified sequences in their NP studies. Hence, the complexity of cancer cells has not yet been demonstrated and/or fully investigated with NP membranes.
With this review paper, we hope to give our readers an overview of the essential genetic and epigenetic modifications in cancerous tissue and the progression of cancer cells. Given the complexity of the human body, and of cancer tissues in particular, many of the mechanisms of cancer proliferation remain unknown. NP-based membranes have shown their ability to detect chemical and structural modifications of various biomolecules, as well as genetic and epigenetic modifications. Thus, NP technology could become a single, simple alternative to many costly, labor-intensive conventional cancer screening methods. Given the complexity of the field, there is growing opportunity for more significant research to be conducted in the next few decades.
| This journal is © The Royal Society of Chemistry 2017 |