Arata
Tsuchida
ab,
Taikopaul
Kaneko
a,
Kaori
Nishikawa
a,
Mayu
Kawasaki
a,
Ryuji
Yokokawa
b and
Hirofumi
Shintaku
*abc
aCluster for Pioneering Research, RIKEN, Main Research Building 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
bDepartment of Micro Engineering, Graduate School of Engineering, Kyoto University, Kyotodaigaku-katsura, Nishikyo-ku, Kyoto 615-8540, Japan. E-mail: shintaku@infront.kyoto-u.ac.jp
cInstitute for Life and Medical Science, Kyoto University, 53 Kawara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan
First published on 20th March 2024
We introduce a simple integrated analysis method that links cellular phenotypic behaviour with single-cell RNA sequencing (scRNA-seq) by utilizing a combination of optical indices from cells and hydrogel beads. With our method, the combinations, referred to as joint colour codes, enable the link via matching the optical combinations measured by conventional epi-fluorescence microscopy with the concatenated DNA molecular barcodes created by cell-hydrogel bead pairs and sequenced by next-generation sequencing. We validated our approach by demonstrating an accurate link between the cell image and scRNA-seq with mixed species experiments, longitudinal cell tagging by electroporation and lipofection, and gene expression analysis. Furthermore, we extended our approach to multiplexed chemical transcriptomics, which enabled us to identify distinct phenotypic behaviours in HeLa cells treated with various concentrations of paclitaxel, and determine the corresponding gene regulation associated with the formation of a multipolar spindle.
Single-cell optical phenotyping and expression (SCOPE-seq and SCOPE-seq2)11,12 is a method for linking scRNA-seq with live cell imaging. In SCOPE-seq, a single cell and a bead bearing barcoded DNA are co-isolated in a microwell, images are obtained of the cellular morphology, and mRNA from a single cell on the bead is captured for pooled scRNA-seq. SCOPE-seq links the image of the single cell to the scRNA-seq by optically decoding the barcode of each bead using cyclic hybridisation of fluorescently labelled oligo (oligonucleotide) probes, followed by fluorescence microscopy.
Herein, we propose a novel and simple approach for optical indexing that leverages the combination of cells and hydrogel beads dual-labelled with optical indices and DNA molecular barcodes (we refer to this dual label as a “colour code”) for linking cellular images with scRNA-seq. To link the combinations of optical indices decoded from the imaging to the cell barcodes in scRNA-seq, our approach creates concatenated fragments of barcoded DNA oligos (DNA tags) derived from the cells and barcoded dT primers derived from the hydrogel beads. The concatenated fragments are then sequenced with the scRNA-seq library, and provide a look-up table to link the combinations of optical indices and cell barcodes.
Our approach is free of automated microfluidic controls and offers fewer on-chip steps, which is advantageous because this approach can be easily implemented in a standard laboratory setup. We demonstrate our approach with multiplexing up to 256 combinations of colour codes (joint colour codes), using 16 pools of colour-coded cells and 16 pools of colour-coded hydrogel beads, and decoding them with four colours of fluorescence via standard epi-fluorescence microscopy.
In our demonstration, we designed 16 colour codes, which were bright or dim combinations of four fluorescence dyes (24 = 16) and which also corresponded to 16 different sequences of DNA barcodes, respectively, for cells and hydrogel beads (see the Methods section). Thus, a maximum of 256 joint colour codes (16 × 16) can be registered. The expected number of cell–bead pairs with unique joint colour codes per experimental run was predicted to attain a maximum of approximately 94 when assaying 256 single cells on the basis of Poisson distribution, excluding the cells with duplicated joint colour codes (Fig. 1E).
To demonstrate our protocol, we performed mixed-species experiments using HeLa cells (human) and NIH/3T3 cells (mouse) (Fig. S3†). We prepared a pool of 16 differently colour-coded cells (eight sub-pools each of HeLa and NIH/3T3 cells) that were each labelled with a combination of four different dyes (CellTrace Violet, CFSE, Yellow, and Far Red from Thermo Fisher Scientific) (Fig. 1B and S1†) and corresponding DNA tags, which contained an 8 nt barcode, poly A sequence, and a PCR handle (Table S2†), via electroporation. We then isolated single cells from the pool of 16 colour codes in microwells and imaged them by epi-fluorescence and in a bright field. The hydrogel beads bearing barcoded primers with colour codes (Fig. 1B and S2†) were subsequently isolated in the microwells to capture mRNA and DNA tags.
To retain the molecules released from the cells within the microwells, we sealed the microwells with a track-etched membrane with nanopores of 10 nm in diameter, chemically lysed the cells in the microwells, and captured the mRNA and DNA tags by the hydrogel beads via hybridisation. After removing the track-etched membrane, we imaged the fluorescence of the hydrogel beads in the microwells and read the colour codes. We registered the joint colour codes with the images of the single cells by integrating the microscopic images of cells and hydrogel beads. We finally transferred the hydrogel beads to a standard PCR tube to synthesise libraries of the scRNA-seq and DNA tag by off-chip reactions (see the Methods section). The latter library yielded a look-up table that linked the cell barcodes in the cDNA fragments and joint colour codes by creating concatenated fragments of colour codes of hydrogel beads and cells (Fig. S4†).
Our microwell chip, which contained 2511 wells per chip, captured 149 ± 70 cells and 1326 ± 462 hydrogel beads per run (n = 13), and created 137 ± 64 pairs of single cells and hydrogel beads on average. The image-based decoding of the joint colour codes showed that the number of unique joint colour codes matched the theoretical prediction (Fig. 1E).
To link the cell barcode in scRNA-seq to single-cell images via joint colour codes, we devised a framework that optimises pairs of cell barcodes and single-cell images by maximizing the sum of the similarity between the decoded colour code from the images and counts of DNA tags (Fig. 2D and E). We benchmarked the framework in terms of the accuracy and number of linked datasets using the mixed-species data, computed with various metrics of similarity and normalisation approaches for the DNA tag counts. The results showed that the most optimal performance among those tested was obtained for the cosine similarity in combination with the centred log ratio (CLR per feature) for normalisation of DNA tags among those tested (Fig. 2F). The framework with the cosine similarity and CLR yielded a consistency of 91% for species at a threshold of 0.5 (Fig. 2A). We employed the same framework throughout this study.
We also assessed the durability of the DNA tags within the cells by quantifying their presence over time (Fig. 3C). After 48 h of labelling, the DNA tags remained detectable, indicating the potential for combining our approach with longitudinal live-cell imaging. Notably, the linking rate, a metric representing the fraction of cell images linked to scRNA-seq among those identified from the cell images, exhibited no degradation over time (Fig. 3D). Interestingly, the linking rate associated with electroporation labelling increased over time. We hypothesise that this trend may be attributed to selection bias in favour of healthy cells within the electroporated cell population. Furthermore, we conducted additional analyses to confirm the integration of labelled cells with unlabelled cells in the transcriptomic space (Fig. 3E and F). These data serve as a clear benchmark for cell tagging achieved through DNA delivery via electroporation and lipofection.
To understand the intricate relationship between the phenotypic and transcriptomic responses at the single-cell level, we subjected the DNA-tagged and colour-coded HeLa cells to paclitaxel treatment at eight distinct concentrations, ranging from 0.5 to 500 nM, over a 24 h period. Subsequently, we analysed the combined samples using our established approach. We utilised a single fluorescence channel to monitor the emergence of multipolar spindles as a phenotypic response to paclitaxel by staining the DNA with Hoechst 33342 dye. The colour-decoded images of individual cells revealed that multipolar spindles were more prevalent at higher concentrations of paclitaxel. Notably, even at identical concentrations of paclitaxel, the number of spindles exhibited considerable heterogeneity across the cells (Fig. 4A and B). These observations were in accordance with the findings from non-pooled assays conducted in separate dishes (Fig. S5E and F†).
To determine the mechanism underlying the heterogeneous cellular response, we leveraged the transcriptomic data linked to the phenotypic responses. There were 1364 ± 350 genes per cell (i.e., 4357 ± 2366 UMIs per cell, with 106119 sequence reads per cell on average), according to the transcriptomic data. The integrated multimodal data by weighted nearest neighbour (WNN) analysis enabled two distinct trajectories to be inferred, which is related to the formation of multipolar spindles, or lack thereof, in response to the paclitaxel burden within the transcriptomic data (Fig. 4C–E). Gene set enrichment analysis (GSEA) revealed that the genes associated with mitosis (mitotic cell cycle, mitotic cell cycle process, cell cycle process, cell cycle G2/M phase transition, cell cycle phase transition, G2/M transition of mitotic cell cycle, and regulation of cell cycle) were downregulated with increasing paclitaxel concentration, irrespective of the presence of multipolar spindle formation (Fig. 4F). The genes included in the Gene Ontology (GO) terms consistently exhibited the downregulation (Fig. 4G).
Subsequently, in trajectory 1 (no multipolar spindle formation), GSEA highlighted the induction of endoplasmic reticulum (ER) stress, which, if sustained or is severe, can potentially trigger apoptosis (Fig. 4H).15 Conversely, in trajectory 2, characterised by multipolar spindle formation, the RFC4 gene, known for its role in DNA replication and repair,16 consistently exhibited upregulation in response to increasing paclitaxel exposure (Fig. 4I). These findings underscore the remarkable power of integrated multimodal data analysis, and its ability to effectively distinguish the gene regulation between two trajectories associated with distinct phenotypic outcomes.
There are two major strategies for integrated phenotypic and transcriptomics analyses. The first is the optical decoding of the barcode sequence by sequential fluorescence in situ hybridisation, and the second is physical isolation of the cell of interest and then indexing by known barcode tags. SCOPE-seq2 employs the former strategy, and decodes the cell barcode of the hydrogel beads by performing cyclic hybridisation and readout with automated microfluidic control and microscopic imaging.12 As an example of the latter strategy, an automated cell-picking system was employed to isolate single cells into 96-well plates and then process them for scRNA-seq.20
In contrast to these methods, our approach to link the cellular phenotype with transcriptomics uses a combination of colour codes of cells and hydrogel beads to optically index pairs of single cells and hydrogel beads. Automated microfluidic controls and robotic systems are not required for this method, which incorporates fewer on-chip steps, and is compatible with standard epi-fluorescence microscopy, which are advantageous features because they can be implemented in a standard laboratory.
As demonstrated in our experiments to measure the paclitaxel burden on HeLa cells, the cell colour code also works as cell hashing for multiplex chemical screening. Our analysis revealed two distinct trajectories in transcriptomic response by paclitaxel treatment, which correspond to distinct phenotypic reactions. The trajectory that involved no multipolar spindle formation showed upregulation of the ER stress response. Prolonged and severe ER stress may induce apoptosis, or lead to the acquisition of drug resistance21 through the activation of unfolded protein response (UPR), a signalling pathway involved in adaptive and apoptotic response.22 The second trajectory exhibited generations of multipolar spindles, leading to chromosome missegregation and cell death.14
Our approach has the potential to be extended to the integrated analysis of dynamic phenotyping and transcriptomics using longitudinal live-cell imaging. The DNA tags were retained within the cells 48 h after labelling. For instance, the integration of our approach with longitudinal imaging of leukocytes at the sites of active inflammation can be used to potentially classify leukocytes by spatiotemporal behaviours23 and determine the molecular background. Furthermore, our approach can be readily integrated with the profiling of surface proteins via CITE-seq,6 thereby enabling the analysis to be coupled with another omics layer. The proposed opto-combinatorial indexing is also compatible with cell-hashing using DNA-tagged antibodies24 or lipids.25 The transfection-based approaches (electroporation or lipofection) used in our demonstration are robust and cost-effective, for instance, when assaying cells from non-model organisms.
In analysing the drug response of HeLa cells to paclitaxel, we labelled nuclei with a fluorescent dye (Hoechst 33342) to observe the nuclear morphology, resulting in a reduction in the number of cell colour codes. We envision that increasing the number of fluorescence channels by quantitatively demultiplexing the fluorophores with spectral overlap is the key to improving scalability and increasing observable phenotypic parameters. In the future, we hope to demonstrate high-content and improved multiplexing by increasing the fluorescence channels and using unmixing approaches.26
In conclusion, opto-combinatorial indexing provides a simplified strategy to simultaneously analyse the image and gene expression from single cells, and effectively dissect the molecular background of distinct phenotypic behaviours by integrating cellular phenotype and transcriptomics data.
We generated droplets of acrylamide premix with 50 μM acrydited primer (5′-Acryd/AAGCAGTGGTATCAACGCAGAGTACGACGCTCTT-3′) using a simple coflow microfluidic device. The final bead size was approximately 40 μm. We then ligated the first barcode fragments containing the bead colour code to the first part of the cell barcode (Table S1,† Stem_CC00_ID01–Stem_CC15_ID25). In the second round of ligation, we added fragments with the second part of the cell barcode, UMI and poly(T) (NNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTVN) (Table S1,† ID00_dT–ID32_dT). The combination of the first and second parts of the cell barcode created 6400 unique barcodes.
To stain beads with colour codes, we pooled the beads with cell barcodes in a single tube and combined 3 × 104 beads, a mixture of 6 μM branched oligos (Table S1,† Branch_00_NNNN–Branch_15_BGPR), a mixture of 12 μM readout oligos (with Alexa 488, Alexa 555, Alexa 647, and Alexa 750, Table S1,† Readout_Alexa647–Readout_Alexa750), 12 μM oligo without fluorophore (Table S1,† Readout_R0–Readout_B0 to fill the sequence in branched oligos for dim beads), and 0.1 mg mL−1 salmon sperm DNA in hybridisation buffer (5 mM Tris-HCl (pH 8.0), 1 M KCl, 5 mM EDTA, 0.05% (vol/vol) Tween-20). We incubated the mixture at 94 °C for 5 min and cooled it by 5 °C every 5 min to 25 °C, and then maintained it at 4 °C. Excess probes were washed three times with an ice-cold hybridisation buffer.
We reasoned that hybridisation-based staining of hydrogel beads insignificantly affects the synthesis of cDNA and the PCR amplification because the branched oligo hybridises downstream of the cDNA extension during reverse transcription, and the concentration of the branched oligo in PCR is estimated at 3.52 nM per colour code, while that of the PCR primer is at 240 nM. Furthermore, the melting temperature of the branched oligo is lower at 66.4 °C than that of the PCR primer at 77.5 °C (under conditions of 50 mM Na+ and 3 mM Mg2+ as an example), while the annealing temperature for PCR is 65 °C.
For the species-mixing experiment, we separately seeded HeLa cells and NIH/3T3 cells in a 100 mm dish at a concentration of 2.0 × 105 cells per mL for each, and cultured them for 1 day. After trypsinisation, we aliquoted cells equally into eight sub-pools per cell type in 16 tubes and individually stained them with 16 different combinations of four types of CellTrace dyes (5 μM Violet, 5 μM CFSE, 5 μM Yellow, and 1 μM Far Red, Invitrogen™) at the concentration of 1.0 × 106 cells per mL. We then individually suspended the stained cells in Gene Pulser® Electroporation Buffer (Bio-Rad) with 100 nM of DNA tag (Table S2†) corresponding to the respective fluorescence colour.
Immediately after electroporation by the Gene Pulser Mxcell™ Electroporation System (Bio-Rad, voltage: 250 V, capacitance: 2000 μF, resistance: ∞ Ohm, duration: 20 ms for HeLa cells, and 400 V, capacitance: 950 μF, resistance: ∞ Ohm, duration: 20 ms for NIH/3T3 cells), we added a five-fold volume of culture medium and incubated the cells for 1 h at 37 °C in 5% CO2. We washed the cells three times with 1× phosphate-buffered saline (PBS, 14249-24, Nacalai Tesque), and combined the 16 sub-pools in loading buffer (1% polyvinylpyrrolidone in 1× PBS).
For the paclitaxel treatment experiments, we seeded the HeLa cells at the concentration of 5.0 × 104 cells per mL in eight wells of a 24-well plate and cultured them for one day. We individually transfected cells with different types of DNA tags using Lipofectamine® 3000 reagent (Invitrogen™) according to the manufacturer's protocol (incubation for 4 h at 20 nM DNA tag concentration). After lipofection, we washed the cells three times with culture medium. We individually stained the cells in each well with eight combinations of three CellTrace dyes (5 μM CFSE, 5 μM Yellow, and 1 μM Far Red). We cultured cells for one day in culture medium containing paclitaxel (163-28163, FUJIFILM) at different concentrations. We washed the cells three times with 1× PBS. After trypsinisation, the eight sub-pools of cells at equal concentration were pooled equally into one tube. Subsequently, the cells were stained with 10 μg mL−1 Hoechst 33342 and resuspended in loading buffer.
For the tag retention assay, we seeded HeLa cells at a concentration of 5.0 × 104 cells per mL and cultured them for one day, followed by lipofection with 20 nM DNA tags and culturing for 4 h. In addition, we performed electroporation on HeLa cells at 1.0 × 106 cells per mL in Gene Pulser® Electroporation Buffer with 100 nM DNA tag under the same conditions described above. We then cultured the cells in a 24-well plate for varying durations up to 48 h and stained them with six different combinations of three CellTrace dyes (5 μM CFSE, 5 μM Yellow, and 1 μM Far Red). After three PBS washes, we pooled all of the cells at the same concentration and resuspended them in loading buffer.
We dropped 20 μL of the colour-coded bead suspension at the concentration of 1.0 × 106 beads per mL onto the microwell array and incubated them for 10 min to capture the beads in the microwells, followed by tapping and resting for 1 min at 37 °C, which was repeated five times. To seal the microwells with a track-etched membrane with nanopores of 10 nm in diameter (Sterlitech), the membrane was pre-treated with atmospheric plasma (BD-20, Electro-Technic Products) for 60 s and then hydrated in PBS. Then, we pressed the membrane and the PDMS slab by placing a glass slide (8 mm square per side, S7214, MATSUNAMI) and a 100 g weight at 37 °C for 30 min. We removed the glass slide by adding 3 mL of PBS. We lysed the cells with 3 mL of cell lysis buffer (5 M guanidine thiocyanate, 1 mM EDTA, 0.50% sarkosyl (N-laurylsarcosine), 1.0% 2-mercaptoethanol), and then agitated them in a microplate shaker at 5–60 rpm for 20 min.
Next, we washed the microwell array with 3 mL of hybridisation buffer (2 M NaCl, 0.64% PEG-8000, 0.52× PBS) to hybridise the released mRNA and DNA tag with the primers on the hydrogel beads by agitation at 5–60 rpm for 40 min. After removing the membrane over the microwell array, we acquired images of the microwell array containing the hydrogel beads with adjusted exposure times. To collect beads, we placed the microwell array directly into a 200 μL tube and flushed it with 200 μL of wash solution 1 (2 M NaCl, 3 mM MgCl2, 20 mM Tris-HCl (pH 8.0), 0.64% PEG-8000, 0.1% Tween 20). We exchanged wash solution 1 for wash solution 2 (50 mM Tris-HCl (pH 8.3), 75 mM KCl, 6 mM MgCl2, 0.4 U μL−1 Recombinant RNase Inhibitor (2313A, Takara), 0.1% Tween 20) by repeating centrifugation (3000 × g, 4 °C, 3 min) and the buffer exchange twice.
The first-strand cDNA was amplified by PCR in 50 μL of reaction solution containing 0.24 μM primer 2, 9 nM additive primer, 1× SeqAmp PCR Buffer and 0.025 U μL−1 SeqAmp DNA Polymerase (Takara) (Table S2†) using the following program: 95 °C for 1 min; 16–18 cycles of 98 °C for 10 s and 65 °C for 30 s; 68 °C for 4 min; and 72 °C for 10 min. We purified the mRNA-derived cDNAs (long cDNA) and the DNA tag-derived cDNAs (<200 bp short cDNA) by size fractionation using SPRIselect beads (B23318, Beckman Coulter).
We mixed the cDNA product with 0.6× SPRIselect beads and placed it on the DynaMag™-Spin Magnet (Invitrogen™) to capture the beads. We then transferred the first supernatant containing the DNA-tagged product to a new tube and further purified it with 1.4× SPRI beads. The magnetic beads were washed three times (long) and twice (short) on a magnetic stand with 0.2 mL of fresh 80% (vol/vol) ethanol and were then air-dried for 2.5 min at room temperature. The cDNAs derived from mRNA and DNA tags were eluted with 13 and 11 μL of elution buffer (10 mM Tris-HCl, pH 8.5), respectively.
For the mRNA-derived cDNA, we examined the yield, quality, and size distribution using a Qubit 4 Fluorometer (Thermo Fisher Scientific) and quantitative real-time PCR (qPCR) targeting GAPDH (glyceraldehyde-3-phosphate dehydrogenase, Hs02758991_g1, Thermo Fisher Scientific) with an Agilent High Sensitivity DNA Kit and the Bioanalyzer 2100 (Agilent). We then performed the tagmentation of 600 pg of cDNA using a Nextera XT DNA Library Prep Kit (Illumina) and subsequent PCR with custom indexing primers. The PCR products were then purified with 0.6× SPRIselect beads and eluted with 6.5 μL of Resuspension Buffer (RSB, Illumina).
To construct the DNA tag library, the short cDNA was amplified in 20 μL of 1× KAPA Hifi Hotstart Ready Mix (Roche) containing 1.6 μL of 10-fold diluted templates and 0.25 or 0.5 μM of indexing primers (Table S2†) according to the following program: 98 °C for 2 min; 2 cycles of 98 °C for 20 s and 74 °C for 30 s; 12–18 cycles of 98 °C for 20 s and 72 °C for 30 s; and 72 °C for 5 min. The library was then purified with 1.5× SPRI beads and eluted with 8 μL of elution buffer. We assessed the yield and length of the library using a Qubit™ dsDNA HS Assay Kit and an Agilent High Sensitivity DNA Kit (Agilent), respectively.
The typical size of the mRNA-derived cDNA library was approximately 500 bp, while the size of DNA tag libraries was 224 bp. Finally, we quantified the library using KAPA Library Quantification Kits (Roche) according to the manufacturer's protocol. The library was sequenced on a HiSeq X (Illumina) instrument with 2 × 150 bp paired-end reads.
When the centres of the cells were within the radii of the bead-captured wells, we assigned them as co-captured pairs. To compensate for the different focal lengths of the nuclei of paclitaxel-treated cells in the microwells, z-stack images were obtained with a 10× lens (Olympus UPlanFL N 10×) at 3 μm intervals from the bottom to the top of the wells. We then executed an extended depth-of-field algorithm29 provided by Fiji software. We counted the spindle poles using CellProfiler30 by enhancing the speckles with the ‘EnhabceOrSuppressFeature’ module and segmenting them into individual poles with the ‘IdenfifyPrimaryObjects’ module.
We assumed that cosine similarity is a function of the signal-to-noise ratio of tag counts expressed by the following equation:
Furthermore, we combined paclitaxel concentrations (log10-transformed with a 0.1 offset) and the number of spindle poles to create principal components using the ‘prcomp’ function from the stats package. To incorporate both sets of principal components in subsequent analyses, we performed a WNN analysis33 utilizing the ‘FindMultiModalNeighbors’ function (with a k-nearest neighbors' parameter of 25) from the Seurat package.
log(μgi) = β0 + βψψ + log(Ni) |
log(μgi) = β0 + βψ,1t1ψ + βψ,2t2ψ + log(Ni) |
where tk indicates a factor denoting the assignment to trajectory k. We assessed the significance of βψ,2 − βψ,1 to derive fold changes and p values.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3lc00866e |
This journal is © The Royal Society of Chemistry 2024 |