Kevin
Jahnke‡§
a,
Nina
Struve‡
*b,
Daniel
Hofmann
a,
Martin Julius
Gote¶
a,
Margund
Bach
a,
Malte
Kriegs‡
b and
Michael
Hausmann‡
*a
aKirchhoff-Institute for Physics, Heidelberg University, Im Neuenheimer Feld 227, 69120 Heidelberg, Germany. E-mail: hausmann@kip.uni-heidelberg.de
bDepartment of Radiotherapy & Radiation Oncology, University Medical Center Hamburg – Eppendorf, Martinistr. 52, 20246 Hamburg, Germany. E-mail: ni.struve@uke.de
First published on 24th July 2024
Super-resolution microscopy has been used to show the formation of receptor clusters and adapted lipid organization of cell membranes for many members of the ErbB receptor family. The clustering behaviour depends on the receptor size and shape, possibly ligand binding or expression activity. Using single molecule localization microscopy (SMLM), we also showed this typical clustering for the epidermal growth factor receptor variant III (EGFRvIII) in glioblastoma multiforme (GBM) cells. EGFRvIII is co-expressed with the wild type (EGFRwt) and both receptors are assumed to preferentially form hetero-dimers leading to transactivation and elevated oncogenic EGFR-signalling in GBM cells. Here, we analysed EGFRvIII and EGFRwt co-localization using our already described model system of the glioblastoma cell line DKMG, displaying endogenous EGFRvIII expression. Using EGFRvIII and EGFRwt specific antibodies, EGFR localization and their potential for dimerization in a given membrane cluster were analysed by dual colour SMLM supported by novel approaches of mathematic evaluations including Ripley statistics, persistent homology and similarity algorithms. Surprisingly, cluster analysis, Ripley point-to-point distance statistics for cluster geometry and persistent homology comparing cluster topology, revealed that both EGFRvIII and EGFRwt do primarily not form hetero-dimers but the results support the hypothesis that they tend to form homo-dimers. The ratio of homo-dimers obtained by this calculation was significantly higher (>5σ, standard deviation) than expected from randomly arranged points. In comparison, hetero-dimer formation was only slightly increased. We confirmed these data by immunoprecipitation, which show no co-precipitation of EGFRvIII and EGFRwt. Furthermore, we showed that the topology of the clusters was more similar among the same type than among the different types of receptors. Taken together, these data indicate that EGFRvIII does induce oncogenic signalling by homo-dimerisation and not preferentially by hetero-dimer formation with EGFRwt. These data offer a new perspective on EGFRvIII signalling which will lead to a better understanding of this tumour associated receptor variant in GBM.
Glioblastoma multiforme (GBM), a high-grade aggressive glioma with an estimated 5-year survival rate of less than 10%,11,12 is a common and one of the most malignant brain cancers.13 Despite extensive research and many approaches for aggressive treatment,14–16 GBM remains one of the cancers with mostly poor prediction and outcome for the patient, i.e., the majority of patients is dying within 15 months after their diagnosis.17 GBM is treatment-resistant and highly invasive which is believed to be the reason for the high recurrence rates observed after surgery, radiation treatment and chemotherapy.18
About 50% of GBM cases show increased wild type EGFR (EGFRwt) expression19 due to egfr gene amplification which often is accompanied by gene rearrangements. The most common gene mutation leads to the epidermal growth factor receptor variant III (EGFRvIII20 or de2-7EGFR21) frequently expressed in GBM. EGFRvIII lacks the exons 2–7 resulting in a loss of the EGFR's N-terminal extra-cellular domain which is usually involved in ligand binding and dimerization. EGFRvIII has been shown to be an important factor involved in many different cellular mechanisms influencing tumour genesis and treatment response,22 which is, however, under controversial debate. For example, EGFRvIII mediates the sensitivity towards the chemotherapeutic temozolomide by upregulating the DNA mismatch repair which is associated with better survival of EGFRvIII-positive patients.23 While this is true for patients with methylguanine-DNA methyltransferase (MGMT) promotor methylation, the opposite could also be found for other entities.24 EGFRvIII induces replication stress and genomic instability which can lead to increased sensitivity towards irinotecan.25 However, EGFRvIII does not affect radio-sensitivity with and without anti-EGFR treatment.26,27 These studies show on the one hand that EGFRvIII may be a promising biomarker for future GBM therapy but on the other the need for mechanistic insights to better understand EGFR biology and the patients’ response towards GBM therapy.
In contrast to EGFRwt, which is activated by ligand binding and subsequent dimerization, EGFRvIII is thought to be constitutively active because of the loss of EGFR's extra-cellular domain which is usually involved in ligand binding and dimerization.28 The precise mechanisms of this activation and regulated signalling pathways are not completely understood; however, dimerization-independent activity and oncogenic signalling relationships between EGFR and EGFRvIII have been described.29 A selective pressure for EGFRwt expression in EGFRvIII positive cells may be suggested29 which may be due to physical binding of EGFRwt and EGFRvIII.21 Although EGFRvIII is known to be ligand-independent, the addition of EGF ligands resulted in an increased phosphorylation of EGFRvIII on double positive cells suggesting a EGFRwt ligand-dependent cross-phosphorylation of EGFRvIII.29 If EGFRvIII expression can only be observed in the presence of EGFRwt, this raises the intriguing questions as to whether EGFRvIII promotes hetero-dimerization and how these hetero-dimers between a fully intact and a deleted receptor variant should be formed.
EGFRwt is known to be localized in membranous structures such as coated pits.30 In addition, super-resolution imaging has revealed the clustering of EGFRwt.31 However, less is known about the precise localization and arrangement of EGFRvIII. By means of super-resolution single molecule localization microscopy (SMLM) we have shown that EGFRvIII undergoes cluster formation.5 The precise functions and consequences of these spatial arrangements for the dimerization potential, however, have remained elusive. While homo-dimerization was suggested as the predominant mechanism of EGFRvIII in glioblastoma,32 the potential of hetero-dimerization10 or even both possibilities33 could not be excluded. This has motivated further detailed studies of dimerization-correlated receptor arrangements using super-resolution (SMLM) as used in previous studies.4,5
SMLM has been established as an approach for super-resolution light microscopy34 that circumvents the Abbe-Rayleigh boundary conditions of optical resolution35 and enables precise single molecule localization.36 The precision is on the order of a few ten nanometres and even about 10 nm (ref. 36) in 3D conserved cells whereby standard objective lenses and optics are used.37,38 The fundamental concept of localization microscopy is based on optical isolation by different spectral signatures, e.g., by using fluorophores that can be switched between two different spectral states39–41 to achieve a temporal isolation and thus a spatial separation of single signals. This allows the determination of signal positions (spatial coordinates) and their spatial distances even if they are adjacent below the Abbe-Rayleigh resolution limit. All acquired positions of individual fluorescent molecules can then be merged into a coordinate matrix (called “ortematrix”).
This resulting coordinate matrix can be subjected to further data processing actions and especially mathematical algorithms for point pattern analysis without further typical procedures for image processing for quality improvements of the visual image.42 Distances of points can be calculated from the coordinates as Euclidian distances between the coordinates scaled with object size units obtained from the pixel size of the camera and the magnification used. Interpretation tools as basis for biological investigations are algorithms derived from Ripley's statistics based on the point-to-point distance frequency distributions and topological calculations obtained by persistence homology and appropriate similarity measures (see42 and citations therein).
SMLM and other super-resolution microscopy techniques have been successfully applied to study membranes and receptor cluster formation (see for instance4,5,31,43) or chromatin arrangements to distinguish cell types according to their heterochromatin or euchromatin networks, e.g.44 or to study radiation induced DNA damaging and repair, e.g.38,45 Here, we apply SMLM to study the probability for the formation of hetero- and homo-dimers of EGFRwt and EGFRvIII in a well-established GBM cell model.5
For the subsequent dimer analysis, antibodies from different species were needed which had to be specific either for EGFRwt or for EGFRvIII. The specificity of the selected antibodies (mouse-derived αEGFR R-1 for EGFRwt and rabbit-derived αEGFR D6T2Q for EGFRvIII) was tested using Chinese hamster ovary cells (CHO) which lack EGFR expression. These cells were transfected by EGFRwt or EGFRvIII encoding vectors and analysed by immunofluorescence microscopy. As shown in Fig. 2, both antibodies were highly specific and showed no cross reactivity.
![]() | ||
Fig. 3 Examples of SMLM images (a) EGFRwt, (b) EGFRvIII, (c) and (d) visualization of the respective cluster formation. Scale bar is 1 μm. The inserts are enlarged by a factor of about 3. |
From 8 prepared slides, two labelled ones (slide 1 and slide 5) and one control (labelled by secondary antibodies only) were selected according to the quality required for SMLM (see Experimental chapter). The two completely labelled slides in general showed different absolute numbers of blinking events (Fig. 4a) but the same tendencies in structural formations. Although the data were only obtained from a few slides it has to be considered that all results are obtained from single cell data sets subjected to established, standardized tools of a computer analysis. This ensures a high reproducibility of the evaluation process for each cell. However, the cell line analysed showed a broad variability in the quantitative values as showed by the standard box plot graphics. It might be apposite to ask for reasons of this variability observed. At a first glimpse one might think about preparation effects that on one hand can never be excluded completely. On the other hand, real biological reasons have to be included into the consideration. Recently, Alekseenko et al.46 have pointed out reasons for low reproducibility of quantitative results between individual tumour cells. Each cell that could be subjected to single cell experiments has slightly different conditions in the microenvironment which might lead to little functional differences in the individual cells. This might be negligible in bulk experiments but in single cell measurements it could become visible. Or with other words, the more precisely single cell measurements are performed, the more variations between individual cells may become obvious.
For the number of blinking events, no significant differences were seen between the control group and the slides with the complete colouring. Obviously the secondary antibodies tend to non-specifically attachment on the cell membranes. This seems to be a typical shortcoming of the cell model used as being discussed and elaborated in more detail in Boyd et al.5 Therefore it was necessary also to calculate the frequencies of distances where the two groups could be separated from the controls (see Ripley statistics below). Both groups exhibited signal numbers in the same orders of magnitude in two detection channels; for EGFRwt, the mean detection numbers fluctuated between 7000 and 13000, for EGFRvIII the mean detection numbers were much higher, i.e., in the range between 18
000 and 25
000. The event numbers, however, differed significantly between the 488 nm (EGFRwt) and 568 nm (EGFRvIII) channels. On average, up to twice as many events were found for EGFRvIII compared to EGFRwt. This corresponds to the fact that the deletion mutant is more highly expressed than EGFRwt in the cell line used.22 The higher event number for EGFRvIII can also be seen in the localization image (Fig. 3b). However, the clearly more pronounced fluorophore-rich area outside the cell in the EGFRvIII channel, visible in Fig. 3b on the right, also indicates an elevated background. This suggests that there could be also some background inside the cell. Thus, the increased signal number in EGFRvIII might result from a more pronounced expression and a presumably increased background of non-specifically bound secondary antibodies.
The different number of blinking events reasoned the selection of the dyes for the secondary antibodies and that in case of distance measurements between the two receptor types, always the distances from EGRFwt to EGFRvIII were measured.
The results of the Ripley analysis were supported by the cluster analysis using DBSCAN with the cluster conditions that a minimum of 5 points must be located within a radius of 60 nm around a single point to form a cluster. In Fig. 3c and d, the results of this clustering are shown for the examples of Fig. 3a and b. On average about four times more EGFRvIII clusters than EGFRwt clusters were obtained (Fig. 5a). While the mean cluster size was only slightly smaller for EGFRvIII (Fig. 5c), EGFRvIII clusters showed a much larger variation in the mean density than the clusters of EGFRwt (Fig. 5b). However, the mean number of points per cluster did not vary between EGFRvIII and EGFRwt (Fig. 5d).
![]() | ||
Fig. 5 Box plots (for description see Fig. 4) for different features of the clusters according to the DBScan with a minimum of 5 points within a radius of 60 nm around a point: (a) number of clusters per cell; (b) mean point density in the clusters; (c) mean cluster size; (d) mean number of points per cluster. |
In the case of EGFR, it is known that the receptors, induced by a ligand via their extracellular domain, form homo- or hetero-dimers with other ErbB receptors.48 Since EGFR has only one transmembrane domain, the dimer spacing would be the distance between the two transmembrane domains. With the help of the crystal structure of an EGFRwt homo-dimer this distance can be estimated to 7.9 nm.49 Both primary antibodies used bind to the extracellular domain around the EGF binding site.50,51 Accordingly, these two were about 8 nm apart.
The primary antibody for EGFRwt has a molecular mass of 150 kDa and the one for EGFRvIII has a molecular mass of 130 kDa. These have an approximate size52 of 14 × 8 × 4 nm3 and have the shape of a Y, which consists of several protein chains. The two primary antibodies were fluorescently labelled by specific secondary antibodies “loaded” with the fluorophores, which were assumed to have about the same spatial extension. The size of the fluorophores was negligibly small since it is limited by a molecular mass of 759 Da for AlexaFluor594 or 721 Da for AlexaFluor488. These secondary antibodies were not free in their binding, as the binding site was located at the lower end of the primary antibody. This also restricted the binding angle. In addition it has to be considered that it is possible that even two or perhaps three secondary antibodies bind to one primary antibody. Unfortunately, neither literature nor the manufacturer could give concrete estimates of the probability of double or triple secondary antibody binding.
Under the first assumption that only one secondary antibody targets one primary antibody, the theoretically maximum possible distance for a dimer was estimated to:
Dimer distance (max) = 2 × (4 nm + 8 nm + 8 nm) = 40 nm |
The first term (4 nm) originated from the distance between the receptors as obtained in the crystal structure. The binding and expansion of the primary and secondary antibodies determined the second (8 nm) and third (8 nm) terms. The bonds they formed were restricted but not uniform. The minimum dimer distance that can be registered is given by the localization precision of the point detection. This means the precision with which the coordinate of a single point can be determined. Or with other words, a point is detected as a single point as long as it only fluctuates within this region. The localization precision was in the order of 10 nm (see Table S1 in the ESI†). In the estimate the minimum distance was set to 0 nm which means within localization precision only.
The relative frequencies of the next neighbour distances were counted for EGFRwt–EGFRwt, for EGFRvIII–EGFRvIII and for EGFRwt–EGFRvIII. In Fig. 6a–c the results are shown for slide 1. The red curve was calculated as the envelope of the histograms. These histograms were compared to next neighbour distance histograms of a random distribution obtained from the same number of points randomly distributed on the same area as the total membrane section evaluated (Fig. 6d–f).
Using the estimated maximum value for dimerization as calculated above, the relative amount of homo-dimers and hetero-dimers were calculated from the frequency distributions of the next neighbour distances. On average 38.04% of the EGFRwt and 69.14% of the EGFRvIII seemed to form homo-dimers according to our distance criteria.
These relative high values of presumptive homo-dimers have to be discussed in more detail. In principle these values are biased by the possibility that two or even three secondary antibodies can bind to one primary antibody. Assuming that for instance two secondary antibodies bind in such a way that their axes are perpendicularly orientated to the axis of the primary antibody, their distance could be estimated to 20 nm (2 × 8 nm for the length of the secondary antibodies + 4 nm for the thickness of the primary antibody). This distance is larger than the localization precision so that the value would contribute to the results of our point distance determination and would be interpreted as a homo-dimer. The frequency of such an antibody situation cannot be determined. However, since the secondary antibodies do not preferentially bind in the described perpendicular arrangement, it could be expected that some of them are orientated in a small angle to the axis of the primary antibody so that the two dyes cannot be separated within the localization precision. Moreover, it cannot be excluded that many primary antibodies carry only one secondary antibody or that a measured distance between two blinking events derive from two secondary antibodies that are attached to two closely adjacent primary antibodies although both primary antibodies carry more than one secondary antibody.
We also analysed the frequencies of distances to the next neighbour, the two next neighbours and the five next neighbours (Fig. S1, ESI†). For both types of receptors the three curves revealed a very similar shape with a maximum at distances below 40 nm. Since the probability that five next neighbours are secondary antibodies of one primary antibody is very low or even tends to zero, we assume in comparison with the data obtained for hetero-dimerization that it seems to be reasonable that a considerable amount of distance pairs in our data set were due to homo-dimerization.
Another point that should be mentioned in the context of the secondary antibodies is the degree of labelling (DOL) of such antibodies which can range from 2–8 dye molecules per antibody. Usually in a biological micro-environment several of these dye molecules quench each other, but in general it cannot be excluded that more than one dye molecule is actively blinking at one secondary antibody. However, if it is assumed that the length of such an antibody is about 8 nm along which the dye molecules can be arranged, the multiple blinking events are below the localization precision (Table S1†) and therefore recorded as one event.
Our data support the hypothesis that EGFRs prefer homo-dimerisation.53 These results obtained for homo-dimerization were significantly different from the results obtained for the same number of randomly distributed points (5.95% or 18.13%, respectively) (Table 1). The difference of these two values of the random distributions corresponds to the different numbers of blinking events in general which we observed for EGFRwt and EGFRvIII.
Sample | Receptor EGFR | Measurement | Random data | ||
---|---|---|---|---|---|
Mean distance [nm] | Amount of dimers [%] | Mean distance [nm] | Amount of dimers [%] | ||
Slide 1 | wt | 5.37 ± 3.14 | 47.02 | 158.24 ± 11.27 | 5.95 |
vIII | 7.05 ± 3.05 | 72.1 | 62.2 ± 1.39 | 21.01 | |
Control | wt | 45.16 ± 4.27 | 34.96 | Not available | |
vIII | 11.29 ± 1.69 | 74.36 | 74.35 ± 1.80 | 17 | |
Slide 5 | wt | 31.15 ± 3.83 | 32.15 | Not available | |
vIII | 15.30 ± 2.27 | 60.97 | 74.37 ± 1.73 | 16.37 | |
Mean wt | 38.04 ± 6.45 | Significance 8.7σ | 5.95 | ||
Mean vIII | 69.14 ± 5.85 | Significance 5σ | 18.13 ± 2.05 |
In contrast to the results for homo-dimerization, the results obtained for hetero-dimerization of 18.47% did not significantly differ from the result of randomly distributed points (17.06%) within one standard deviation (Table 2). This rather low value for hetero-dimerization might also support the assumption that multiple binding of secondary antibodies is not the major labelling situation.
Sample | Measurement | Random data | ||
---|---|---|---|---|
Mean distance [nm] | Amount of dimers [%] | Mean distance [nm] | Amount of dimers [%] | |
Slide 1 | 60.18 ± 4.38 | 23.08 | 66.42 ± 1.84 | 19.76 |
Control | 87.93 ± 3.03 | 17.49 | 73.45 ± 2.17 | 15.63 |
Slide 5 | 93.48 ± 2.61 | 14.85 | 75.48 ± 1.82 | 15.79 |
Mean | 18.47 ± 3.43 | Mean | 17.06 ± 1.91 |
In summary, these calculations and measurements by SMLM indicated that the formation of homo-dimers is considerably preferred to the formation of hetero-dimers for EGFRwt and EGFRvIII. These findings are also supported by bio-physical simulation models (see below) of clustering of proteins integrated into a bi-lipid membrane where clustering of equally sized proteins is preferred which could better support homo-dimerization that hetero-dimerization.54–56
A well-established measure to compare the barcode patterns of two receptor clusters is the Jaccard index.57,59 Due to normalization of this similarity measure, index values between 0 and 1 can be obtained. A value of 0 means that there is no overlap when the two compared bar sets are superimposed; with a value of 1, the two bar sets are identical.
For a representative data set obtained from slide 1, the Jaccard indices were calculated for the components and for the holes and averaged. The results are depicted in a heatmap representation in Fig. 8a. When comparing EGFRwt with EGFRwt similarity values between 0.420 and 0.445 were obtained. For the comparison of EGFRvIII with EGFRvIII, the similarity values were lower between 0.410 and 0.425. These values were further but only partly decreased in the case of comparing EGFRwt with EGFRvIII (between 0.400 and 0.430). For a better overview, 2nd generation heatmaps59 were calculated in which one pixel represents the mean value of the values of one 1st generation heatmap. These 2nd generation heatmaps are shown in Fig. 8b for the components, the holes and the average of components and holes. In all three heatmaps the comparison of EGFRwt with itself has the highest similarity followed by slightly lower similarities for EGFRvIII with itself. The comparison of EGFRwt with EGFRvIII and vice versa resulted in the lowest similarity values.
EGFR negative Chinese Hamster Ovary (CHO) cells were cultivated as described in ref. 61 using α-minimal essential medium (Gibco) supplemented with 5% fetal calf serum (FCS, Gibco). For EGFRwt and EGFRvIII over-expression, CHO cells were transfected with 2 μg plasmid carrying either DNA for EGFRwt or EGFRvIII receptors using Lipofectamine 2000 according to the manufacture's protocol (Thermo Fisher) and protein expression was analysed after 24 h using immunofluorescence microscopy. All cells were grown at 37 °C, 5% CO2 and 100% humidification. All cells were identified by a short tandem repeat multiplex assay (Applied Biosystems) and were negatively tested for mycoplasma.
For the data presented here, four replicates of slides with primary and secondary antibodies and four control replicates with secondary antibodies only were prepared. After a first pre-screening those slides which were used for the results presented here were selected by visual inspection and according to the quality standards required for SMLM data acquisition and evaluation. This means that due to blinking background signals outside the cells, the data sets acquired for two of the slides were excluded as being insufficient for further evaluation. Slides No. 1 and No. 5 were subjected to detailed evaluation. The quality selection also excluded two slides of the controls. Since the remaining control slides both showed the same results, they were merged to one control data set for better overview.
A so-called neutral density filter wheel allows the regulation of the lasers intensity with twelve different grey filters of different densities. After the filter wheel, the light beam is expanded and focussed onto the specimen. The emitted light of the fluorophores then is separated from the laser beam by a dichroic and a blocking filter wheel before it is focussed onto the CCD camera (Sensicam QE, PCO, Kehlheim, Germany) which measures the intensity of the incoming light. The CCD camera used, has a size of 1376 × 1040 pixels whereas each pixel covers an area of 6.45 × 6.45 μm2. In combination with the objective 100×/NA 1.4, this yields an effective pixel size of 64.5 × 64.5 nm2.
The selection and in-focus adjustment of cells for data acquisition was accomplished by visual means. Therefore, low intensities were chosen such that the shape of the cells could be seen but nearly no bleaching of the fluorophores occurred. In order to measure only receptors on the membrane of the cell and to irradiate the specimen equally within a confined area, a region of interest (ROI) was chosen for each measurement. These were chosen to be planar membrane extensions of single cells. After the ROIs were determined, a wide-field image and a localization measurement were recorded for each wavelength. To avoid bleaching of the fluorophores, data acquisition always started with the larger wavelength (568 nm before 488 nm). For the localization measurements, 1000 image frames were acquired at a frequency of approximately 7.7 frames per s and with an integration time of 100 ms.
An important advantage of the orte-matrix is its simplicity and the opportunity to sort out signals with too low intensities. This can be done by a threshold factor, which in this work was set to be 3. This means that all intensities which occur in the orte-matrix have to be at least as big as three times the background intensity. Therefore, autofluorescence of the cell and light emission of molecules different than the used fluorophores become negligible.
In addition, the localization precision of the obtained signals is determined by the brightness of a point. On average it is in the order of 10 nm (Table S1, ESI†), which is above the theoretical limits for SMLM because of the influence of the setup, the fluorophores and the preparation of the cells.
The density of signals on the membrane was calculated by manually cutting the images into defined sized (2 × 2 μm2) squares. This is necessary because in general the membrane was only making up 75% of the acquired ROIs. Therefore, the membrane part of each ROI was subdivided into 20–25 squares of the same size and the density of signals on the membrane thereby calculated.
For the investigation of clustering of receptors, the DBSCAN algorithm was used65 with a cluster definition with a minimum number of five signals within a radius of 60 nm. All the points which fulfill this condition are considered to be in a cluster. This method yielded values for cluster sizes, cluster diameter and distance distributions within and out of clusters.
The coordinate values of the points were subjected to Ripley statistics47 for which the point-to-point distances were measured by distances from each central point to all its peripheral ones. These pairwise distances were summarized in a distance frequency histogram and the envelope function was used to determine point arrangements.42 A homogeneous point-to-point distance distribution leads to a linear curve with a different slope. The diagonal means a random distribution. If the points form clusters, smaller distances are more frequent which results in a peak.
In order to determine the probability of homo- or hetero-dimerization, a custom-written script was used to determine the next, two next, and five next distances for each point. The orte-matrices therefore had to be shift corrected. In this work special significance lies on the next-neighbour distances because they would be possible dimer distances. Because the total number of points of EGFRwt of the cell line DKMGvIII+ is comparably small in contrast to the number of EGFRvIII, the hetero-distances were determined from wild type to the deletion mutant.
As a final processing step Persistent Homology (PH)57,58 analysis was used to explore the topology of the labelled proteins.42 This method aims to find out the significant structures of a point pattern. In order to obtain a pattern that represents these major features, each point is surrounded by a circle with an increasing radius. Here, two properties were considered: (a) the number of “components”, which is the number of elements that are still separated by this process and (b) the number of “holes” of the structures inside the components. These holes are “born” if the circles around the points have formed one closed component with free space inside. In algebraic topology, these properties are called zero- and one-dimensional complexes. These two dimensions can be represented by barcodes. Therefore, a geometric relationship among detected points is obtained by growing circles around each point (starting of bars for dimension 0). Whenever two of these circles mutually embed each-other's center, these centers are connected and they belong to the same component (end of a bar of dimension 0). With increasing radii, more points are connected to one component. Whenever a component forms a closed curve (in its simplest form a triangle) with free space inside, the area of this space is considered as a hole (start of a bar for dimension 1). With reducing the number of components, the number of holes is first increasing and then decreasing again when the holes are closed (end of a bar for dimension 1). The end points values of the bars for the sets of barcodes can be shown in frequency histograms giving information of the structural arrangements of the point pattern. The similarity of the bar codes can be quantified by the Jaccard index.59 The result of this normalized similarity measure is a value between 0 and 1, where 0 means no overlap and 1 the identity of two bars. The Jaccard indizes were calculated for each receptor cluster. All values for one cell were averaged and presented in a heatmap.57 Such heatmaps were computed for components and holes or for the mean values of both. The results obtained by these heatmaps were averaged for each type of receptor and the mean values can be compared by a 2nd generation heatmap.60
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4nr01570c |
‡ Equal contribution. |
§ Present addresses: School of Engineering and Applied Sciences, Harvard University, 9 Oxford St, 02138 Cambridge, MA, USA. |
¶ Present addresses: KU Leuven, Department of Biosystems, Division of Animal and Human Health Engineering, Geel Campus, Kleinhoefstraat 4, 2440 Geel, Belgium. |
This journal is © The Royal Society of Chemistry 2024 |