Open Access Article
This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

DeSciDe: a tool for unbiased literature searching and gene list curation unveils a new role for the acidic patch mutation H2A E92K

Cameron J. Douglasab and Ciaran P. Seath*a
aDepartment of Chemistry, Wertheim UF Scripps, Jupiter, Florida 33418, USA. E-mail: cseath@ufl.edu; cseath@scripps.edu
bThe Skaggs Graduate School of Chemical and Biological Sciences, 120 Scripps Way, Jupiter, FL 33458, USA

Received 1st July 2025 , Accepted 22nd September 2025

First published on 10th November 2025


Abstract

Omics analysis has become an indispensable tool for researchers in the life sciences, enabling the study of DNA, RNA, and proteins and how they respond to cellular stimulus. Many methods of data analysis exist for the generation and characterization of gene lists, however, selection of genes for further investigation is still heavily influenced by prior knowledge, with practitioners often studying well characterized genes, reinforcing bias in the literature. Here, we have developed an open-source, R package for impartial ranking of gene lists derived from omics analysis that we term deciphering scientific discoveries (DeSciDe). We applied a pipeline that sorts a gene list first by precedence, which we define as co-occurrence of the gene with pre-defined search terms in publications. We then rank gene lists by connectivity, an underutilized metric for how related a gene is to other enriched genes. The combination of these rankings by scatterplot provides a method for gene selection by simple visual analysis. We apply this analysis method to published Omics datasets, identifying novel avenues for investigation. Further, using this method we have been able to assign a novel loss of function role for the histone mutation H2A E92K.


1. Introduction

The study of biochemical processes is increasingly performed using large scale omics analyses to survey changes in RNA or protein expression (RNA-seq/proteomics). These experiments generate large data sets that require robust filtering and analysis to determine broader biological implications. A suite of effective tools has been developed to this effect such as STRING, a database of known and predicted protein–protein interactions, gene ontology (GO),1,2 a knowledge base about the functions of genes, and gene set enrichment analysis (GSEA),3 a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states (Fig. 1A).2–6 These open-source tools have become an essential resource and are widely used throughout the community. These tools are well suited to uncovering relationships between enriched genes but are not able to study relationships between enriched genes and the cellular stimulus under study, leading to a degree of manual curation that results in human bias towards the study of well characterised genes. This bias perpetuates the continued study of a subset of genes, leaving many more underexplored. When deploying more complex omics experiments, such as proximity proteomics, where nearby proteins to a protein of interest are identified7 or CRISPR screens, where ranked lists of genes that confer sensitivity or resistance to a biological challenge of interest are identified from cells with genetically encoded perturbations,8 the importance of data curation is even more critical as false positives are more common.
image file: d5mo00160a-f1.tif
Fig. 1 An open-source pipeline for unbiased omics analysis. (A) Omics analysis is often used to understand how stimulus can change cellular biology. (B) DeSciDe performs automated literature data mining to rank gene lists for precedence within a particular field or in relation to a particular stimulus or disease. (C) DeSciDe ranks genes according to their STRING connectivity, providing a ranked list of genes. Connectivity analysis can provide a numerical basis for choosing genes for follow up studies.

When studying gene lists from omics analyses, it can be valuable to search for interactions between enriched genes to aid users in assigning molecular mechanisms. These physical interactions are typically explored using STRING analysis, where genes are plotted as interconnected nodes. This tool is enabling when studying small lists of genes, but the graphical output becomes unwieldy and challenging to deconvolute when large numbers of interconnected genes are present, limiting the use of the tool when analysing lists with greater than 50 hits.

Based on these limitations, end users of omics methods have an urgent need for tools to aid in unbiased selection of gene hits for follow up studies. In considering this need we identified several requirements; (1) a method must incorporate the cellular stimulus or pathway that is being studied; (2) the method should be able to assign value to interactions between genes; (3) the program must be readily available and applicable to many different experimental types.

To address these requirements, we have developed an R package to enable the analysis of gene lists that are experimentally associated with a relevant search term. Our package, Deciphering Scientific Discoveries (DeSciDe), can incorporate any number of cellular stimuli and cross-reference them against lists of enriched genes. Based on the co-occurrence of the gene and the cellular stimulus, a hit is assigned as either strongly or poorly associated with a particular search term. Further, our system computes network connectivity metrics from the gene interaction networks provided by the STRING database, producing a quantitative evaluation of connectivity. The gene lists can then be sorted based on two key values: interconnectivity and precedence. Through analysis of existing datasets, we demonstrate that connectivity is a viable metric for selecting genes for follow up investigation, and when combined with the quantification of literature precedence, can provide a systematic and objective method for gene selection. We anticipate that such a tool will be valuable for myriad applications across the biological sciences.

2. Results and discussion

We began developing a tool to search gene lists against any desired term that could represent a cellular stimulus (Fig. 1B). This was achieved by cross searching every gene in the given list against each search term using PubMed abstracts as the database. The resulting references are reported in a table as the number of references per gene and search term combination. Additionally, we incorporate a graphical representation of the citations in the form of a heatmap. The heatmap shows number of references for each stimulus/search term, which can be used to visualize the top “hits” based on literature precedence for multiple search terms at once. DeSciDe then ranks the genes based on literature precedence. This can be done using two different methods: weighted and total. The default method is weighted, in which the gene list is filtered by the number of publications associated with the first input term followed by filtering for the subsequent terms in the order provided. This method allows the user to prioritize highly specific search terms (e.g., histone H3 K23 acetylation), while still incorporating broader cellular contexts (e.g., cancer) in their terms list without biasing the results toward the term with the highest numbers of publications. Alternatively, ranking by total number of publications across all search terms for a gene can be conducted when users do not deem it necessary to prioritize a specific cellular stimulus context.

Next, we sought to establish a metric for quantifying and ranking gene interconnectivity, which we suggest may be valuable and generally applicable for hit selection following omic analyses. We incorporated existing STRING interaction networks and quantified each gene according to two criteria: number of interactions (known as degree in graph theory), and connectivity (known as clustering coefficient in graph theory) (Fig. 1C). The number of interactions is straightforward, representing the number of connections made by each node. A gene's connectivity score represents the percentage of existing connections compared to the theoretical maximum number of connections in the subnetwork spanned by the respective node (the network comprised of the node and its neighbours). We then compile the gene lists and their network properties in a table that is filterable. By default, the genes are ranked first by the number of connections and then by percent connectivity. The package also produces a scatterplot showing the number of interactions versus the percent connectivity. This type of plot provides a visual representation of how interconnected the genes in the list are. Genes in the bottom left of the plot have few connections and low connectivity, whereas the top right corner contains the most connected set of genes (Fig. 1C).

The final component of the application is the combination of precedence and connectivity rankings. Here, we create a scatterplot that displays the rank order of precedence versus the rank order of connectivity. Since rank order arranges values from high to low, in this visualization, hits that have high precedence and high connectivity appear close to the origin. By default, DeSciDe classifies these genes based on a 20% threshold of total genes in the list with high-connectivity, high-precedence genes falling in the top 20th percentile of ranked genes in both lists and high-connectivity, low-precedence genes falling in the top 20th percentile of connectivity and the bottom 20th percentile in precedence. This threshold can be adjusted by the user as deemed fit for their analysis. We found this to be a broadly useful visualization for hit selection. To illustrate how DeSciDe might be used we applied our workflow to four case studies.

The plots produced by the DeSciDe analysis pipeline can be easily exported and saved for use in presentations and publications. Examples of the plots produced by default running of DeSciDe can be seen in SI (Fig. S1–S5). Additionally, the data tables of results can be saved as TSV, CSV, or Excel files for further analysis or for generating revised figures.

We began by analysing proximity proteomics data sets. We chose these as examples as they are uniquely suited to this analysis for several reasons: (1) the bait (or protein of interest bearing the proximity labelling enzyme or catalyst) is not typically included in standard analysis as it is often spatial9 or biological (i.e., a small molecule11), and (2) these experiments are typically designed to look for unknown interactions. Therefore, careful filtering of gene lists is required before choosing genes of interest for further investigation, adding an element of human bias.

First, we analysed a proximity proteomics dataset published by Geri et al. that describes the interactome of the receptor PDL1 on Jurkat cells.9 PDL1 plays an important role in the immune system as a “save me” signal, stopping immune cells from attacking healthy cells. As PDL1 is frequently expressed by cancer cells to the immune system, it has become an active oncology target.12 The purpose of this experiment was to identify novel interactors of PDL1 that may play a role in immune oncology. From the gene list provided, we filtered for all differentially enriched genes that met statistical significance (41 total) and passed them through the DeSciDe pipeline (Fig. 2A) with the keyword's cancer, immunology, and checkpoint blockade. STRING analysis of the 41 hits was informative, with 12 genes in the list having no known interactions and 24 genes having at least three known interactors within the data set (Fig. 2B). To illustrate how connectivity is calculated from STRING data, nodes surrounding FCER2 and HLA-B are shown in Fig. 2C. DeSciDe ranks these genes by connectivity (Fig. 2D), placing ICAM1, CD40, and CD274(PDL1) as the top three hits. Of these three, CD274 is the bait protein, and CD40 and ICAM1 have both been validated to colocalize with PDL1 in immune synapses and are themselves targets for immune based therapies.13–15


image file: d5mo00160a-f2.tif
Fig. 2 Pipeline for analysis of a proximity proteomics dataset. (A) Graphical representation of DeSciDe pipeline for unbiased ranking of gene lists. (B) STRING analysis of PDL1 interactomics data set, published by Geri et al.9 (C) Example of how connectivity analysis is performed. (D) PDL1 interactome ranked by connectivity (top 6 genes shown). (E) Heatmap showing results of DeSciDe data mining against the search terms “cancer” “immunology” and “checkpoint blockade”. (F) Scatterplot of genes ranked for connectivity vs. precedence with suggested alternate genes for investigation highlighted in boxes. Graphs made in Prism from exported DeSciDe data.

Next, using DeSciDe datamining, we cross referenced the gene list with the search terms described above and plotted via heatmap (Fig. 2E). These data clearly show that the genes within this data set have significant overlap with the search terms immunology and cancer, but fewer with the more specific term checkpoint blockade. Our application makes it trivial to identify related genes from the heatmap. The ranked lists were then combined and displayed as a scatterplot of precedence vs. connectivity (Fig. 2F). The genes of highest connectivity and highest precedence are located near the origin. In this data set, that includes the previously discussed ICAM1, CD40, and PDL1, in addition to CD70, HLA-A, and the death receptor FAS. Based on previous reports in the area and the precedence from the data mining, all the genes within this sector are confident hits.16–19 With this knowledge, it can be assumed that connectivity is a reasonable metric to rank genes. If this is correct, then moving to the top left quadrant, where connectivity remains high, but the genes have far fewer precedented reports relating to the three search terms, may provide novel targets for investigation. In this area, we found 7 genes (TNFRSF8, FCER2, LY75, CD300A, SCARB1, LILBR1) that are candidates to be novel interactors, with less known about their involvement in PDL1 based checkpoint blockade.

In our next case study, we examined a proximity proteomics dataset with a significantly more complex interactome. The experiment, published recently by the laboratories of MacMillan and Muir, described how a somatic mutation on histone H2A disrupts the nucleosome microenvironment in HEK293T cells (Fig. 3A).10 This type of hypothesis generating experiment is a good match for our data analysis pipeline as the proteomics data can lead to several areas of study and is easily influenced by inherent bias. In the original study the authors identified several enriched genes (SIRT6, DNMT3A/B, BRD2/3/4) for further investigation. When analysing this data set with our pipeline, we found that these genes were among the most well studied (by searching the terms chromatin, nucleosome, and acidic patch) ranking 2nd, 3rd, 4th, 11th, and 12th of all genes in our measure of precedence (Fig. 3B). Visual inspection of STRING analysis for this data set was not instructive due to the density of the interaction networks (Fig. 3C). However, connectivity analysis suggested a set of genes that were not implicated in the original study (Fig. 3D and E).


image file: d5mo00160a-f3.tif
Fig. 3 Re-analysis of proximity proteomics data studying the effect of somatic mutations on the nucleosome acidic patch. (A) Original published dataset. Genes highlighted were validated or taken for further investigation10. (B) Heatmap derived from DeSciDe analysis using the search terms “chromatin” “nucleosome” and “acidic patch”. The genes taken on for further study were in the top 5 most studied genes (in the context of the three search terms employed). (C) STRING analysis of significantly enriched genes is too complex for visual interpretation. (D) Graphical interpretation of STRING analysis via DeSciDe computed connectivity. Genes are more readily visualized as being highly connected. (E) Unbiased gene selection by DeSciDe, sorting for either precedent or connectivity.

Plotting the ranked lists against each other proved to be enlightening, with both critical quadrants i.e., nearest the origin (high confidence genes) and the top left quadrant (highly connected but less well studied in this context), both suggesting investigation of genes relating to regulation of the cell cycle (Fig. 4A and B).


image file: d5mo00160a-f4.tif
Fig. 4 Re-analysis of proximity proteomics data studying the effect of somatic mutations on the nucleosome acidic patch. (A) DeSciDe plotting of connectivity vs. precedence provides new avenues of investigation. (B) DeSciDe analysis suggests genes related to regulation of cell cycle for further analysis, a novel phenotype for acidic patch mutations. Graphs made in Prism from exported DeSciDe data. (C) Cell-cycle analysis via flow cytometry using propidium iodide (n = 3, 50[thin space (1/6-em)]000 cells counted per replicate, whiskers represent standard deviation). P-Value for 1.1% difference in G2 phase = 0.0221. (D) Cell proliferation data over 72 h comparing HEK293T expressing H2A or H2AE92K plated at 10[thin space (1/6-em)]000 cells/well in a 96-well plate at time = 0 h (n = 30, whiskers represent standard deviation, data graphed represents mean fluorescent intensity at indicated timepoint). (E) Mechanistic model for cell-cycle stalling and reduced proliferation for E92K mutation. *P < 0.05, ***P < 0.001, ****P < 0.0001.

Recently, McGinty and co-workers demonstrated the role of the acidic patch in coordinating VRK1 phosphorylation of H3T3 during cell division.20 Furthermore, pathogenic mutations on VRK1 were shown to disrupt this interaction, providing a molecular basis for how these rare mutants may cause rare adult-onset distal spinal muscular atrophy. Our reanalysed data also suggest that the acidic patch may play a role in cellular division, and that the E92K mutation may lead to deregulation of this critical cellular pathway.

We further investigated this by performing cell-cycle analysis using propidium iodide in HEK293T cells stably expressing H2A or H2A E92K. We observed the mutant cell line contains a higher concentration of cells in the G2 phase, suggesting that the small proportion of mutated nucleosomes confer a subtle change in the cell cycle (Fig. 4C). Based on this stalling, we anticipated that the mutant cell line would show reduced proliferation compared to the wild-type cell line. Cell proliferation assays of both cell lines over 72 hours showed the E92K mutation significantly decreased proliferation, in line with our hypothesis (Fig. 4D). Based on these data, we can assign a new role for the E92K acidic patch mutation. The previously reported chemoproteomics data shows that the mutation disrupts interactions between the nucleosome acidic patch and cell cycle proteins AURKB, CDC6, and CyclinB1, which all play a significant role in the G2-M transition during cell division.23–25 This disruption leads to stalling in the G2 phase of cell division and subsequent reduction in proliferation (Fig. 4E). The data suggests these effects are subtle, likely due to the relatively low incorporation (approx. 1 in 16 nucleosomes), and that it is likely that only one face of each mutant nucleosome bears the mutation. However, this effect is in line with the chemoproteomics and ATAC-seq data previously reported.10 This example indicates that unbiased analysis using a program such as DeSciDe can provide different avenues of investigation that may be overlooked in favour of genes that are highly represented in the literature.

Furthermore, we demonstrate that DeSciDe is widely applicable to omics analysis by reanalysing publicly available datasets from RNA-seq, global proteomics, CRISPR screens, and ATAC-seq experiments. A recent RNA-seq experiment published by Ma et al., was deployed to study differential splicing in the context of TDP-43 deficient FTD-ALS.21 The authors chose the mRNA UNC13A for follow up studies, demonstrating its role in ALS pathology (Fig. 5A). Plotting connectivity vs. precedence using the search terms RNA-splicing, ALS, and TDP-43 suggested UNC13A as the highest confidence hit (most connected and most precedented), illustrating that this bioinformatics methodology can recapitulate complex data analyses without prior knowledge of the field (Fig. 5B). Further, based on the scatterplot, we can suggest alternative genes for investigation that either cluster around UNC13A or have less precedented associations with the search terms, which may not be obvious candidates for follow-up studies (Fig. 5C).


image file: d5mo00160a-f5.tif
Fig. 5 Analysis of RNA-seq data using DeSciDe. (A) Brief outline of RNA-seq experiment performed in the study by Ma et al.21 (B) DeSciDe analysis suggests UNC13A as the highest confidence hit, recapitulating the authors analysis in an unbiased manner. (C) DeSciDe analysis can suggest alternate genes for further analysis based on novelty or confidence. (D) Aim of proteomics and transcriptomics experiments re-examined using DeSciDe. (E) Both proteomics and transcriptomics analyses via DeSciDe arrive at the same conclusion as the authors. Many high confidence genes remain unexplored within both studies.22 Graphs made in Prism from exported DeSciDe data.

Finally, we analysed a 2020 study on the proteomic and transcriptomic host response to COVID-19.22 One key aim from these experiments was to identify factors that contribute to fatality following infection (Fig. 5D). From 4065 and 637 differentially expressed genes across RNA-seq and proteomic datasets, respectively, the authors highlighted expression of cathepsins as a marker for poor prognosis. DeSciDe analysis also points towards CTSB and CTSL as high confidence hits in the quadrant closest to the origin in both RNA-seq and proteomics datasets (Fig. 5E). Once again, these data suggest that analysis through the DeSciDe pipeline and using connectivity and literature datamining to rank hits is a viable and useful bioinformatics method for unbiased analysis of gene lists.

While this method of data analysis appears to be powerful for ranking genes in an unbiased way, some deficiencies remain. Connectivity is based upon and intrinsically related to precedence, where more connections have been reported for more “popular” genes, so completely uncharacterized genes will still be ignored using this analysis. Further, the search function within this application cannot filter for the most relevant journal articles, only those that contain the gene name and the manually selected search terms, so some articles may not actually show a meaningful connection. The selection of the search terms used can produce different results, so there is a burden on the user to be thoughtful of what terms they wish to employ when running DeSciDe. Additionally, this platform operates best with gene lists of >20 and <500, as small datasets result in limited hits in the quadrants of interest, and large datasets may still produce hundreds of genes in the regions of interest, decreasing the utility of the tool to narrow down hits. Therefore, curation of the omics dataset to include a list of the top statistically significant hits is important to gain meaningful insight from DeSciDe. As with any biological investigation, interpretation of data falls to the researcher. DeSciDe can help guide a researcher towards genes of interest, but ultimately they will need to investigate and validate the hits experimentally. Finally, these analyses do not include enrichment fold change, which is often a metric used for gene selection. We are currently working on methods to improve the analysis pipeline to solve some of these issues.

3. Conclusions

In summary, we have developed an open-source R package for unbiased analysis of gene sets from omics experiments. The application can rank genes by how connected they are to the rest of the dataset and by how well they are associated in the literature with predefined search terms. The combination of these two rankings provides a valuable scatterplot that can be used to identify high confidence genes for further investigation. Using this method, we reanalyse a proximity proteomics data set and identify new biological implications of H2A E92K mutations. We believe this application will find broad usage within the life sciences and will aid researchers in identifying new avenues for biological investigation. The code of the application is freely available under the MIT License at https://github.com/camdouglas/DeSciDe.

Conflicts of interest

There are no conflicts to declare.

Data availability

Supplementary information (SI): a vignette for detailed instruction on how to use the DeSciDe package and tables of the genes selected from select publications and their DeSciDe results (XLXS). DeSciDe is available for download on CRAN at https://CRAN.R-project.org/package=DeSciDe. See DOI: https://doi.org/10.1039/d5mo00160a.

Code can be accessed at https://github.com/camdouglas/DeSciDe.

Acknowledgements

CPS would like to thank Wertheim UF Scripps for startup funding. This work was supported by NIH (1R35GM150765-01 to CPS). Thank you to the MacMillan Lab at Princeton University for providing the cell lines used in this study.

Notes and references

  1. Gene Ontology Consortium, S. A. Aleksander, J. Balhoff, S. Carbon, J. M. Cherry, H. J. Drabkin, D. Ebert, M. Feuermann, P. Gaudet, N. L. Harris, D. P. Hill, R. Lee, H. Mi, S. Moxon, C. J. Mungall, A. Muruganugan, T. Mushayahama, P. W. Sternberg, P. D. Thomas, K. Van Auken, J. Ramsey, D. A. Siegele, R. L. Chisholm, P. Fey, M. C. Aspromonte, M. V. Nugnes, F. Quaglia, S. Tosatto, M. Giglio, S. Nadendla, G. Antonazzo, H. Attrill, G. Dos Santos, S. Marygold, V. Strelets, C. J. Tabone, J. Thurmond, P. Zhou, S. H. Ahmed, P. Asanitthong, D. Luna Buitrago, M. N. Erdol, M. C. Gage, M. Ali Kadhum, K. Y. C. Li, M. Long, A. Michalak, A. Pesala, A. Pritazahra, S. C. C. Saverimuttu, R. Su, K. E. Thurlow, R. C. Lovering, C. Logie, S. Oliferenko, J. Blake, K. Christie, L. Corbani, M. E. Dolan, H. J. Drabkin, D. P. Hill, L. Ni, D. Sitnikov, C. Smith, A. Cuzick, J. Seager, L. Cooper, J. Elser, P. Jaiswal, P. Gupta, P. Jaiswal, S. Naithani, M. Lera-Ramirez, K. Rutherford, V. Wood, J. L. De Pons, M. R. Dwinell, G. T. Hayman, M. L. Kaldunski, A. E. Kwitek, S. J. F. Laulederkind, M. A. Tutaj, M. Vedi, S.-J. Wang, P. D’Eustachio, L. Aimo, K. Axelsen, A. Bridge, N. Hyka-Nouspikel, A. Morgat, S. A. Aleksander, J. M. Cherry, S. R. Engel, K. Karra, S. R. Miyasato, R. S. Nash, M. S. Skrzypek, S. Weng and M. Westerfield, Genetics, 2023, 224, iyad031 CrossRef CAS PubMed.
  2. M. Ashburner, C. A. Ball, J. A. Blake, D. Botstein, H. Butler, J. M. Cherry, A. P. Davis, K. Dolinski, S. S. Dwight, J. T. Eppig, M. A. Harris, D. P. Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J. C. Matese, J. E. Richardson, M. Ringwald, G. M. Rubin and G. Sherlock, Nat. Genet., 2000, 25, 25–29 CrossRef CAS PubMed.
  3. A. Subramanian, P. Tamayo, V. K. Mootha, S. Mukherjee, B. L. Ebert, M. A. Gillette, A. Paulovich, S. L. Pomeroy, T. R. Golub, E. S. Lander and J. P. Mesirov, Proc. Natl. Acad. Sci. U. S. A., 2005, 102, 15545–15550 CrossRef CAS PubMed.
  4. B. Snel, G. Lehmann, P. Bork and M. A. Huynen, Nucleic Acids Res., 2000, 28, 3442–3444 CrossRef CAS PubMed.
  5. D. Szklarczyk, A. L. Gable, K. C. Nastou, D. Lyon, R. Kirsch, S. Pyysalo, N. T. Doncheva, M. Legeay, T. Fang, P. Bork, L. J. Jensen and C. von Mering, Nucleic Acids Res., 2021, 49, D605–D612 CrossRef CAS PubMed.
  6. V. K. Mootha, C. M. Lindgren, K.-F. Eriksson, A. Subramanian, S. Sihag, J. Lehar, P. Puigserver, E. Carlsson, M. Ridderstråle, E. Laurila, N. Houstis, M. J. Daly, N. Patterson, J. P. Mesirov, T. R. Golub, P. Tamayo, B. Spiegelman, E. S. Lander, J. N. Hirschhorn, D. Altshuler and L. C. Groop, Nat. Genet., 2003, 34, 267–273 CrossRef CAS PubMed.
  7. P. Samavarchi-Tehrani, R. Samson and A.-C. Gingras, Mol. Cell. Proteomics, 2020, 19, 757–773 CrossRef CAS PubMed.
  8. C. Bock, P. Datlinger, F. Chardon, M. A. Coelho, M. B. Dong, K. A. Lawson, T. Lu, L. Maroc, T. M. Norman, B. Song, G. Stanley, S. Chen, M. Garnett, W. Li, J. Moffat, L. S. Qi, R. S. Shapiro, J. Shendure, J. S. Weissman and X. Zhuang, Nat. Rev. Methods Primers, 2022, 2, 8 CrossRef CAS.
  9. J. B. Geri, J. V. Oakley, T. Reyes-Robles, T. Wang, S. J. McCarver, C. H. White, F. P. Rodriguez-Rivera, D. L. Parker, E. C. Hett, O. O. Fadeyi, R. C. Oslund and D. W. C. MacMillan, Science, 2020, 367, 1091–1097 CrossRef CAS PubMed.
  10. C. P. Seath, A. J. Burton, X. Sun, G. Lee, R. E. Kleiner, D. W. C. MacMillan and T. W. Muir, Nature, 2023, 616, 574–580 CrossRef CAS PubMed.
  11. A. D. Trowbridge, C. P. Seath, F. P. Rodriguez-Rivera, B. X. Li, B. E. Dul, A. G. Schwaid, B. F. Buksh, J. B. Geri, J. V. Oakley, O. O. Fadeyi, R. C. Oslund, K. A. Ryu, C. White, T. Reyes-Robles, P. Tawa, D. L. Parker and D. W. C. MacMillan, Proc. Natl. Acad. Sci. U. S. A., 2022, 119, e2208077119 CrossRef CAS PubMed.
  12. H. O. Alsaab, S. Sau, R. Alzhrani, K. Tatiparti, K. Bhise, S. K. Kashaw and A. K. Iyer, Front. Pharmacol., 2017, 8, 561 CrossRef PubMed.
  13. T. Tang, X. Cheng, B. Truong, L. Sun, X. Yang and H. Wang, Pharmacol. Ther., 2021, 219, 107709 CrossRef CAS PubMed.
  14. M. Jung, Y. Yang, J. E. McCloskey, M. Zaman, Y. Vedvyas, X. Zhang, D. Stefanova, K. D. Gray, I. M. Min, R. Zarnegar, Y. Y. Choi, J.-H. Cheong, S. H. Noh, S. Y. Rha, H. C. Chung and M. M. Jin, Mol. Ther. Oncolytics, 2020, 18, 587–601 CrossRef CAS PubMed.
  15. D. Djureinovic, M. Wang and H. M. Kluger, Cancers, 2021, 13, 1302 CrossRef CAS PubMed.
  16. Y. Gu, Q. Xue, Y. Chen, G.-H. Yu, M. Qing, Y. Shen, M. Wang, Q. Shi and X.-G. Zhang, Hum. Immunol., 2013, 74, 267–276 CrossRef CAS PubMed.
  17. D. White, A. Cote-Martin, M. Bleck, N. Garaffa, A. Shaaban, H. Wu, D. Liu, D. Young, J. Scheer, I. C. Lorenz, A. Nixon, J. S. Fine, F. R. Byrne, M. L. Mbow and M. E. Moreno-Garcia, Mol. Immunol., 2023, 156, 31–38 CrossRef CAS PubMed.
  18. F. Perea, A. Sánchez-Palencia, M. Gómez-Morales, M. Bernal, Á. Concha, M. M. García, A. R. González-Ramírez, M. Kerick, J. Martin, F. Garrido, F. Ruiz-Cabello and N. Aptsiauri, Oncotarget, 2018, 9, 4120–4133 CrossRef PubMed.
  19. Y. Li, S. Yang, H. Yue, D. Yuan, L. Li, J. Zhao and L. Zhao, Pathol. Oncol. Res., 2020, 26, 1451–1458 CrossRef CAS PubMed.
  20. G. R. Budziszewski, Y. Zhao, C. J. Spangler, K. M. Kedziora, M. R. Williams, D. N. Azzam, A. Skrajna, Y. Koyama, A. P. Cesmat, H. C. Simmons, E. C. Arteaga, J. D. Strauss, D. Kireev and R. K. McGinty, Nucleic Acids Res., 2022, 50, 4355–4371 CrossRef CAS PubMed.
  21. X. R. Ma, M. Prudencio, Y. Koike, S. C. Vatsavayai, G. Kim, F. Harbinski, A. Briner, C. M. Rodriguez, C. Guo, T. Akiyama, H. B. Schmidt, B. B. Cummings, D. W. Wyatt, K. Kurylo, G. Miller, S. Mekhoubad, N. Sallee, G. Mekonnen, L. Ganser, J. D. Rubien, K. Jansen-West, C. N. Cook, S. Pickles, B. Oskarsson, N. R. Graff-Radford, B. F. Boeve, D. S. Knopman, R. C. Petersen, D. W. Dickson, J. Shorter, S. Myong, E. M. Green, W. W. Seeley, L. Petrucelli and A. D. Gitler, Nature, 2022, 603, 124–130 CrossRef CAS PubMed.
  22. M. Wu, Y. Chen, H. Xia, C. Wang, C. Y. Tan, X. Cai, Y. Liu, F. Ji, P. Xiong, R. Liu, Y. Guan, Y. Duan, D. Kuang, S. Xu, H. Cai, Q. Xia, D. Yang, M.-W. Wang, I. M. Chiu, C. Cheng, P. P. Ahern, L. Liu, G. Wang, N. K. Surana, T. Xia and D. L. Kasper, Proc. Natl. Acad. Sci. U. S. A., 2020, 117, 28336–28343 CrossRef CAS PubMed.
  23. E. Müllers, H. Silva Cascales, H. Jaiswal, A. T. Saurin and A. Lindqvist, Cell Cycle, 2014, 13, 2733–2743 CrossRef PubMed.
  24. J. R. Bischoff, L. Anderson, Y. Zhu, K. Mossie, L. Ng, B. Souza, B. Schryver, P. Flanagan, F. Clairvoyant, C. Ginther, C. S. Chan, M. Novotny, D. J. Slamon and G. D. Plowman, EMBO J., 1998, 17, 3052–3065 CrossRef CAS PubMed.
  25. E. Lau, C. Zhu, R. T. Abraham and W. Jiang, EMBO Rep., 2006, 7, 425–430 CrossRef CAS PubMed.

This journal is © The Royal Society of Chemistry 2025
Click here to see how this site uses Cookies. View our privacy policy here.