Analyzing data with APL

Elzbieta Gralinska1*, Clemens Kohl1** and Martin Vingron1***

1Max Planck Institute for Molecular Genetics, Berlin, Germany

*gralinska@molgen.mpg.de
**kohl@molgen.mpg.de
***vingron@molgen.mpg.de

28 April 2026

Abstract

This package performs correspondence analysis (CA) and allows to identify cluster-specific genes using Association Plots (AP). Additionally, APL computes the cluster-specificity scores for all genes which allows to rank the genes by their specificity for a selected cell cluster of interest.

Package

APL 1.16.0

1 Introduction
2 Installation
- 2.1 Changes regarding python dependencies
3 Preprocessing
4 Quick start
5 Step-by-step way of computing Association Plots
6 APL and GO enrichment analysis
Session info

1 Introduction

“APL” is a package developed for computation of Association Plots, a method for visualization and analysis of single cell transcriptomics data. The main focus of “APL” is the identification of genes characteristic for individual clusters of cells from input data.

When working with APL package please cite:

Gralinska, E., Kohl, C., Fadakar, B. S., & Vingron, M. (2022).
Visualizing Cluster-specific Genes from Single-cell Transcriptomics Data Using Association Plots.
Journal of Molecular Biology, 434(11), 167525.

A citation can also be obtained in R by running citation("APL"). For a mathematical description of the method, please refer to the manuscript.

2 Installation

To install the APL from Bioconductor, run:

if (!requireNamespace("BiocManager", quietly = TRUE)) {
  install.packages("BiocManager")
}

BiocManager::install("APL")

Alternatively the package can also be installed from GitHub:

library(devtools)
install_github("VingronLab/APL")

To additionally build the package vignette, run instead

install_github("VingronLab/APL", build_vignettes = TRUE, dependencies = TRUE)

Building the vignette will however take considerable time.

2.1 Changes regarding python dependencies

Previous versions of APL used pytorch SVD to speed up the computation of the full SVD. This has been deprecated in favor of fast truncated SVD implementations starting with Version 1.10.1. Calling runAPL or cacomp with python = TRUE will not lead to an error, but only issue a warning. If you still want to perform a full SVD, set the dimensions to rank of the matrix. Until a faster replacement is identified, this computation will be performed by the rather slow base R svd and should therefore not be done on very large matrices. The default number of dimensions now defaults to half of the rank of the matrix.

3 Preprocessing

3.1 Setup

In this vignette we will use a small data set published by Darmanis et al. (2015) consisting of 466 human adult cortical single cells sequenced on the Fluidigm platform as an example. To obtain the data necessary to follow the vignette we use the Bioconductor package scRNAseq.

Besides the package APL we will use Bioconductor packages to preprocess the data. Namely we will use SingleCellExperiment, scater and scran. However, the preprocessing could equally be performed with the single-cell RNA-seq analysis suite Seurat.

The preprocessing steps are performed according to the recommendations published in Orchestrating Single-Cell Analysis with Bioconductor by Amezquita et al. (2022). For more information about the rational behind them please refer to the book.

library(APL)
library(scRNAseq)
library(SingleCellExperiment)
library(scran)
library(scater)
set.seed(1234)

3.2 Loading the data

We start with the loading and preprocessing of the Darmanis data.

darmanis <- DarmanisBrainData()
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
darmanis
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> class: SingleCellExperiment 
#> dim: 22085 466 
#> metadata(0):
#> assays(1): counts
#> rownames(22085): 1/2-SBSRNA4 A1BG ... ZZZ3 tAKR
#> rowData names(0):
#> colnames(466): GSM1657871 GSM1657872 ... GSM1658365 GSM1658366
#> colData names(6): metrics age ... experiment_sample_name tissue
#> reducedDimNames(0):
#> mainExpName: NULL
#> altExpNames(0):

3.3 Normalization, PCA & Clustering

Association Plots from APL should be computed based on the normalized expression data. Therefore, we first normalize the counts from the Darmanis data and calculate both PCA and UMAP for visualizations later.

For now, APL requires the data to be clustered beforehand. The darmanis data comes already annotated, so we will use the cell types stored in the cell.type metadata column instead of performing a clustering.

set.seed(100)
clust <- quickCluster(darmanis)
darmanis <- computeSumFactors(darmanis, cluster = clust, min.mean = 0.1)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
darmanis <- logNormCounts(darmanis)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> Warning in .local(x, ...): 'normalizeCounts' is deprecated.
#> Use 'scrapper::normalizeCounts' instead.
#> See help("Deprecated")

dec <- modelGeneVar(darmanis)
#> Warning in fitTrendVar(fm, fv, ...): 'fitTrendVar' is deprecated.
#> Use 'scrapper::fitVarianceTrend' instead.
#> See help("Deprecated")
#> Warning in combineBlocks(collected, method = method, equiweight = equiweight, : 'combineBlocks' is deprecated.
#> See help("Deprecated")
top_darmanis <- getTopHVGs(dec, n = 5000)
#> Warning in getTopHVGs(dec, n = 5000): 'getTopHVGs' is deprecated.
#> Use 'scrapper::chooseHighlyVariableGenes' instead.
#> See help("Deprecated")
darmanis <- fixedPCA(darmanis, subset.row = top_darmanis)
#> Warning in fixedPCA(darmanis, subset.row = top_darmanis): 'fixedPCA' is deprecated.
#> Use 'scrapper::runPca.se' instead.
#> See help("Deprecated")
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
darmanis <- runUMAP(darmanis, dimred = "PCA")
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

plotReducedDim(darmanis, dimred = "UMAP", colour_by = "cell.type")
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

4 Quick start

The fastest way to compute the Association Plot for a selected cluster of cells from the input data is by using a wrapper function runAPL(). runAPL() automates most of the analysis steps for ease of use.

For example, to generate an Association Plot for the oligodendrocytes we can use the following command:

runAPL(
  darmanis,
  assay = "logcounts",
  top = 5000,
  group = which(darmanis$cell.type == "oligodendrocytes"),
  type = "ggplot"
)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'
#> Warning in rm_zeros(obj): Matrix contains rows with only 0s. These rows were
#> removed. If undesired set rm_zeros = FALSE.
#> No dimensions specified. Setting dimensions to: 93
#> Warning in rm_zeros(mat): Matrix contains rows with only 0s. These rows were
#> removed. If undesired set rm_zeros = FALSE.
#> 
#>  Using 13 dimensions. Subsetting.

#> 
  |                                                                            
  |                                                                      |   0%
  |                                                                            
  |======================================================================| 100%
#> 
  |                                                                            
  |                                                                      |   0%
  |                                                                            
  |=======================                                               |  33%
  |                                                                            
  |===============================================                       |  67%
  |                                                                            
  |======================================================================| 100%

The generated Association Plot is computed based on the log-normalized count matrix. By default runAPL uses the top 5,000 most variable genes in the data, but the data can be subset to any number of genes by changing the value for the argument top. The dimensionality of the CA is determined automatically by the elbow rule described below (see here). This default behavior can be overriden by setting the dimensions manually (parameter dims). The cluster-specificity score (\(S_\alpha\)) for each gene is also calculated (score = TRUE). In order to better explore the data, type can be set to "plotly" to obtain an interactive plot. runAPL has many arguments to further customize the output and fine tune the calculations. Please refer to the documentation (?runAPL) for more information. The following sections in this vignette will discuss the choice of dimensionality and the \(S_\alpha\)-score.

5 Step-by-step way of computing Association Plots

Alternatively, Association Plots can be computed step-by-step. This allows to adjust the Association Plots to user’s needs. Below we explain each step of the process of generating Association Plots.

5.1 Correspondence Analysis

The first step of Association Plot computations is correspondence analysis (CA). CA is a data dimensionality reduction method similar to PCA, however it allows for a simultaneous embedding of both cells and genes from the input data in the same space. In this example we perform CA on the log-normalized count matrix of the darmanis brain data.

# Computing CA on logcounts
logcounts <- logcounts(darmanis)
ca <- cacomp(
  obj = logcounts,
  top = 5000
)
#> Warning in rm_zeros(obj): Matrix contains rows with only 0s. These rows were
#> removed. If undesired set rm_zeros = FALSE.
#> No dimensions specified. Setting dimensions to: 93

# The above is equivalent to:
# ca <- cacomp(obj = darmanis,
#              assay = "logcounts",
#              top = 5000)

The function cacomp accepts as an input any matrix with non-negative entries, be it a single-cell RNA-seq, bulk RNA-seq or other data. For ease of use, cacomp accepts also SingleCellExperiment and Seurat objects, however for these we additionally have to specify via the assay and/or slot (for Seurat) parameter from where to extract the data. Importantly, in order to ensure the interpretability of the results cacomp (and related functions such as runAPL) requires that the input matrix contains both row and column names.

When performing a feature selection before CA, we can set the argument top to the desired number of genes with the highest variance across cells from the input data to retain for further analysis. By default, only the top 5,000 most variable genes are kept as a good compromise between computational time and keeping the most relevant genes. If we want to ensure however that even marker genes of smaller clusters are kept, we can increase the number of genes.

The output of cacomp is an object of class cacomp:

ca
#> cacomp object with 466 columns, 5000 rows and 93 dimensions.
#> Calc. standard coord.:  std_coords_rows, std_coords_cols
#> Calc. principal coord.: prin_coords_rows, prin_coords_cols
#> Calc. APL coord.:       
#> Explained inertia:      3.1% Dim1, 2.2% Dim2

As can be seen in the summarized output, by default both types of coordinates in the CA space (principal and standardized) are calculated. Once the coordinates for the Association Plot are calculated, they will also be shown in the output of cacomp. Slots are accessed through an accessor function:

cacomp_slot(ca, "std_coords_cols")[1:5, 1:5]
#>                  Dim1      Dim2       Dim3        Dim4       Dim5
#> GSM1657871  0.1067086 0.1253297 -0.5046485 -0.32461780 -2.4262520
#> GSM1657872 -1.5101357 0.4299418  0.1219273  0.07942823 -0.2362747
#> GSM1657873  0.2237619 0.2610148 -0.1955599  0.16963578 -2.1477040
#> GSM1657874  0.5680872 0.1138251 -0.4725071  0.04409559  0.6708562
#> GSM1657875  0.4739344 0.2505648 -0.4384626  0.10316896 -2.8910899

In the case of SingleCellExperiment and Seurat objects, we can alternatively set return_input = TRUE to get the input object back, with the CA results computed by “APL” and stored in the appropriate slot for dimension reduction. This also allows for using the plotting functions that come with these packages:

darmanis <- cacomp(
  obj = darmanis,
  assay = "logcounts",
  top = 5000,
  return_input = TRUE
)
#> Warning in rm_zeros(obj): Matrix contains rows with only 0s. These rows were
#> removed. If undesired set rm_zeros = FALSE.
#> No dimensions specified. Setting dimensions to: 93
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

plotReducedDim(darmanis,
  dimred = "CA",
  ncomponents = c(1, 2),
  colour_by = "cell.type"
)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

plotReducedDim(darmanis,
  dimred = "CA",
  ncomponents = c(3, 4),
  colour_by = "cell.type"
)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

However, some functions such as apl_coords() require information that cannot be stored in the single-cell container objects. It is therefore often easier to work with a cacomp object instead. We can convert Seurat or SingleCellExperiment objects which have CA results stored to a cacomp object using the function as.cacomp():

# Converting the object darmanis to cacomp
ca <- as.cacomp(darmanis)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

5.2 Reducing the number of CA dimensions

When working with high-dimensional data, after singular value decomposition there will often be many dimensions which are representing the noise in the data. In order to minimize the noise, it is generally recommended to reduce the dimensionality of the data before generating Association Plots.

The number of dimensions to retain can be computed using the function pick_dims. This function offers three standard methods which we implemented:

elbow rule (method = "elbow_rule") - the number of dimensions to retain is calculated based on scree plots generated for randomized data, and corresponds to a point in the plot where the band of randomized singular values enters the band of the original singular values,
80% rule (method = "maj_inertia") - only those first dimensions are retained which in total account for >= 80% of total inertia,
average rule (method = "avg_inertia") - only those dimensions are retained which account for more inertia than a single dimension on average.

Additionally, the user can compute a scree plot to choose the number of dimensions by themselves:

pick_dims(ca, method = "scree_plot") +
  xlim(c(0, 20))
#> Warning: Using `size` aesthetic for lines was deprecated in ggplot2 3.4.0.
#> ℹ Please use `linewidth` instead.
#> ℹ The deprecated feature was likely used in the APL package.
#>   Please report the issue to the authors.
#> This warning is displayed once per session.
#> Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
#> generated.
#> Warning: Removed 74 rows containing missing values or values outside the scale range
#> (`geom_col()`).
#> Warning: Removed 73 rows containing missing values or values outside the scale range
#> (`geom_line()`).

In the scree plot above we can see that the first dimension explains only ~1% of the total inertia and we observe the “jump” in the scree plot at roughly 5 dimensions. The first dimensions however explain only a small amount of the total inertia.

Here we compute the number of dimensions using the elbow rule. For demonstration, only three data permutations are computed:

pd <- pick_dims(
  ca,
  mat = logcounts(darmanis),
  method = "elbow_rule",
  reps = 3
)
#> Warning in rm_zeros(mat): Matrix contains rows with only 0s. These rows were
#> removed. If undesired set rm_zeros = FALSE.

pd
#> [1] 13

In this case the elbow rule leads to a higher number of dimensions.

# Compute the amount of inertia explained by each of the dimensions
D <- cacomp_slot(ca, "D")
expl_inertia <- (D^2 / sum(D^2)) * 100

# Compute the amount of intertia explained
# by the number of dimensions defined by elbow rule
sum(expl_inertia[seq_len(pd)])
#> [1] 21.14017

In this example the elbow rule suggests to keep 13 dimensions that explain 21.14% of the total inertia from the data.

Finally, we can reduce the dimensionality of the data to the desired number of dimensions:

ca <- subset_dims(ca, dims = pd)

5.3 Association Plots

When working with single-cell transcriptomics data we are often interested in which genes are associated to a cluster of cells. To reveal such genes we can compute an Association Plot for a selected cluster of cells. In the following example we want to generate an Association Plot for the cluster of endothelial cells:

# Specifying a cell cluster of interest
endo <- which(darmanis$cell.type == "endothelial")

# Calculate Association Plot coordinates for endothelial cells
ca <- apl_coords(ca, group = endo)

After computing the coordinates of genes and cells in the Association Plot we are able to plot the results using the apl function.

# endothelial marker genes
marker_genes <- c("APOLD1", "TM4SF1", "SULT1B1", "ESM1", "SELE")

# Plot APL
apl(ca,
  row_labs = TRUE,
  rows_idx = marker_genes,
  type = "ggplot"
) # type = "plotly" for an interactive plot

In the Association Plot all genes are represented by blue circles. The further to the right a gene is located the more associated it is with the chosen cluster of cells and the lower the y-axis value, the more specific it is for the selected cluster. Additionally, it is possible to highlight in the Association Plot any set of genes. In the example above we highlighted five genes (APOLD1, TM4SF1, SULT1B1, ESM1, SELE) which are known to be marker genes for endothelial cells. As we can see in the plot, they are located in the right part of the plot, which confirms their specificity for endothelial cells.

By default we plot only the genes in the Association Plot. To also display the cells in the Association Plot, use the argument show_cols = TRUE. This way we can identify other cells which show similar expression profiles to the cells of interest. Cells that belong to the cluster of interest will be colored in red, and all remaining cells will be colored in violet. Furthermore, an interactive plot in which you can hover over genes to see their name can be created by setting type = "plotly".

5.4 Association Plots with the \(S_\alpha\)-scores

The \(S_\alpha\)-score allows us to rank genes by their specificity for a selected cell cluster, and is computed for each gene from the Association Plot separately. The higher the \(S_\alpha\)-score of a gene, the more characteristic its expression for the investigated cell cluster. The \(S_\alpha\)-scores can be computed using the apl_score function. To display the \(S_\alpha\)-scores in the Association Plot, we can use the argument show_score = TRUE in the apl function:

# Compute S-alpha score
# For the calculation the input matrix is also required.
ca <- apl_score(ca,
  mat = logcounts(darmanis),
  reps = 5
)

apl(ca,
  show_score = TRUE,
  type = "ggplot"
)

By default, only genes that have a \(S_\alpha\)-score larger than 0 are colored as these tend to be genes of interest and we consider them as cluster-specific genes. This cutoff can be easily changed through the score_cutoff argument to apl().

The \(S_\alpha\)-scores are stored in the "APL_score" slot and can be accessed as follows:

head(cacomp_slot(ca, "APL_score"))
#>             Rowname    Score Row_num Rank
#> IFITM1       IFITM1 2.750320    2081    1
#> LOC646268 LOC646268 2.743558      81    2
#> OR2A20P     OR2A20P 2.743558    4888    3
#> OR4C46       OR4C46 2.743558    4889    4
#> PRKCDBP     PRKCDBP 2.722349    2390    5
#> TWIST1       TWIST1 2.637943    1688    6

To see the expression of genes with the highest \(S_\alpha\)-scores (or any selected genes) across all cell types from the data we can use plotting functions provided by scater:

scores <- cacomp_slot(ca, "APL_score")

plotExpression(darmanis,
  features = head(scores$Rowname, 3),
  x = "cell.type",
  colour_by = "cell.type"
)


plotReducedDim(darmanis,
  dimred = "UMAP",
  colour_by = scores$Rowname[1]
)
#> Found more than one class "package_version" in cache; using the first, from namespace 'SeuratObject'
#> Also defined by 'alabaster.base'

As expected, the 3 most highly scored genes are over-expressed in the endothelial cells. Due to the small size of the data set and number of cells in the cluster (only 20 out of 466 cells are endothelial cells) some cluster specific genes are only expressed in a few cells. Most data sets nowadays are significantly larger so this should not be a major concern and it can further be mitigated by performing a more stringent feature selection before CA.

5.5 Visualization of CA

In addition to Association Plots “APL” produces also other forms of the output. For instance, we can use “APL” to generate a two- and three-dimensional correspondence analysis projection of the data. The so-called biplot visualizes both cells and genes from the input data and can be created using the function ca_biplot. Alternatively, a three-dimensional data projection plot can be generated using the function ca_3Dplot. To generate such biplots a cacomp object is required.

# Specifying a cell cluster of interest
endo <- which(darmanis$cell.type == "endothelial")

# Creating a static plot
ca_biplot(ca, col_labels = endo, type = "ggplot")
#> Warning: `aes_()` was deprecated in ggplot2 3.0.0.
#> ℹ Please use tidy evaluation idioms with `aes()`
#> ℹ The deprecated feature was likely used in the APL package.
#>   Please report the issue to the authors.
#> This warning is displayed once per session.
#> Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
#> generated.
#> Warning in ggplot2::geom_point(data = rows, ggplot2::aes_(x = as.name(rnmx), :
#> Ignoring unknown aesthetics: text
#> Warning in ggplot2::geom_point(data = cols, ggplot2::aes_(x = as.name(cnmx), :
#> Ignoring unknown aesthetics: text
#> Warning in ggplot2::geom_point(data = cols[col_labels, ], ggplot2::aes_(x =
#> as.name(cnmx), : Ignoring unknown aesthetics: text


# Creating an interactive plot
# ca_biplot(ca, type = "plotly", col_labels = platelets)

# 3D plot
# ca_3Dplot(ca, col_labels = platelets)

The above described plots give us a quick overview of the first 2 dimensions of the data (more dimensions can be plotted). As shown in the commented-out code, to interactively explore the projection of the data type = "plotly" can be set.

6 APL and GO enrichment analysis

After computing an Association Plot and identifying a set of genes specific for a selected cluster of cells we might be interested in conducting a Gene Ontology (GO) enrichment analysis of the identified gene set. To conduct a GO enrichment analysis of microglia specific genes as idenitfied using an Association Plot, we first need to compute the coordinates of the genes in the Association Plot for microglia cells, as well as the \(S_\alpha\)-score for each gene:

# Get indices of microglia cells
microglia <- which(darmanis$cell.type == "microglia")

# Calculate Association Plot coordinates of the genes and the $S_\alpha$-scores
ca <- apl_coords(ca, group = microglia)

ca <- apl_score(ca,
  mat = logcounts(darmanis),
  reps = 5
)

Now we can conduct GO enrichment analysis as implemented in the package topGO using the most cluster-specific genes from the Association Plot. By default we use all genes with an \(S_\alpha\)-score higher than 0, but the cutoff may have to be adjusted depending on the dataset. In the example below we restrict it to genes with a \(S_\alpha\)-score higher than 1 to restrict it to truly significant genes. In case that no \(S_\alpha\)-scores were calculated, one can also choose to use the ngenes (by default 1000) genes with the highest x-coordinates by setting use_coords = TRUE.

enr <- apl_topGO(ca,
  ontology = "BP",
  organism = "hs",
  score_cutoff = 1
)
head(enr)
#>        GO.ID
#> 1 GO:0019886
#> 2 GO:0045087
#> 3 GO:0002250
#> 4 GO:0002503
#> 5 GO:0045730
#> 6 GO:0032755
#>                                                                                Term
#> 1 antigen processing and presentation of exogenous peptide antigen via MHC class II
#> 2                                                            innate immune response
#> 3                                                          adaptive immune response
#> 4                        peptide antigen assembly with MHC class II protein complex
#> 5                                                                 respiratory burst
#> 6                                   positive regulation of interleukin-6 production
#>   Annotated Significant Expected raw.p.value
#> 1        12          10     0.83     1.2e-10
#> 2       281          60    19.51     1.7e-10
#> 3       162          46    11.25     2.9e-10
#> 4        11           9     0.76     1.6e-09
#> 5        14          10     0.97     1.7e-09
#> 6        36          15     2.50     4.0e-09

The function plot_enrichment() was implemented to visualize the topGO results in form of a dotplot.

plot_enrichment(enr)

Microglia cells are innate immune cells of the brain and as such the most highly scored genes are enriched in gene sets related to the immune response and microglia specific gene sets as one would expect.

Session info

#> R version 4.6.0 RC (2026-04-17 r89917)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.4 LTS
#> 
#> Matrix products: default
#> BLAS:   /home/biocbuild/bbs-3.23-bioc/R/lib/libRblas.so 
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.12.0  LAPACK version 3.12.0
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
#>  [3] LC_TIME=en_GB              LC_COLLATE=C              
#>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
#>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
#>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> time zone: America/New_York
#> tzcode source: system (glibc)
#> 
#> attached base packages:
#> [1] stats4    stats     graphics  grDevices utils     datasets  methods  
#> [8] base     
#> 
#> other attached packages:
#>  [1] SparseM_1.84-2              org.Hs.eg.db_3.23.1        
#>  [3] AnnotationDbi_1.74.0        scater_1.40.0              
#>  [5] ggplot2_4.0.3               scran_1.40.0               
#>  [7] scuttle_1.22.0              scRNAseq_2.25.0            
#>  [9] SingleCellExperiment_1.34.0 SummarizedExperiment_1.42.0
#> [11] Biobase_2.72.0              GenomicRanges_1.64.0       
#> [13] Seqinfo_1.2.0               IRanges_2.46.0             
#> [15] S4Vectors_0.50.0            BiocGenerics_0.58.0        
#> [17] generics_0.1.4              MatrixGenerics_1.24.0      
#> [19] matrixStats_1.5.0           APL_1.16.0                 
#> [21] BiocStyle_2.40.0           
#> 
#> loaded via a namespace (and not attached):
#>   [1] BiocIO_1.22.0            bitops_1.0-9             filelock_1.0.3          
#>   [4] tibble_3.3.1             graph_1.90.0             XML_3.99-0.23           
#>   [7] lifecycle_1.0.5          httr2_1.2.2              edgeR_4.10.0            
#>  [10] topGO_2.64.0             globals_0.19.1           lattice_0.22-9          
#>  [13] ensembldb_2.36.0         alabaster.base_1.12.0    magrittr_2.0.5          
#>  [16] limma_3.68.0             sass_0.4.10              rmarkdown_2.31          
#>  [19] jquerylib_0.1.4          yaml_2.3.12              metapod_1.20.0          
#>  [22] otel_0.2.0               spam_2.11-3              sp_2.2-1                
#>  [25] cowplot_1.2.0            DBI_1.3.0                RColorBrewer_1.1-3      
#>  [28] abind_1.4-8              AnnotationFilter_1.36.0  RCurl_1.98-1.18         
#>  [31] rappdirs_0.3.4           ggrepel_0.9.8            irlba_2.3.7             
#>  [34] listenv_0.10.1           alabaster.sce_1.12.0     RSpectra_0.16-2         
#>  [37] dqrng_0.4.1              parallelly_1.47.0        codetools_0.2-20        
#>  [40] DelayedArray_0.38.0      tidyselect_1.2.1         UCSC.utils_1.8.0        
#>  [43] farver_2.1.2             viridis_0.6.5            ScaledMatrix_1.20.0     
#>  [46] BiocFileCache_3.2.0      GenomicAlignments_1.48.0 jsonlite_2.0.0          
#>  [49] BiocNeighbors_2.6.0      progressr_0.19.0         tools_4.6.0             
#>  [52] Rcpp_1.1.1-1.1           glue_1.8.1               gridExtra_2.3           
#>  [55] SparseArray_1.12.0       xfun_0.57                GenomeInfoDb_1.48.0     
#>  [58] dplyr_1.2.1              HDF5Array_1.40.0         gypsum_1.8.0            
#>  [61] withr_3.0.2              BiocManager_1.30.27      fastmap_1.2.0           
#>  [64] rhdf5filters_1.24.0      bluster_1.22.0           digest_0.6.39           
#>  [67] rsvd_1.0.5               R6_2.6.1                 GO.db_3.23.1            
#>  [70] dichromat_2.0-0.1        RSQLite_2.4.6            cigarillo_1.2.0         
#>  [73] h5mread_1.4.0            FNN_1.1.4.1              rtracklayer_1.72.0      
#>  [76] httr_1.4.8               S4Arrays_1.12.0          org.Mm.eg.db_3.23.0     
#>  [79] uwot_0.2.4               pkgconfig_2.0.3          gtable_0.3.6            
#>  [82] blob_1.3.0               S7_0.2.2                 XVector_0.52.0          
#>  [85] htmltools_0.5.9          dotCall64_1.2            bookdown_0.46           
#>  [88] ProtGenerics_1.44.0      SeuratObject_5.4.0       scales_1.4.0            
#>  [91] alabaster.matrix_1.12.0  png_0.1-9                knitr_1.51              
#>  [94] rjson_0.2.23             curl_7.1.0               cachem_1.1.0            
#>  [97] rhdf5_2.56.0             BiocVersion_3.23.1       vipor_0.4.7             
#> [100] parallel_4.6.0           restfulr_0.0.16          pillar_1.11.1           
#> [103] grid_4.6.0               alabaster.schemas_1.12.0 vctrs_0.7.3             
#> [106] BiocSingular_1.28.0      dbplyr_2.5.2             beachmat_2.28.0         
#> [109] cluster_2.1.8.2          beeswarm_0.4.0           evaluate_1.0.5          
#> [112] magick_2.9.1             tinytex_0.59             GenomicFeatures_1.64.0  
#> [115] cli_3.6.6                locfit_1.5-9.12          compiler_4.6.0          
#> [118] Rsamtools_2.28.0         rlang_1.2.0              crayon_1.5.3            
#> [121] future.apply_1.20.2      labeling_0.4.3           ggbeeswarm_0.7.3        
#> [124] viridisLite_0.4.3        alabaster.se_1.12.0      BiocParallel_1.46.0     
#> [127] Biostrings_2.80.0        lazyeval_0.2.3           Matrix_1.7-5            
#> [130] ExperimentHub_3.2.0      sparseMatrixStats_1.24.0 bit64_4.8.0             
#> [133] future_1.70.0            Rhdf5lib_2.0.0           KEGGREST_1.52.0         
#> [136] statmod_1.5.1            alabaster.ranges_1.12.0  AnnotationHub_4.2.0     
#> [139] igraph_2.3.0             memoise_2.0.1            bslib_0.10.0            
#> [142] bit_4.6.0