Hi everyone,
I'm working on a multimodal analysis of single cells from frozen nuclei, capturing both ATAC and RNA data from the same nuclei. While processing my data, I noticed a significant improvement in the transcriptome when using the Cell Ranger Count pipeline (v7.1.0), which handles only RNA-seq, compared to Cell Ranger ARC (v2.0.2), which handles both ATAC and RNA.
Here are the anonymized metrics I gathered:
Metric | Sample1 (ARC Count) | Sample1 (Count) | Sample2 (ARC Count) | Sample2 (Count) |
---|---|---|---|---|
Estimated Number of Cells | 1,053 | 464 | 562 | 564 |
Mean Raw Reads per Cell | 31,233.53 | 70,881 | 57,313.97 | 57,111 |
Fraction of Transcriptomic Reads in Cells | 19.3% | 38.3% | 22.6% | 43.8% |
Median UMI Counts per Cell | 144 | 1,056 | 365 | 1,063 |
Median Genes per Cell | 117 | 894 | 275 | 818 |
Total Genes Detected | 21,462 | 27,316 | 18,662 | 26,112 |
Reads Mapped to Genome | 63.4% | 63.4% | 45.1% | 45.1% |
Reads Mapped Confidently to Genome | 49.7% | 49.7% | 28.4% | 28.4% |
Reads Mapped Confidently to Intergenic Regions | 16.7% | 16.7% | 6.6% | 6.6% |
Reads Mapped Confidently to Intronic Regions | 25.0% | 25.0% | 15.7% | 15.7% |
Reads Mapped Confidently to Exonic Regions | 8.0% | 8.0% | 6.0% | 6.0% |
Reads Mapped Confidently to Transcriptome | 21.4% | 21.4% | 16.4% | 16.4% |
Reads Mapped Antisense to Gene | 11.3% | 11.3% | 5.1% | 5.1% |
The key metrics that stood out to me were:
- The Fraction of Transcriptomic Reads in Cells increased from 19.3% (ARC) to 38.3% (Count) for Sample1 and from 22.6% (ARC) to 43.8% (Count) for Sample2.
- The Median UMI Counts per Cell also saw a huge jump when using Cell Ranger Count: from 144 to 1,056 for Sample1 and from 365 to 1,063 for Sample2.
- A significant increase in Median Genes per Cell and Total Genes Detected was also observed in both datasets when using Count over ARC.
Has anyone else encountered a similar issue when running multimodal analyses on frozen nuclei? If so, what steps did you take moving forward? Did you modify any processing or analysis steps to resolve this? I’m curious to know how others have approached this.
Thanks in advance!
10x recommends that when analyzing multi-modal data you should use
cellranger-arc
.From: https://kb.10xgenomics.com/hc/en-us/articles/360059656912-Can-I-analyze-only-the-Gene-Expression-data-from-my-single-cell-multiome-experiment