What is the best practice for the analysis of single-cell RNA-seq of mouse and human mixed cells?
2
0
Entering edit mode
2.3 years ago
Dan ▴ 180

My samples are patient-derived xenografts that contain both human epithelial cancer cells and mouse fibroblast/blood cells. I run cellranger count using the human and mouse combined reference genome, which was downloaded from 10xgenomics (https://cf.10xgenomics.com/supp/cell-exp/refdata-gex-GRCh38-and-mm10-2020-A.tar.gz). After Leiden clustering, the human and mouse cells can be separated by the marker genes found with scanpy.tl.rank_genes_groups. Then the data can be separated into human cells and mouse cells. The mouse genes in human cells were removed and the raw data of human cells can be re-clustered.
I am wondering if is this the correct practice for the analysis.

Is it possible to filter out the human cell from the fastq data and run cellranger count using only human reference genome?

Thanks.

single-cell RNA-seq • 2.2k views
ADD COMMENT
2
Entering edit mode

One option is to use Sargasso (https://github.com/biomedicalinformaticsgroup/Sargasso) in order to split the raw FASTQ reads based on their respective species. I have used this program before, but I am unsure of its usage in scRNA-seq.

ADD REPLY
2
Entering edit mode
2.3 years ago
dsull ★ 7.0k

One thing I like doing is mapping raw FASTQs to combined human+mouse reference to figure out which cell barcodes are human and which cell barcodes are mouse (e.g. if >75% of reads/UMIs belonging to a given barcode are of human origin, it's probably human and not mouse).

Then, I map the original raw FASTQ to human-only reference and throw away the cell barcodes that were of non-human origin from the previous step.

For scRNA-seq, I like this approach of deciding at the cell barcode-level what's human and what's mouse.

ADD COMMENT
0
Entering edit mode

Thanks for your suggestion. Can you please let me know how to split the fastq files based on the cell barcodes? Thanks

ADD REPLY
2
Entering edit mode
2.3 years ago
Dan ▴ 180

To split FASTQ based on cell barcodes, I tried running subset-bam:

https://github.com/10XGenomics/subset-bam

and then run bamtofastq:

https://github.com/10XGenomics/bamtofastq

and then map the fastq files to human-only and mouse-only references.

ADD COMMENT

Login before adding your answer.

Traffic: 2496 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6