Question

Average expression of a sample in single-cell data

0

8 months ago

newuser2024 • 0

Hi all, I have just started working with single-cell data, and I apologise if this question seems nonsensical. I have counts data from this paper (https://doi.org/10.1371/journal.pbio.3001017) in which, for example, there are 9 zygote samples and 7 oocyte samples. My biological question concerns splicing patterns in these stages, which I plan to obtain by passing the counts file through SUPPA. Does it make sense to average the expression levels of transcripts within each stage (e.g. zygote) with a simple mean() in R, or is this inappropriate because it treats the single-cell data as bulk? Is there a more appropriate way of doing this, or is it best not to average expression at all? Any insight would be appreciated. Thanks in advance for your help!

single-cell rna-seq • 586 views

ADD COMMENT • link 8 months ago by newuser2024 • 0

1

The usual approach I know for pseudobulking is to sum the counts across cells, not to average them. Keep in mind that single-cell data are often 3'-tagged, so reliable splicing detection might be difficult.
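To make the sum-versus-mean point concrete, here is a minimal sketch of pseudobulking by summing counts per stage. It uses Python rather than R purely for illustration, and the transcript names, cell names, and counts are all made up; `pseudobulk_sum` is a hypothetical helper, not a function from any package.

```python
# Toy transcript-by-cell counts; every name and number here is invented.
counts = {
    "tx1": {"zygote_1": 10, "zygote_2": 0, "oocyte_1": 4},
    "tx2": {"zygote_1": 3, "zygote_2": 7, "oocyte_1": 0},
}
# Hypothetical mapping from each cell to its developmental stage.
stage_of = {"zygote_1": "zygote", "zygote_2": "zygote", "oocyte_1": "oocyte"}


def pseudobulk_sum(counts, stage_of):
    """Sum raw counts over all cells that belong to the same stage."""
    out = {}
    for tx, per_cell in counts.items():
        totals = out.setdefault(tx, {})
        for cell, n in per_cell.items():
            stage = stage_of[cell]
            totals[stage] = totals.get(stage, 0) + n
    return out


pb = pseudobulk_sum(counts, stage_of)
print(pb)
```

Summing keeps the pseudobulk values as integer counts, whereas a mean generally does not. Whether summed counts or some normalised quantity is the right input depends on what the downstream tool expects.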

ADD REPLY • link 8 months ago by ATpoint 85k

0

Thank you for your reply. Sorry to ask this, but could you please explain how 3' tagging can affect splicing detection?

ADD REPLY • link 8 months ago by newuser2024 • 0

3

Well, if you're only sequencing the very end (the 3' end) of transcripts, how are you going to detect any of the splice junctions that appear in the middle or at the beginning (5' end) of an RNA transcript?

It looks like that paper used nanopore, so the full length of each transcript will be sequenced, in which case you can do splicing/isoform detection analysis.
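The argument above can be sketched with a toy model. Assume, as a deliberate simplification, that a 3'-tagged protocol only covers the last `window` bases of a transcript, and ask which exon-exon junctions fall inside that stretch. The exon lengths, the `window` values, and the function name are hypothetical; real coverage is not a sharp cutoff.

```python
from itertools import accumulate


def junctions_in_3prime_window(exon_lengths, window):
    """For each exon-exon junction (listed 5' to 3'), report whether it
    lies within the last `window` bases of the spliced transcript."""
    tx_len = sum(exon_lengths)
    # Junction positions in transcript coordinates: the cumulative exon
    # lengths, excluding the final value (that is the transcript end).
    junction_pos = list(accumulate(exon_lengths))[:-1]
    window_start = tx_len - window
    return [pos > window_start for pos in junction_pos]


exons = [200, 150, 300, 500]  # made-up exon lengths, 5' to 3'

# 3'-tagged: only the last 400 bases are covered (assumed value).
tagged = junctions_in_3prime_window(exons, window=400)

# Full-length read: the window spans the whole transcript.
full_length = junctions_in_3prime_window(exons, window=sum(exons))

print(tagged, full_length)
```

In this toy case the 3' window ends inside the last exon, so none of the three junctions is observed, while a full-length read spans all of them.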

ADD REPLY • link 8 months ago by dsull ★ 6.9k

0

Thank you for this. I went back to do some more reading and this makes a lot of sense!

ADD REPLY • link 8 months ago by newuser2024 • 0