Hi!
I'm trying to produce strand specific bigwig coverage files for a yeast RNA-seq experiment with unknown library type.
After mapping the reads against the S. cerevisiae reference genome STAR returns two coverage files:
Here, you can see that the gene is on the sense strand but half of the reads are mapped to the antisense. (Blue tracks are read pileups/bigwig coverage files). I assume that this is because some reads are PCR amplicons and not the original RNA and thus map to the wrong strand.
My goal is to produce "correct" coverage files, where the orientation of the annotated gene is taken into account when the coverage profiles are produced. How could I do this?
In this case for example, I would expect that the correct coverage profile show roughly the sum of both profiles but on the upper track/sense strand and 0 on the antisense strand.
Any help is much appreciated!
Best Timon
Two questions: Are your RNA-seq data stranded, and how did you produce the tracks?
I don't know but ran salmon to autoinfer the strandedness which says "U" so I assume it's unstranded.
If it's unstranded then it's exactly that. I don't see how any tool could change that other than assuming that if a gene was on reverse strand then all intersecting signal would need to be on that strand too. But that's a strong assumption and a problem when genes overlap. It's custom.
If your data is unstranded then you should have used
--outWigStrand Unstranded
.