Hello all,
So I would like to get the sequence of the genomic regions with mapped reads after an RNA-seq analysis.
First, I have mapped the reads into a reference genome using hisat2 and --rna-strandness F
.
Does anyone have a suggestion on how I could extract the regions of the genome with mapped reads?
My plan was to use bedtools genomecov
on the bam file to find the regions with coverage > 0 and then use bedtools getfasta
using the bed with the regions and the genome in fasta format.
However, I'm not sure on how to keep strand information.
I think I have to include -strand
in genomecov and -s
in getfasta, but I have found that genomecov
gives the same results with and without the -strand
flag. I was expecting including -strand
I would get strand information in the bed file but it is not the case. This is the command I have tried:
bedtools genomecov -ibam input.bam -strand -bga | awk '$4>0' >> output.bed
Does anyone have any comment?
Thank you for your time
A previous post that would be of interest: A: navigate IGV to the points of interest
Thank you very much, I think bamToBed worked for my purpose here
Please use the formatting bar (especially the
code
option) to present your post better. I've done it for you this time.Thank you for your help, I will do it next time