how to extract intron coordinates (in bed) from bam
1
0
Entering edit mode
4.8 years ago
praasu ▴ 40

I have aligned raw RNA-seq reads against a reference genome with STAR. I would like to extract the intronic coordinates from obtained bam files. Any suggestion?

RNA-Seq • 2.0k views
ADD COMMENT
2
Entering edit mode
4.8 years ago

This isn't information that one extracts, since it's not held within the BAM files. Instead your steps will be:

  1. Determine where transcripts and exons are (e.g. with stringTie).
  2. Process the resulting GTF file to retrieve introns (using whatever definition of this you'd like).

The second step may require some custom code, depending on whether you want introns that overlap exons (e.g., due to having multiple transcripts for a gene) or not.

ADD COMMENT
0
Entering edit mode

Hi Ryan, The GTF file, that I obtained from the stringTie, is missing strand information. I need strand information to check the splice sites sequence (both at 5' and 3')
Do you have idea, how can I deal with that ?

ADD REPLY
0
Entering edit mode

Did you use a strand-specific library prep?

ADD REPLY
0
Entering edit mode

We didn't use strand-specific library prep.

ADD REPLY
1
Entering edit mode

Then you can't know which strand the transcripts are from.

ADD REPLY
0
Entering edit mode

Hi Ryan,

Today, I got to know that mRNA library is stranded. In that case, how can I obtained the strand information or why could it be missing ?

Prasoon

ADD REPLY
1
Entering edit mode

You need to tell your aligner and stringTie that.

ADD REPLY

Login before adding your answer.

Traffic: 1835 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6