Reads mapped to another chromosome in paired-end data of RNA-seq
0
0
Entering edit mode
10.6 years ago
int11ap1 ▴ 490

Hello guys,

I am doing a project in RNA-seq data with some datasets that are paired-end data. After alignment with TopHat2, I have around 1-5% of reads whose mate is mapped to another chromosome. In order to call Cufflinks for annotation of transcripts, would you recommend me to remove those reads mapped to another chromosome (or call again TopHat2 with the option --no-discordant)? Is it a key step?

Thanks in advanced.

RNA-Seq • 4.3k views
ADD COMMENT
0
Entering edit mode

Sorry this isn't answering your question but instead asking a new one: What tool (or commands) did you use to find out how many reads had mates that mapped to a different chromosome?

ADD REPLY
0
Entering edit mode

samtools flagstat FILE.bam

ADD REPLY
0
Entering edit mode

I thought bowtie rejected alignments of reads whose mates are too far from one another (than the estimated fragment size). Maybe tophat retains them for splicing event detection?

ADD REPLY
1
Entering edit mode

I don't think TopHat2 will use read pairs aligning on different chromosomes for annotating transcripts. This is my personnel guess but I am quiet positive about it. So you need not to remove these reads. Still if you want you can use this command:

  1. samtools view -H Your.bam > Your.sam
  2. samtools view Your.bam | awk '$7 == "=" {print}' >> Your.sam
  3. samtools view -bS Your.sam > Your_new.bam
ADD REPLY

Login before adding your answer.

Traffic: 2563 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6