Reads mapped to another chromosome in paired-end data of RNA-seq

0

Entering edit mode

11.3 years ago

int11ap1 ▴ 490

Hello guys,

I am doing a project in RNA-seq data with some datasets that are paired-end data. After alignment with TopHat2, I have around 1-5% of reads whose mate is mapped to another chromosome. In order to call Cufflinks for annotation of transcripts, would you recommend me to remove those reads mapped to another chromosome (or call again TopHat2 with the option --no-discordant)? Is it a key step?

Thanks in advanced.

RNA-Seq • 4.7k views

ADD COMMENT • link updated 11.1 years ago by Biostar 20 • written 11.3 years ago by int11ap1 ▴ 490

0

Entering edit mode

Sorry this isn't answering your question but instead asking a new one: What tool (or commands) did you use to find out how many reads had mates that mapped to a different chromosome?

ADD REPLY • link 11.3 years ago by Ann ★ 2.4k

0

Entering edit mode

samtools flagstat FILE.bam

ADD REPLY • link updated 5.6 years ago by Ram 45k • written 11.3 years ago by int11ap1 ▴ 490

0

Entering edit mode

I thought bowtie rejected alignments of reads whose mates are too far from one another (than the estimated fragment size). Maybe tophat retains them for splicing event detection?

ADD REPLY • link updated 5.6 years ago by Ram 45k • written 11.3 years ago by ndejay ▴ 30

1

Entering edit mode

I don't think TopHat2 will use read pairs aligning on different chromosomes for annotating transcripts. This is my personnel guess but I am quiet positive about it. So you need not to remove these reads. Still if you want you can use this command:

samtools view -H Your.bam > Your.sam
samtools view Your.bam | awk '$7 == "=" {print}' >> Your.sam
samtools view -bS Your.sam > Your_new.bam

ADD REPLY • link updated 5.6 years ago by Ram 45k • written 11.1 years ago by Ashutosh Pandey 12k

Login before adding your answer.