Question: Unmapped reads at TCGA bam files
1
0
Entering edit mode
6.6 years ago
Folder40g ▴ 190

I haven't use samtools for a while and now I've some files from TCGA from which I'm checking the reads.

I'm trying to see if there are any unmapped sequences at TCGA bam files: samtools view -f 4 file_gdc_realn_rehead.bam and I get this UNC-SN:197:C0XX:4:204:155:805 69 chr1 14496 0 * = 14496 0 ACCTGCTTCCCTGGGTGGGGGTGATGGAACCAGCACTGTGCGGAGACC CCCFFFFFHHHHHJJAEHIII6@FHIJJJJJIGJJJJJIIJIHHFFDD MC:Z:46M2S RG:Z::120425_UNC15-SN850_0197_AC0TK3ACXX_ACAGTG_L004 NH:i:0 HI:i:0 nM:i:1 AS:i:43 uT:A:4 69 chr1 14496 0 * = 14496 0 ACCTGCCGTGGGGTGATGAACCAGCACTGTAGACC CCCFFFFFHHHHHJJAEHIII6@FHIJJJJJIGJJJJJIIJIHHFFDD MC:Z:46M2S RG:Z::120425_UNC-SN:197:C0XX:4:204:155:8057_AC0TK3ACXX_ACAGTG_L004 NH:i:0 HI:i:0 nM:i:1 AS:i:43 uT:A:4

If this is an unmapped read why is there information about the chromosome (chr1)? What I'm missing?

Thanks

TCGA bam • 1.9k views
ADD COMMENT
0
Entering edit mode
6.6 years ago
Tm ★ 1.1k

I think you get chromsome information because, mate of that read got mapped to chromosome 1. In case when both of them are unmapped, then there will be no chr info.

ADD COMMENT
0
Entering edit mode

Yes, it seems thats what is happening. A "bug" from STAR aligner.

ADD REPLY
1
Entering edit mode

This is the case with other aligners also. I don't think it is a bug, because when you check reads with 'flag 12' i.e both reads of the pair are unmapped then in that case there will be no chromosome information. But if any one is mapped, chr information is obtained.

ADD REPLY

Login before adding your answer.

Traffic: 2109 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6