What does the reference_id = -1 indicate in a .sam file read with pysam?

7.8 years ago

osman.christof • 0

Hi all,

I am using Python and pysam to analyze my sequencing data. I aligned paired-end reads from the Illumina MiSeq platform to my reference genome with bowtie2. The reference genome consists of 17 chromosomes. When I iterate over the records in my .sam file with pysam, most records have a reference_id between 0 and 16, which corresponds to the chromosomes. However, some records have reference_id = -1.

I'm unsure what this means. Does it indicate that the read could not be mapped to the reference genome? I would think so, but I can't find this documented anywhere. Any help would be much appreciated!

pysam python bowtie miseq sam-file • 2.0k views

I tested this with a random read that does not map to the reference. When I checked this unmapped read, its reference_id was -1.
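This matches how the SAM format represents such reads: a record with no reference sequence has `*` in the RNAME column, and the index of the reference in the header's @SQ lines is then reported as -1. Here is a minimal sketch of that mapping in plain Python, so it runs without pysam. The header, read names, and the helper `reference_ids` are made up for illustration and are not part of pysam.

```python
# Sketch: how a reference index can be derived from the RNAME column of a
# SAM record. The header and the three reads below are invented test data.

SAM_TEXT = (
    "@HD\tVN:1.6\tSO:unsorted\n"
    "@SQ\tSN:chr1\tLN:1000\n"
    "@SQ\tSN:chr2\tLN:2000\n"
    "read1\t0\tchr1\t101\t42\t8M\t*\t0\t0\tACGTACGT\tIIIIIIII\n"
    "read2\t16\tchr2\t501\t42\t8M\t*\t0\t0\tTTGGCCAA\tIIIIIIII\n"
    "read3\t4\t*\t0\t0\t*\t*\t0\t0\tGGGGCCCC\tIIIIIIII\n"
)

UNMAPPED_FLAG = 0x4  # SAM FLAG bit: "segment unmapped"


def reference_ids(sam_text):
    """Return (read name, reference index, unmapped flag set) per record.

    Hypothetical helper: the reference index is the 0-based position of
    RNAME among the @SQ header lines, or -1 when RNAME is '*'.
    """
    names = []    # reference names in @SQ order
    results = []
    for line in sam_text.splitlines():
        if not line:
            continue
        if line.startswith("@"):
            # Header line: collect sequence names from @SQ entries only.
            if line.startswith("@SQ"):
                for field in line.split("\t")[1:]:
                    if field.startswith("SN:"):
                        names.append(field[3:])
            continue
        cols = line.split("\t")
        qname, flag, rname = cols[0], int(cols[1]), cols[2]
        ref_id = -1 if rname == "*" else names.index(rname)
        results.append((qname, ref_id, bool(flag & UNMAPPED_FLAG)))
    return results


for qname, ref_id, unmapped in reference_ids(SAM_TEXT):
    print(qname, ref_id, unmapped)
# read1 0 False
# read2 1 False
# read3 -1 True
```

In pysam, the corresponding attributes on each record are `read.reference_id` and `read.is_unmapped`. Checking `is_unmapped` is probably the safer test. In paired-end data, an unmapped read whose mate did align is often written with the mate's reference name and position, so its reference_id may not be -1 even though the read itself is unmapped.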

7.3 years ago by imlituan ▴ 110
