Hi I'm new to the NGS data analysis.
To practice the command line skills, I want to test the EstimateLibraryComplexity after double the bam. I have one bam file (1.bam), copy paste it (name it 1copy.bam), then merge 1.bam and 1copy.bam by using picard-tools MergeSamFiles. I use samtools flagstat check all the numbers are doubled.
The question is 1, why the merged bam size is not doubled? 1.bam is 79Mb, but merged bam is only 84Mb.
2, I run the EstimateLibraryComplexity for 1.bam and merged bam. the READ_PAIRS_EXAMINED for 1.bam is 651770, but for merged bam is only 3548.I think there mush be something wrong with my setting.
BTW, the PERCENT_DUPLICATION for merged bam is 0.5, which is I expected.And I also use samtools view to view the merged bam, every reads has identical duplicates.
Thanks,
Thanks a lot, try to change the 1copy.bam reads name, then redo the whole process again.