I got the flagstat result from my Bam file, while the mapping rate is high, the percentages of properly paired reads and singletons are relatively low, what should I have to do?
112103606 + 0 in total (QC-passed reads + QC-failed reads)
112103606 + 0 primary
0 + 0 secondary
0 + 0 supplementary
0 + 0 duplicates
0 + 0 primary duplicates
112103606 + 0 mapped (100.00% : N/A)
112103606 + 0 primary mapped (100.00% : N/A)
112102614 + 0 paired in sequencing
56339631 + 0 read1
55762983 + 0 read2
2720182 + 0 properly paired (2.43% : N/A)
109362326 + 0 with itself and mate mapped
2740288 + 0 singletons (2.44% : N/A)
101228694 + 0 with mate mapped to a different chr
88514230 + 0 with mate mapped to a different chr (mapQ>=5)
EDIT:
I used bcl2fastq with this command and adapter removing
bcl2fastq -R /proj/naiss2023-22-1174/Masomeh/240131_NB551428_0057_AHT2V3BGXV \
--input-dir /proj/naiss2023-22-1174/Masomeh/240131_NB551428_0057_AHT2V3BGXV/Data/Intensities/BaseCalls/ \
--output-dir /proj/naiss2023-22-1174/Masomeh \
--interop-dir /proj/naiss2023-22-1174/Masomeh/240131_NB551428_0057_AHT2V3BGXV/Data/Intensities/InterOp/ \
--stats-dir /proj/naiss2023-22-1174/Masomeh/240131_NB551428_0057_AHT2V3BGXV/Data/Intensities/BaseCalls/Stats/ \
--reports-dir /crex/proj/naiss2023-22-1174/Masomeh/BaseCalls/Reports/ \
--sample-sheet 'ATACsamplesheet.csv' \
-r 4 -p 40 -w 4 \
--minimum-trimmed-read-length 35 \
--mask-short-adapter-reads 22 \
--adapter-stringency 0.9 \
--ignore-missing-bcls \
--ignore-missing-filter \
--ignore-missing-positions \
--no-lane-splitting
and then I used cutadapt to remove the adapter
cutadapt -a CTGTCTCTTATACACATCT -A CTGTCTCTTATACACATCT -o SPG_rep3_trimmed_R1.fastq -p SPG_rep3_trimmed_R2.fastq SPG_rep3_S7_R1_001.fastq.gz SPG_rep3_S7_R2_001.fastq.gz
Is it possible this command affect the reads and cause the reduced properly mapping?
Did you check the reads quality before alignment? This looks like one of your pairs is failed or has a problem in read labels