Hi, FastUniq has failed with an error with one of my dataset. I checked the files and discovered that the sequence length is different between forward and reverse read. Here is an example:
MGRF_NGS_FATIMA_LIFAT-30373344/F-35905947> head L001_R1.fastq L001_R2.fastq
==> L001_R1.fastq <==
@NS500334:63:HF2WTBGXY:1:11101:15449:1054 1:N:0:GATCAG
TAAGTNAAACCCAAACGAAATTACCNTACCTTGNCCTAGCANGTCGATAAAAGGTGGATGGCATTGTAGGGTCGCTCTCTTCGNTTCGNNNTCGAANNNNNGNNNNNNNTNNNNNNANCNNNNNC
+
AAAAA#EEEEEEEE6EEEEEEEEEE#EEEEEEE#EEEEEAE#EEEEEAEEEEEEEEEEEEE<EEEEEEEEEEEEEEEEEEEEA#EEEE###EEE<E#####E#######E######/#E#####A
@NS500334:63:HF2WTBGXY:1:11101:10110:1054 1:N:0:GATCAG
CTACANATCATAATGAATACAACATNAGTTTAANGAAACAGNCACAAGTTTAAAAAAAACTGAAATAACTATAAAATAACATGNCCAANNNCACTANNNNNTNNNNNNNANNNNNNANGNNNNNCCNNNNNNNNNNNNNNNNNNNNNNNNN
+
AAAAA#EEEAEEAEEEEEEEAEE/E#EEEEEEE#E/EA6EE#AEAEEEEEEEEEEEEEEEEEEEEEEE<EEEEEEEEEEEAEE#EEA/###AEEE/#####E#######A######E#E#####E6#########################
@NS500334:63:HF2WTBGXY:1:11101:20814:1054 1:N:0:GATCAG
CATGCNATGAGAAGATTTCATTTGCNAGGGTCCNTGTTGAANTGGATGCTGCCTATCCACTTCCTGATGAATTGGAGATTGATNCCCCNNNTGGCTNNNNNCNNNNNNNANNNNNNTNCNNNN
==> L001_R2.fastq <==
@NS500334:63:HF2WTBGXY:1:11101:15449:1054 2:N:0:GATCAG
NTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATNNNNATTNNNNAAANNNNACTGNNNNATCNNNNAAAANNGCGAGNNTATCCTGTCTTANNTTAGTANCCACACGCACTGGATAATTTATGAACAAT
+
#AAA#################################################EE####EEE####EEA####EEEE####EEA####EEEE##A<EEA##EEEEEE<EEEE/##EAEE/E#EAAE/AA/AEEEE/EEEEEEEEEEE<AAE
@NS500334:63:HF2WTBGXY:1:11101:10110:1054 2:N:0:GATCAG
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
###################################
@NS500334:63:HF2WTBGXY:1:11101:20814:1054 2:N:0:GATCAG
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
Should forward and reverse read have the same length? What would be the best way to fix it?
Thank you in advance.
Mic
This reads have not been trimmed. I always thought that forward and reverse read must have the same length.
Not necessarily. There is no requirement that a run be setup in a symmetric fashion. Cycle lengths can be set up in any arbitrary combination during run set up.