whole genome sequencing and assembly
0
0
Entering edit mode
16 months ago

we extracted the whole genome of a fish and went for short-read sequencing using an MGI sequencer. usually, when we get sequencing results we get two files one fastq for forward and one fastq for reverse. but in my case, we got several fastq files of different sizes. can anyone explain what are these multiple files and how we can do an assembly out of them?

assembly whole-genome sequencer MGI • 895 views
ADD COMMENT
0
Entering edit mode

How do your files look like? My guess is they were split by lane and/or flowcell.

ADD REPLY
0
Entering edit mode

yes, I got 4 different lanes in 4 different folders. inside each lane, multiple fastq files are there.

ADD REPLY
0
Entering edit mode

this is lane 1 folder and inside this several fastq files are there ![\]\[1\]

ADD REPLY
0
Entering edit mode

Then you can align each combination separately and then merge the BAMs. For instance, align

  • "FLOWCELL_L001_R1_001.fastq.gz" and "FLOWCELL_L001_R2_001.fastq.gz" together
  • "FLOWCELL_L002_R1_001.fastq.gz" and "FLOWCELL_L002_R2_001.fastq.gz" together,

and so on.

ADD REPLY
0
Entering edit mode

can you say me why multiple fastq files were generated?? is it because the sample was uploaded in different flowcells and from different flowcells different fastq were generated? and does all the flowcell contains data from the same sample?

ADD REPLY

Login before adding your answer.

Traffic: 1707 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6