How to Process PacBio RS FASTQ Files from SRA for Genome Coassembly with Illumina Reads?

0

Entering edit mode

9 weeks ago

till • 0

Hello Biostars community,

I’m working on coassembling the genome of Desulfovibrio glucosivorans DMSS-1 using sequencing data from both PacBio RS and Illumina HiSeq 2000 platforms. All the reads are available under BioProject accession PRJNA186466.

I read that preprocessing of PacBio RS reads typically requires primary analysis data to use in SMRT Link. However, the Sequence Read Archive (SRA) for my project only provides FASTQ files, and I haven’t been able to figure out how to process PacBio RS FASTQ files for before coassembly.

Any guidance on how I should preprocess the FASTQ files before coassembly with Unicycler would be greatly appreciated! Thank you in advance for your help!

coassembly fastq pacbio • 257 views

ADD COMMENT • link updated 7 weeks ago by qin • 0 • written 9 weeks ago by till • 0

0

Entering edit mode

More than likely the fastq files have already been pre-processed. This is a submission from Joint Genome Institute so that is some additional assurance. You can go ahead and start on your assembly. This data is from 2013.

You could run fastplong (LINK) on the data to see if it is able to identify any recurring sequences but that is probably not needed.

ADD REPLY • link 9 weeks ago by GenoMax 148k

Login before adding your answer.