How to Process PacBio RS FASTQ Files from SRA for Genome Coassembly with Illumina Reads?
1
0
Entering edit mode
7 months ago
till • 0

Hello Biostars community,

I’m working on coassembling the genome of Desulfovibrio glucosivorans DMSS-1 using sequencing data from both PacBio RS and Illumina HiSeq 2000 platforms. All the reads are available under BioProject accession PRJNA186466.

I read that preprocessing of PacBio RS reads typically requires primary analysis data to use in SMRT Link. However, the Sequence Read Archive (SRA) for my project only provides FASTQ files, and I haven’t been able to figure out how to process PacBio RS FASTQ files for before coassembly.

Any guidance on how I should preprocess the FASTQ files before coassembly with Unicycler would be greatly appreciated! Thank you in advance for your help!

coassembly fastq pacbio • 701 views
ADD COMMENT
0
Entering edit mode

More than likely the fastq files have already been pre-processed. This is a submission from Joint Genome Institute so that is some additional assurance. You can go ahead and start on your assembly. This data is from 2013.

You could run fastplong (LINK) on the data to see if it is able to identify any recurring sequences but that is probably not needed.

ADD REPLY
0
Entering edit mode
4 months ago

Hi! I was just about to try to do this myself and then I found the genome is already assembled on the JGI's genome portal. See if you can follow the link below, if not, you can try to find it this way from the IMG id: 2576861818

https://img.jgi.doe.gov/cgi-bin/m/main.cgi?section=TaxonDetail&page=taxonDetail&taxon_oid=2576861818

Good luck! Courtney

P.S. we have an isolated culture and ONT genome of this critter ping me here or send me an email if you are interested, https://portal.research.lu.se/en/persons/courtney-w-stairs

ADD COMMENT

Login before adding your answer.

Traffic: 1882 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6