Question

Strategies To Blast Against Fastq Files

1

11.2 years ago

Cacau ▴ 520

I want to find the homolog of my gene in another species using its transcriptome data (Illumina). What would be a good strategy? Is there any tool that can run a blast-like search directly against a fastq file? I am wondering whether it is OK to convert the fastq file to fasta format and then run blast. Any help will be appreciated.

fastq blast • 14k views

updated 8.0 years ago by bioinform_1 • 0 • written 11.2 years ago by Cacau ▴ 520

score 2 · Answer 1 · 2013-10-04

It is not 100% clear what source of data you want to blast.

Do you want to blast the raw Illumina reads or some form of assembled reads (assembled transcriptome)?

If the former, some tools are faster at searching high volumes of short sequences (blat?); if the latter, a normal blast would do.

In both cases, going from fastq to fasta is the natural way to go. If anybody knows of a blast-like tool that can use fastq, I would be glad to know about it!
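For what it's worth, the fastq to fasta step is simple enough to sketch in plain Python. This is only a minimal sketch, assuming the common layout of exactly four lines per record (no wrapped sequences); the function name `fastq_to_fasta` and the demo reads are made up for illustration.

```python
import io


def fastq_to_fasta(fastq_handle, fasta_handle):
    """Write the header and sequence of each FASTQ record as FASTA.

    Assumes 4 lines per record: @header, sequence, '+' line, qualities.
    Returns the number of records written.
    """
    count = 0
    while True:
        header = fastq_handle.readline()
        if not header:
            break  # end of input
        if not header.strip():
            continue  # tolerate blank lines between or after records
        sequence = fastq_handle.readline()
        fastq_handle.readline()  # '+' separator line, not needed
        fastq_handle.readline()  # quality string, dropped in FASTA
        header = header.rstrip("\n")
        if not header.startswith("@"):
            raise ValueError("record does not start with '@': " + header)
        fasta_handle.write(">" + header[1:] + "\n")
        fasta_handle.write(sequence.strip() + "\n")
        count += 1
    return count


# Hypothetical two-read example, using in-memory files
demo_fastq = io.StringIO(
    "@read1\nACGT\n+\nIIII\n"
    "@read2 extra\nGGCC\n+\nFFFF\n"
)
demo_fasta = io.StringIO()
n_written = fastq_to_fasta(demo_fastq, demo_fasta)
print(n_written)
print(demo_fasta.getvalue(), end="")
```

With real data you would pass open file handles instead of the `io.StringIO` objects. The quality scores are simply thrown away, so any quality filtering has to happen before this step.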

score 1 · Answer 2 · 2013-10-04

11.2 years ago

JC 13k

I would first assemble the transcriptome data with Oases/Trinity/Trans-ABySS and then look for gene homology with blast/blat.
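The search step after assembly could look roughly like the sketch below, which only builds the BLAST+ command lines and prints them. The file names (`assembled_transcripts.fasta`, `my_gene.faa`, `hits.tsv`), the database name, the e-value cutoff, and the assumption that the query gene is a protein (hence `tblastn`) are all illustrative, not something stated in the thread.

```python
import shlex


def blast_commands(transcripts_fasta, query_fasta,
                   db_name="assembly_db", query_is_protein=True):
    """Return (makeblastdb argv, search argv) for a BLAST+ homology search.

    The assembled transcripts become a nucleotide database; the query
    gene is searched with tblastn (protein query) or blastn (nucleotide).
    """
    make_db = [
        "makeblastdb",
        "-in", transcripts_fasta,
        "-dbtype", "nucl",
        "-out", db_name,
    ]
    program = "tblastn" if query_is_protein else "blastn"
    search = [
        program,
        "-query", query_fasta,
        "-db", db_name,
        "-evalue", "1e-5",   # assumed cutoff, adjust to taste
        "-outfmt", "6",      # tabular output
        "-out", "hits.tsv",  # hypothetical output file
    ]
    return make_db, search


# Hypothetical input files
make_db_cmd, search_cmd = blast_commands(
    "assembled_transcripts.fasta", "my_gene.faa"
)
for cmd in (make_db_cmd, search_cmd):
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

If BLAST+ is installed, each list can be handed to `subprocess.run(cmd, check=True)`; printing them first makes it easy to check the commands before anything runs.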

score 0 · Answer 3 · 2016-11-14

8.0 years ago

bioinform_1 • 0

Hi,

I have an nr database and I want to blast a fastq (Illumina) file against it. Is there any software or any other way to do this?

Thanks

0

There's nothing stopping you from stripping the quality lines from a fastq, turning it into a fasta, and then running the sequences through blast. Not sure what you'd get out of it aside from a nearest match for each read to a member of your index. Are you trying to count something, map something, or assign identities before a de novo assembly? Mapping the reads to the fasta that your nr is derived from might let you visualize your reads in the context of your database, if that is what you are looking for.

If your insert length is short enough and your read length long enough, merging read pairs may give you more to work with, and deduplication may eliminate reads from overrepresented regions, which might save you some runtime.
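As a rough illustration of the deduplication idea, here is a minimal sketch that keeps only the first read for each distinct sequence. It assumes exact, case-insensitive duplicates only (reverse complements and near-identical reads are not collapsed), and the name `deduplicate_reads` and the demo reads are hypothetical.

```python
def deduplicate_reads(records):
    """Yield the first (name, sequence) pair seen for each distinct sequence.

    Comparison is exact and case-insensitive. All distinct sequences are
    held in memory, so very large read sets may need a different approach.
    """
    seen = set()
    for name, sequence in records:
        key = sequence.upper()
        if key in seen:
            continue  # an identical read was already kept
        seen.add(key)
        yield name, sequence


# Hypothetical reads: r2 and r4 duplicate r1
reads = [
    ("r1", "ACGT"),
    ("r2", "acgt"),
    ("r3", "GGCC"),
    ("r4", "ACGT"),
]
unique_reads = list(deduplicate_reads(reads))
print(unique_reads)
```

Fewer query sequences means fewer blast searches, which is where the runtime saving would come from. Note that dropping duplicates discards abundance information, so this is not suitable if the goal is to count something.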

8.0 years ago by ctseto ▴ 310