I have a query file containing around 2600 nucleotide sequences. and another group of transctiptomes files (over 150 files) containing hundreds of thousands of sequences in each file. I need to blast the query file over the transcriptomes files to determine which gene in the query file is found (expressed) in the transcriptomes files. Does anyone have any ideas how I can go about it?
One way is by looping the blast command in a shell script
Do you have any idea or hints on how i can go about that..? I will really appreciate
Make a blast database of your 2600 sequences using
makeblastdb
. As TriS suggested below, you have to loop in a way it takes each fasta file from your 150 and blast it against 2600.For eg: your file names are file1.fasta, file2.fasta...........file149.fasta, file150.fasta
* I have cut short the blast command, you may have to include other parameters also.