Question about BLAST database
1
0
Entering edit mode
8.1 years ago

I need to run blast against mouse genome to cancel mouse related strings in a set of 16S Amplicon reads.

Could someone please tell me which of the files in the following link should I put as the database for blastn

basically blastn -db ** ........

* I understand should be one of the following files or its unzipped format :

ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_000001635.7_GRCm38.p5

blast ncbi • 2.1k views
ADD COMMENT
1
Entering edit mode

Dear Ali, Hi

The genomic.fna.gz file is FASTA format of the genomic sequences in the assembly.

Also have a look at Mouse in Genome Browser or ENSEMBL and Online Resources For Mouse Research and the answer in this post, please.

~ Best

ADD REPLY
0
Entering edit mode

Thanks Farbod Quick question. Do you know if blastn can read through gz files or do I need to unzip them first?

ADD REPLY
2
Entering edit mode

Ali Jan,

As you must use -makeblastdb to convert your database (here mouse genome) to a blastable database, I guess you must decompress your .gz file first.

of course it seems that it could be done in just one script as :

gunzip -c mouse-genome.gz | makeblastdb -in nucl

Have a look here please.

~ Best

ADD REPLY
0
Entering edit mode

Thanks Farbod. Guess you came to my rescue :) The BLAST documentation seems to be extensive but quite misleading.

ADD REPLY
0
Entering edit mode
8.1 years ago
GenoMax 147k

You should use bbsplit from BBMap to separate the mouse reads away from your metagenomic data. It would be immensely more efficient. Directions for use are here: BBSplit syntax for generating builds for the reference genome and how to call different builds.

ADD COMMENT

Login before adding your answer.

Traffic: 1312 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6