Is is possible to use blastn to search on remote database from terminal?
1
0
Entering edit mode
6.1 years ago

Dear all,

would be possible to use BLAST from the command line to search on the remote NCBI nr database?

The examples I get from the internet requires the creation of a local database with makeblastdb followed by a local search with blastn.

What would be the syntax to search a local sequence to the NCBI nr database, if possible in the first place? (I don't think it would be feasible to create a local version of such huge collection)

Thank you

database blast remote access • 4.9k views
ADD COMMENT
0
Entering edit mode

blastn is for DNA to DNA comparisons. nr is a protein DB, so either you use nt/nr (=the nucleotide DB) or you switch to blastx if you want to compare DNA to protein.

It is feasible to build these DBs locally (we keep them locally in sync in our lab) but for blasting once in a while a few sequences it might be overkill indeed.

Can you tell what exactly you want to do? what is your input file, DNA, protein?

Syntax-wise the answer of Santosh Anand is correct to start a remote blast indeed

ADD REPLY
0
Entering edit mode

I am looking to blast DNA sequences to identify the organism of origin. I have thousands of entries, so I need to automate the process. I reckon the nucleotide database (the whole non-redundant collection) would be quite big to build locally and anyway, how could I get all the NCBI entries in one go?

ADD REPLY
0
Entering edit mode

OK, yes, blastn is your weapon of choice then indeed.

well, the DB is substantial yes, but you can download the complete DB in fasta format (and then build it locally yourself) or download the pre-formatted DB from NCBI. have a look at the ftp site of NCBI , there they offer all this

ADD REPLY
2
Entering edit mode
6.1 years ago

You need to use blastn from blast+ suite with a -remote flag. Something like:

blastn -db nr -query sequence.fasta -remote -out result.blast

ADD COMMENT
0
Entering edit mode

ok, freakish .. I was gonna comment on the fact you're using blastn againt the nr DB (=protein) , but that does seem to work ?????

seems like blast is magically switching to nt when it notices you use blastn (but it's not doing so for blastp). It's really going downhill with the blast-people at NCBI :/

ADD REPLY
0
Entering edit mode

I actually got this output:

mingled KR605506.1  100.000 434 0   0   1053    1486    1   434 0.0 802
NC_022893.1 KF516923.1  100.000 412 0   0   1   412 1   412 0.0 761

where the query was a viroid DNA. The answer NC_022893.1 does refer correctly to Cherry leaf scorch small circular viroid-like RNA. Anyway, I thought that nr stood for non-redundant. I will use nt as required.

ADD REPLY
0
Entering edit mode

yes, the naming is very unfortunate indeed. In the download it's nt for nucleo and nr for protein, in the webblast it's nr for protein and nt/nr for nucleo

ADD REPLY

Login before adding your answer.

Traffic: 2067 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6