Dear all,
would be possible to use BLAST from the command line to search on the remote NCBI nr database?
The examples I get from the internet requires the creation of a local database with makeblastdb
followed by a local search with blastn
.
What would be the syntax to search a local sequence to the NCBI nr database, if possible in the first place? (I don't think it would be feasible to create a local version of such huge collection)
Thank you
blastn
is for DNA to DNA comparisons.nr
is a protein DB, so either you usent/nr
(=the nucleotide DB) or you switch toblastx
if you want to compare DNA to protein.It is feasible to build these DBs locally (we keep them locally in sync in our lab) but for blasting once in a while a few sequences it might be overkill indeed.
Can you tell what exactly you want to do? what is your input file, DNA, protein?
Syntax-wise the answer of Santosh Anand is correct to start a remote blast indeed
I am looking to blast DNA sequences to identify the organism of origin. I have thousands of entries, so I need to automate the process. I reckon the nucleotide database (the whole non-redundant collection) would be quite big to build locally and anyway, how could I get all the NCBI entries in one go?
OK, yes,
blastn
is your weapon of choice then indeed.well, the DB is substantial yes, but you can download the complete DB in fasta format (and then build it locally yourself) or download the pre-formatted DB from NCBI. have a look at the ftp site of NCBI , there they offer all this