Filter nt database remotely
1
0
Entering edit mode
4.9 years ago

How can I filter nt database remotely, to include only some taxa? (I'm sorry if this is a naive question)

nt biopython blast • 1.2k views
ADD COMMENT
1
Entering edit mode
4.9 years ago
GenoMax 148k

You can't do this (in the sense of filter remotely and download only what you need).

  1. You could run a remote blast search at NCBI with a valid entrez query (simulating remote filtering).
  2. You could also search NCBI with species/genes of interest and download the search sequences from the web page results to create a local sequence search database from those.
ADD COMMENT
0
Entering edit mode

Can I do this: https://imgur.com/tu8lv27 with entrez?

ADD REPLY
0
Entering edit mode

Yes. That is the kind of search term you will use as shown in the search box. Here is a full list of those terms

ADD REPLY
0
Entering edit mode

I'm using entrez, and I did this command: Entrez.esearch(db='nucleotide',term="Mammals AND cox1[Gene]", idtype='acc') and it counts me 61818, but when I list the IDS I only get some of them. Is this the maximum number of IDS to print, or the result by itself?

ADD REPLY
1
Entering edit mode

Looks like you are doing this in python. Add retmax=1000000 to get all results.

ADD REPLY
0
Entering edit mode

That worked, thanks that worked. I want to delete the fasta sequences of that query so I did this line: Entrez.efetch(db='nucleotide', id=id_list,retmode="fasta"), but I can't retrive the fasta sequences.

ADD REPLY

Login before adding your answer.

Traffic: 2139 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6