I want to retrieve all species names along with their taxonomy IDs from the Taxonomy Database, e.g. Homo sapiens (9606).
I am aware that there are two files available via FTP:
nodes.dmp (which associates each taxid with its parent taxid)
names.dmp (which associates names with taxids).
But if I am correct, the names file contains not only species names but also names for higher-level ranks (family, class, phylum, etc.). If I use these two files, I first need to find all the taxonomy IDs at the "species" rank in nodes.dmp, and then get the names for those filtered IDs from names.dmp.
Is there a more straightforward way to retrieve only the species names along with their IDs, other than the method I described?
Check out the NCBITaxa module within the ETE3 toolkit. It will allow you to do this fairly easily.
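
If you would rather stay with the dump files, the two-step filter described in the question can be sketched with the Python standard library alone. This is a minimal sketch, not a definitive implementation. It assumes the usual taxdump layout: fields separated by a tab, a pipe and a tab, each row ending in a tab and a pipe, the rank in the third column of nodes.dmp, and the name class in the fourth column of names.dmp, with "scientific name" marking the primary name. The function names and the sample rows are made up for illustration.

```python
def parse_dmp_line(line):
    """Split one taxdump row into its fields (assumed "\t|\t" separator)."""
    line = line.rstrip("\n")
    if line.endswith("\t|"):
        line = line[:-2]  # drop the row terminator
    return line.split("\t|\t")


def species_taxids(nodes_lines):
    """Step 1: collect the taxids whose rank (third column) is 'species'."""
    ids = set()
    for line in nodes_lines:
        fields = parse_dmp_line(line)
        if len(fields) >= 3 and fields[2] == "species":
            ids.add(int(fields[0]))
    return ids


def species_names(nodes_lines, names_lines):
    """Step 2: map each species taxid to its scientific name."""
    wanted = species_taxids(nodes_lines)
    result = {}
    for line in names_lines:
        fields = parse_dmp_line(line)
        if len(fields) < 4 or fields[3] != "scientific name":
            continue  # skip synonyms, common names, etc.
        taxid = int(fields[0])
        if taxid in wanted:
            result[taxid] = fields[1]
    return result


# Tiny made-up sample in the assumed format (real rows have more columns).
sample_nodes = [
    "9605\t|\t207598\t|\tgenus\t|\n",
    "9606\t|\t9605\t|\tspecies\t|\n",
]
sample_names = [
    "9605\t|\tHomo\t|\t\t|\tscientific name\t|\n",
    "9606\t|\tHomo sapiens\t|\t\t|\tscientific name\t|\n",
    "9606\t|\thuman\t|\t\t|\tgenbank common name\t|\n",
]

result = species_names(sample_nodes, sample_names)
for taxid, name in sorted(result.items()):
    print(f"{name}({taxid})")
```

With the real files, you would pass open file handles instead of the sample lists, for example `species_names(open("nodes.dmp"), open("names.dmp"))`. Both files are read line by line, so only the set of species IDs and the resulting dictionary are held in memory.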