TL;DR: When I query with a full species name, the NCBI API returns other species from the same genus instead of only the species I asked for.
I am trying to search the NCBI databases for genomic data on specific species. I am using NCBI Entrez Direct (EDirect) with the following one-liner:
esearch -db assembly -query "{insert species name here}" | esummary | xtract -pattern DocumentSummary -element SpeciesName
An example run with an actual species name:
esearch -db assembly -query "Aristolochia cretica" | esummary | xtract -pattern DocumentSummary -element SpeciesName
This produces the output:
Aristolochia contorta
Aristolochia fimbriata
This is an issue for me because I requested "Aristolochia cretica" and got "Aristolochia contorta" and "Aristolochia fimbriata".
Second example:
esearch -db assembly -query "Physaria geyeri" | esummary | xtract -pattern DocumentSummary -element SpeciesName
Output:
Physaria ovalifolia
Physaria fendleri
Physaria acutifolia
Third example:
esearch -db assembly -query "Vicia serinica" | esummary | xtract -pattern DocumentSummary -element SpeciesName
Output:
Vicia sativa
Vicia sativa
Vicia faba
I want the API to take the specific epithet into account as well, not just the genus. I am following this official manual and may have missed something, but I cannot find a way to account for this behaviour.
Is writing something myself the only solution? Or is there a flag or setting I am missing?
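For reference, the kind of thing I would otherwise write myself is a small post-filter on the SpeciesName output. This is only a minimal sketch, assuming exact string equality with the requested name is the right test; filter_exact_species is a name I made up, not an EDirect command, and the demo input is just the output from my first example:

```shell
#!/bin/sh
# Hypothetical helper (not part of EDirect): keep only the lines on stdin
# that are exactly equal to the requested species name.
filter_exact_species() {
  # -F: treat the name as a fixed string, -x: require a whole-line match.
  # "|| true" keeps the exit status zero when nothing matches.
  grep -Fx -- "$1" || true
}

# Demo on the SpeciesName lines returned for "Aristolochia cretica";
# no network access is needed for this part.
printf '%s\n' "Aristolochia contorta" "Aristolochia fimbriata" |
  filter_exact_species "Aristolochia cretica"
# Prints nothing, because neither returned record is Aristolochia cretica.
```

In the real pipeline this would be appended to the one-liner, i.e. `... | xtract -pattern DocumentSummary -element SpeciesName | filter_exact_species "Aristolochia cretica"`. It only hides the unwanted records after the fact; it does not make the search itself any stricter, which is what I am really after.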