I'm trying to get species and taxonomic family out of NCBI database for each of GCA_..... identification listed in list.txt file. But my output file is empty. It would be great if in my output file one row would look like: GCA_.... Species Family
As far I tried with this:
while IFS= read -r line
do
esearch -db assembly -query "$line" | xtract -pattern DocumentSummary \
-element ScientificName Division >> output.txt
done < ./list.txt
I also tried with this, but don't know how to add GCA to output:
while IFS= read -r line
do
esearch -db assembly -query $line | elink -target taxonomy | esummary | xtract -pattern DocumentSummary -element ScientificName Division AssemblyAccesion >> output.txt
done < ./list.txt
Post example GCA#.
GCA_902705575 GCA_000002455 GCA_000002595 GCA_000002975 GCA_000018645 GCA_000090985 GCA_000091205 GCA_000092065 GCA_000143455