there on "How to extract the Summary text from records in Gene" subtittle it appears that its conveniant to use a perl file named geneDocSum.pl that I can download from here: https://ftp.ncbi.nih.gov/gene/tools/
But I dont understand the usage of the tool when querying gene information. As an example they provide this:
geneDocSum.pl -q "has_summary[prop] AND human[orgn]" -o tab -t Name -t Summary
UPDATE: I just noticed that you are specifically looking for the gene summary format, which is not available (yet) on NCBI datasets, so that might not be helpful. Sorry about that.
Also, this species Ulva compressa doesn't have any gene summaries. The genes in the gene table you posted are either from the mitochondria or chloroplast. There are two genomes in the same genus: https://www.ncbi.nlm.nih.gov/labs/data-hub/taxonomy/3118/, but neither has nuclear gene annotations. Let me know if there's anything else I can help you with.
Hi,
In the link you posted, you already have a list of all genes available for the reference genome of that algae species you posted. From that page, you can click in the Download (circled in red) button and choose what you need:
Data package: it includes protein, sequence and transcript sequences
(you choose which files you want)
Data table: downloads a tsv file with the columns being displayed
(you can add or remove columns by clicking in Select Columns (circled in blue)). If you only need the gene-ids, you can have only that column selected.
Let me know if that's the info you need. NCBI Datasets also has a command line tool that might be useful for what you need.
You can try this:
orgn = Organism you are interested in
so replace human with your organism of choice.