Hello,
First of all, sorry for this noobish question. Secondly, I posted the same question on StackExchange, but I don't think it will get an answer there (low traffic, and the question was closed as "not bioinformatics").
I'm wondering whether it is possible, by bioinformatic means only, to "extract" genes and, from those, proteins and natural compounds from plant genome data that is available on NCBI (please see https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_029618835.1/ as an example). For example: just by processing the data from the link above, could we get to flavoxanthin or any other substance?
What other useful information, from a chemical compound perspective, can be found by digging into the genome itself?
Any hints will be greatly appreciated.
Thank you!
Not sure if this answers your question, but have you tried looking at the taxonomy browser for this species? See https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=41496. It seems there are some annotated genes and proteins that you could find there. But I agree with the other comment that the assembly seems premature at this stage, with no assembled chromosomes.
Yes, it helps, thank you. But I was wondering whether a kind of "de novo" annotation can be made (if that's the correct term). Please see my reply to the other comment. Thank you!
I am by no stretch of the imagination an expert in annotating "compounds", but regarding what you say in the other comment about "guessing" what a plant can or cannot contain: I am guessing (as GenoMax says) that this will mostly revolve around predicting genes and their protein products. So if your question is whether, given a stretch of the genome, you can guess how many and which genes are present and what their protein products are, then the answer, to the best of my knowledge, is: kind of. I would refer you to this review, which lists these methods. With them, you could guess that a stretch of DNA codes for protein X, but this guess will always come with some uncertainty.
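To give a feel for what "guessing" a protein from a stretch of DNA means at the most basic level, here is a minimal Python sketch of a naive open reading frame (ORF) scan. It is only an illustration, not how real plant annotation is done: it ignores introns, splicing and any evidence from related species, all of which dedicated gene predictors account for. The function names (`reverse_complement`, `find_orfs`) and the toy sequence are made up for this example, and it assumes the standard genetic code.

```python
# Naive ORF scan: look for ATG ... stop codon in all six reading frames.
# Illustration only: no introns, no splicing, no homology evidence.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_aa=50):
    """Return (strand, start, protein) for each complete ATG-to-stop ORF.

    `start` is the 0-based position of the ATG on the scanned strand.
    ORFs shorter than `min_aa` residues, or without a stop codon, are skipped.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            i = frame
            while i + 3 <= len(s):
                if s[i:i + 3] != "ATG":
                    i += 3
                    continue
                # Translate codon by codon until a stop or the sequence end.
                protein, j, found_stop = [], i, False
                while j + 3 <= len(s):
                    aa = CODON_TABLE.get(s[j:j + 3], "X")  # X = ambiguous codon
                    if aa == "*":
                        found_stop = True
                        break
                    protein.append(aa)
                    j += 3
                if found_stop and len(protein) >= min_aa:
                    orfs.append((strand, i, "".join(protein)))
                # Resume after the stop codon, or stop scanning this frame.
                i = j + 3 if found_stop else len(s)
    return orfs


if __name__ == "__main__":
    toy = "CCATGGCTAAATTTTGACC"  # made-up sequence, not from the assembly
    for strand, start, protein in find_orfs(toy, min_aa=3):
        print(strand, start, protein)
```

Even when a scan like this finds a plausible protein, you would still have to compare it against known enzymes to guess its function, and then reason about which pathway (and so which compound) it might take part in. That is where the uncertainty piles up.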
The link you sent is a gold mine for me, thank you for it manaswwm :D