In the silkworm genome paper, it is mentioned that "Of the 323 wing-development genes known in D. melanogaster, 300 are found in silkworm." The author did not give any reference to how they got the 323 wing-development genes. My question is how to get the list of genes involved in a biological process. Take fruitfly for example. I tried to get the list of wing-development genes from flybase using QueryBuilder. However, I only got 33 genes, far less than 323. I stored my query and the query parameter is as follows:
target=fbgn
output=HitList
format=-
species=Dmel
guistyle=1
AND go go-id CV term FBbt:00004729 no
Did I make some mistake or are there some other ways to do this? Any advice is welcome.
In my experience, the FlyBase query builder often does not work or produces results that are difficult to interpret. If you end up using GO based queries, I suggest using the BioMart or FlyMine systems instead for more robust and reproducible results.
Casey, unfortunately, BioMart does not deal with the tree-structure of the gene ontology, even though it would be possible to query for the cumulative list of identifiers in my answer below. FlyMine does deal with the tree-structure, but I looked up 'wing disc development' and got 277 distinct gene identifiers as result. I find that suspect, because that number is less what Simon, Pierre and I have found using independent methods. Thanks for the FlyMine pointer though -- I keep forgetting it exists. :D