Hello,
I am relatively new to bioinformatics and RNA-Seq differential gene expression. I was able to generate a matrix of significantly expressed gene ids from bacteria. I would like to categorize these genes based on cellular function (membrane components, metabolism proteins, cell stress and so on). I have a list of a couple hundred genes that I could look up on NCBI gene, but am thinking there is an easier way.
I tried websites like DAVID but it does not seem to recognize the gene ids. I entered names like PD_RS00325 which NCBI database identifies as a gene in my bacteria of interest. I also tried Blast2Go in Omicsbox, but I seemed to have trouble annotating the list because it was not in the right format. Any suggestions on some way to automate this process of identification of the cell functions of these genes would be very helpful.
Thanks for your time.
Did you select the organism the gene IDs belong to before running the analysis? If I remember correctly, DAVID maps the input IDs to human gene IDs.
Yes, I added the name of the species after selecting "official_gene_id." DAVID software prompted me to convert the ids to Entrez IDs but mentioned that all my gene ids were uncharacterized. I don't think that is the case since you can search the gene ids and pull up names and nucleotide sequences in NCBI databases.