So what I am doing: I am analyzing sequenced transcriptome of eukaryotic host + bacterias. Host is known in advance, and goal is to identify pathogens. I'm interested if it is possible to identify those bacterias by their CDSs found in that transcriptome.
For example, is it possible to identify bacteria by the subset of its CDSs which are found in a sample? Is there some database which holds which subset of CDSs must be found (active) in order to identify certain bacteria?