Hi,
I am fairly new to bioinformatics. I have the genome sequences of two closely related bacteria (or, more accurately, both genome sequences are publicly available). One bacterium grows much faster under a particular set of growth conditions than the other. I'm trying to determine what genes may be responsible for this improved growth. How do I compare these two genomes to find out what genes are different between the two? And is there a way to find out what the functions (e.g. GO terms) for the different genes are or will I just get a list of gene names as output?
Thanks for any help you can offer.
It may depend on how "closely related" the genomes are. There are tools like Mauve that can give you a birds-eye view of the genome organization.
If the two genomes you are interested in are in MBGD then that would be one place to start.
Thanks! I tried using Mauve, but got an error when trying to align my two sequences. I'm trying to troubleshoot the error.
Both genomes are in MBGD, so I'll check that out too.
How would you define different? You are looking for point mutations, mutations in functional domains, copy number variants, regulatory variation,...? All are possible sources of difference between the two species. There is definitely a bioinformatics answer to your question, but you need to make sure you ask a "biologically-sound" question ;-)
That's a good question. The overall goal of this project is to engineer the strain that grows less well to see if we can get it to grow better (the strain that grows less well has some other properties that we want). I am trying to put together a list of genes to target for engineering. The method we are using for strain engineering will allow me to make tens of thousands of targeted mutations - point mutations, insertions, and/or deletions - in our strain and then screen for improved growth. I'm basically looking for changes in coding regions, functional domains or regulatory regions that might cause the one strain to grow better. I would also be interested in knowing if there are genes present in the organism that grows better that are not present in the one that grows worse or vice versa.
I am most interested in genes that are involved in membrane structure/composition, redox balance, photosystems, and the carbon monoxide oxidation pathway (which is present in both organisms), since there is some literature to support these genes/pathways playing a significant role in growth in the conditions I'm looking at.
It has been shown that that growth rates in bacteria largely depend on the codon usage and not much on replication related issues. see http://www.ncbi.nlm.nih.gov/pubmed/20090831 . There is also a software grownpred to predict gorwn rates. I would suggest looking at codon usage differences for high expressed genes for start. Genes involved in the condition that you found associated with growth rate could be tested for differing codon use.
Thanks for the suggestion!