Entering edit mode
5.6 years ago
biobiu
▴
150
Hi, I plan to sequence the genome of un-referenced bacteria. My first goal is to try to define functional elements in the genome and next to predict their function.
My general idea is to assemble the reads into contigs, perform ORF calling, blast genes against DBs, translate the genes into proteins and look for conserved domains.
Any other idea? Assuming you will know the proteins sequence how would you predict its function?
Just use a tried and tested annotation pipeline like Prokka.
That pretty much does everything you describe and more