I just have completed the genome assembly with nanopore long reads and I have transcriptomics data.
In the next step, I want to do gene prediction (including the information of CDs, exon, intron, etc.), could you recommend some software developed for this purpose? Like MARKER2 for next-generation sequencing data.
In essence there is no difference in annotating a genome that is assembled from Illumina reads, pacbio reads, nanopore etc ...
The main issue here is the quality of the resulting assembly and that can indeed differ from seq technology! Illumina being the most accurate on the per-base level and the other two are less accurate. It is this important that you get whatever genome assembly up to a (quality) point that makes it worth to annotate it (polishing). On that aspect it is advised to improve the quality of your long read assemblies with Illumina to improve the per-base accuracy of your assembly. If you don't have Illumina data at hand you will have to work with what you have of course.
The biggest issue you will encounter with such kind of assemblies (long read only) are indels and wrong base calls. These from a real problem when doing annotation (especially the indels) but as far as I know there is not a single gene prediction software that handles these better or worse than an other.
bottom line, regarding software there is no specific one that do this and any tools that does gene prediction is as feasible as the next one.
ah, and I assume you mean "MAKER" in stead of MARKER? (and yes, that one is a valid one to use)
Yes, then it would make sense to use the illumina data to polish your nanopore assembly . It will only increase the quality of your assembly and have positive effects on the gene prediction result.
Thanks so much for your reply. First of all, I also have four batches of Illumina short-read data, I will be using them to correct draft assembly iteratively. I did mean MAKER2(https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-12-491), sorry for the typo.
Yes, then it would make sense to use the illumina data to polish your nanopore assembly . It will only increase the quality of your assembly and have positive effects on the gene prediction result.