Hi, I am new to genome assembly. I have got 6 contigs after genome assembly. I wonder if I need to use all 6 contigs to merge into one with tools like mauve to get one contig or could I proceed with the longest contig for alignment with reference sequence? I would appreciate any suggestion.
Thanks
Thank you for your suggestions. Yes, it is a bacterial genome from Erwinia spp. I don't have a QUAST report to share for now. I have a question regarding QUAST. From the script, it looks like the contigs should be in separate files and also it seems like we need a annotation file too. I have a single file with all contigs. I wonder if I need to separate each contig in separate file. Can we run the QUAST without annotation file?
You do not need separated files just pass your contig file to quast. You do not need a reference as well but is cool to use one:
If you have problems running quast, you can use it through the browser.
Thank you for the suggestion. That is really helpful. I have QUAST report. How could we decide if we need to make a scaffold from the contigs or just proceed with the largest contig? I checked among the contigs and found that the smallest contig is a plasmid.
Well, if you only want the chromosome, you must map the contigs in your reference and extract those that map to it. You will probably map only the largest contig. If you map more than the largest contig, you will have to build a chromosomal scaffold, as you have so few contigs and good qualities this will be very easy, just map the contigs onto the reference how this person did. But do not throw away contigs that do not map, they are large and of good quality and may have valuable data in them, such as resistance and virulence genes.