did pilon improve my genome?
2
0
Entering edit mode
13 months ago
vasudhapai • 0

base count

my doubt is if my sequence has actually improved? refseq is the reference sequence. polished is the output from pilon and contigs.fasta is my file generated from spades. pls help I used the command

Java -Xmx2048m -jar pilon-1.24.jar --genome refseq.fasta --frags sorted bam.bam --output polished --fix all

the sorted bam file is my genome aligned with refseq am doing it correctly?

genome genomeassemblies assembly spades pilon • 957 views
ADD COMMENT
0
Entering edit mode

Please do not paste screenshots of plain text content, it is counterproductive. You can copy paste the content directly here (using the code formatting option shown below), or use a GitHub Gist if the content volume exceeds allowed length here.

ADD REPLY
1
Entering edit mode
13 months ago

You can use QUAST with multiple contig sets (if available, ie through multiple rounds of polishing or different polishing tools).

https://github.com/ablab/quast

Then run multiqc on the dir with results to pick up the comparative number of SNPs and indels per 100kbp pre and post polishing. That will allow you to make a judgement.

ADD COMMENT
0
Entering edit mode
13 months ago
Corentin ▴ 610

You should have log files from pilon, which indicates the % of corrected bases per contig and other useful metrics.

As mentioned by colindaven, you need to compare the genomes pre and post polishing to judge wether pilon improved your assembly.

You can use Quast to compare contig statistics, BUSCO to assess completeness (but I suspect pilon will not change these values too much) and / or align your assemblies to closely related species, if available.

ADD COMMENT

Login before adding your answer.

Traffic: 2277 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6