Entering edit mode
3.5 years ago
K
▴
10
Hi,
I had consensus reads from a PacBio sequencing. I used the assembler hifiasm to create an assembly on these CCS. I got five .gfa files. I have transformed .gfa file to .fasta contigs using bandage and awk command.
What can be used to get more statistics on the assembly?
Quast and BUSCO would be my initial suggestions, but it really depends on what specific stats you're looking to compute.
bandage gave me N50, total length but I am particularly looking for coverage and depth of the assembly.
To get that, you should maps the reads back to the assembly (e.g., with minimap2 for long reads) and then use a tool like samtools or mosdepth to get the depth from the sam/bam file.