How to identify gaps in a genome?
1
0
Entering edit mode
7 months ago
rj.rezwan ▴ 10

Hi, I assembled the genome using PacBio HiFi reads and Hi-C reads. I want to know how can I identify the gaps in genome. Is there any good tool for this? I shall be grateful to you.

awk seqkit assembly fasta genome • 686 views
ADD COMMENT
0
Entering edit mode

Find related genomes (preferably finished) from NCBI and align your own to get some idea of coverage. Run BUSCO on your genome to see if you have captured most of the gene content expected to be present.

ADD REPLY
0
Entering edit mode

GenoMax I have higher BUSCO than the previously published genome but I am looking to find the "NNNNNN" regions in my genome that exhibit the gaps.

ADD REPLY
0
Entering edit mode
7 months ago

You can use seqtk to find gaps in a fasta

https://github.com/lh3/seqtk

seqtk gap
Usage: seqtk gap [-l 50] <in.fa>
ADD COMMENT

Login before adding your answer.

Traffic: 2318 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6