Count all (non N) nucleotides within regions defined in BED file
1
0
Entering edit mode
4.0 years ago
William ★ 5.3k

Is there an easy way with some tool or library to count all (non N) nucleotides within regions defined in BED file?

Of course in combination with the reference genome fasta on which the regions are defined.

This to check that all (non N) nucleotides of the reference fasta are in 1 region of the BED file?

bed fasta • 861 views
ADD COMMENT
2
Entering edit mode
4.0 years ago
Prasad ★ 1.6k

You can extract sequences for bed regions using "getfasta" followed by base counting using any of the options provided here

ADD COMMENT
0
Entering edit mode

Thank you. I'll use the pyfastx fasta composition to compare the content of the getfasta and original fasta file. Also added that option to the linked question.

ADD REPLY

Login before adding your answer.

Traffic: 2074 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6