Best way to compare genome files similarity
3
0
Entering edit mode
2.8 years ago

Hi,

I am new to bioinformatics and would like to know if there is any recommended software to compare for example:

Let's say I have multiple files (fasta for example) from complete genomes and want to compare their similarity % or/and if they share the same origin.

Something like different bat coronavirus and human genomes.

I tried to make my own python script but it is a hassle and very time consuming.

Thanks

Fasta • 1.5k views
ADD COMMENT
0
Entering edit mode

Is there any way to do this using R?

ADD REPLY
3
Entering edit mode
2.8 years ago
Mensur Dlakic ★ 28k

FastANI rapidly compares fasta sequences and will output global identity. It is meant only for relatively similar genomes, and will not create any output if identity is below 75-80%.

ADD COMMENT
0
Entering edit mode

This sounds useful as I used pairwise using blastn and achieved 96% and 85% per ident for the sequences that were supposed to be similar when compared to the main sequence. The other sequences had bad e-values and similarity lower than 67%.

ADD REPLY
2
1
Entering edit mode
2.8 years ago

MAFFT is another tool that could also employ

https://mafft.cbrc.jp/alignment/software/

that being said the task of comparing bat coronaviruses is incomparably simpler than comparing human genomes.

radically different methodologies would be needed in the latter case.

ADD COMMENT

Login before adding your answer.

Traffic: 1869 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6