How to extract variants from multiple sequence alignment files?
3.4 years ago
FadyNabil ▴ 20

I have a ClustalW alignment file in .aln format, and I want to extract only the variants from it.

clustalw biopython fasta fastq • 1.9k views
I tried it, but I got this error: Alignment sequences.fasta contains sequences of unequal length. Expected length is 712 but got 688 in sequence MT364213.1

Did you use Pierre's script or https://github.com/sanger-pathogens/snp-sites?
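For reference, the core idea behind pulling variants out of an alignment can be sketched in a few lines of plain Python. This is only an illustration, not the implementation used by snp-sites or by Pierre's script; the function name `variable_columns` and the example sequences are made up, and it assumes the sequences are already aligned (equal length) and that a gap counts as a distinct state.

```python
def variable_columns(aligned):
    """Return (position, {name: base}) for each column where sequences differ.

    `aligned` maps sequence names to aligned strings of equal length.
    Positions are 1-based alignment columns; a gap ('-') counts as a state.
    """
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) > 1:
        raise ValueError(f"sequences are not aligned: lengths {sorted(lengths)}")
    names = list(aligned)
    variants = []
    # zip(*...) walks the alignment column by column.
    columns = zip(*(aligned[name].upper() for name in names))
    for pos, column in enumerate(columns, start=1):
        if len(set(column)) > 1:
            variants.append((pos, dict(zip(names, column))))
    return variants


# Hypothetical toy alignment, for illustration only.
example = {
    "ref": "ACGTACGT",
    "seq1": "ACGTTCGT",
    "seq2": "ACG-ACGA",
}
for pos, calls in variable_columns(example):
    print(pos, calls)
```

Positions here are alignment columns, so they only match reference coordinates if the reference row has no gaps before that column.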

That link requires an aligned FASTA file, which is not the same as the .aln file you have. I suggest you use Pierre's script in this answer: Getting A Vcf File From A Fasta Alignment
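If you want to try the FASTA-based tool anyway, the .aln file can be converted to aligned FASTA first. Below is a rough sketch of a minimal Clustal reader in plain Python. It assumes a standard ClustalW layout (a header line starting with "CLUSTAL", blocks of "name sequence" lines, and conservation lines that start with whitespace); the function name `clustal_to_fasta` and the toy alignment are invented for the example.

```python
def clustal_to_fasta(aln_text):
    """Convert Clustal-format alignment text to aligned FASTA text."""
    sequences = {}  # name -> list of segments, in order of first appearance
    for line in aln_text.splitlines():
        # Skip blank lines, the header, and conservation lines
        # (the '*', ':' and '.' rows, which start with whitespace).
        if not line.strip() or line.startswith("CLUSTAL") or line[0].isspace():
            continue
        fields = line.split()
        if len(fields) < 2:
            continue  # not a "name sequence" line
        # fields[2], if present, is the optional residue count; it is ignored.
        name, segment = fields[0], fields[1]
        sequences.setdefault(name, []).append(segment)
    return "".join(
        f">{name}\n{''.join(parts)}\n" for name, parts in sequences.items()
    )


# Hypothetical two-block alignment, for illustration only.
aln = "\n".join([
    "CLUSTAL W (1.83) multiple sequence alignment",
    "",
    "seqA      ACGT-ACG",
    "seqB      ACGTTACG",
    "          **** ***",
    "",
    "seqA      TTGA",
    "seqB      TTCA",
    "          ** *",
])
print(clustal_to_fasta(aln))
```

Since the question is tagged biopython: `Bio.AlignIO.convert("input.aln", "clustal", "output.fasta", "fasta")` should do the same conversion with a more robust parser (the file names here are placeholders).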

I found that the sequences must all have the same length.

If your sequences aren't the same length, they aren't aligned, so you need to check your input data properly.
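A quick way to check this is to list the length of every record in the FASTA file and flag the ones that differ from the rest. The sketch below is plain Python under a few assumptions: the helper names `fasta_lengths` and `mismatched_lengths` are made up, the most common length is taken as the expected one, and the toy records are much shorter than the real 712-column alignment.

```python
from collections import Counter


def fasta_lengths(fasta_text):
    """Map each FASTA record ID (first word of the header) to its length."""
    lengths = {}
    name = None
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split(maxsplit=1)
            name = parts[0] if parts else ""
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


def mismatched_lengths(lengths):
    """Return the records whose length differs from the most common length."""
    if not lengths:
        return {}
    expected = Counter(lengths.values()).most_common(1)[0][0]
    return {name: n for name, n in lengths.items() if n != expected}


# Hypothetical toy records; only the ID MT364213.1 comes from the error above.
fasta = "\n".join([
    ">seqA example record",
    "ACGT",
    "ACGT",
    ">MT364213.1",
    "ACGTAC",
    ">seqB",
    "ACGT-CGT",
])
print(mismatched_lengths(fasta_lengths(fasta)))
```

If any record is flagged, the file is most likely the unaligned input rather than the alignment output, so it is worth rerunning the aligner and using its output file.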
