Aligning two fasta files with multiple sequences
3.4 years ago

Hi! I am aligning two FASTA files with multiple sequences (~2000 sequences each), and I would like to keep only the perfect matches in a separate file for further analysis. Can someone help me with code or an app to do that?

Protein Fasta Sequence Alignment • 2.0k views

Use command line blast+ or blat along with a tab delimited format (e.g. -outfmt 6 or psl respectively). Parse the results to keep identical hits.
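As a rough illustration of the "parse the results" step, here is a minimal Python sketch that filters BLAST+ tabular output (`-outfmt 6` with its default 12 columns) down to hits with 100% identity, no mismatches, and no gap openings. The function name `identical_hits` and the two example rows are made up for illustration. Note that this only checks identity over the aligned region; confirming that two sequences are identical over their full length would also require comparing the alignment length against the sequence lengths, which the default columns do not include.

```python
import csv
import io

# Column order of BLAST+ tabular output (-outfmt 6) with the default fields.
DEFAULT_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def identical_hits(handle):
    """Yield (query_id, subject_id, alignment_length) for gap-free, 100%-identity hits."""
    reader = csv.reader(handle, delimiter="\t")
    for fields in reader:
        if not fields or fields[0].startswith("#"):
            continue  # skip blank lines and comment lines
        row = dict(zip(DEFAULT_COLUMNS, fields))
        if (
            float(row["pident"]) == 100.0
            and int(row["mismatch"]) == 0
            and int(row["gapopen"]) == 0
        ):
            yield row["qseqid"], row["sseqid"], int(row["length"])


if __name__ == "__main__":
    # Two invented rows: the first is a perfect hit, the second is not.
    example = (
        "seqA\tseqX\t100.000\t250\t0\t0\t1\t250\t1\t250\t1e-180\t510\n"
        "seqB\tseqY\t98.400\t250\t4\t0\t1\t250\t1\t250\t1e-170\t490\n"
    )
    for hit in identical_hits(io.StringIO(example)):
        print(*hit, sep="\t")
```

To use it on a real results file, open the file and pass the handle, for example `with open("hits.tsv") as fh: ...`, where `hits.tsv` is whatever name was given to the BLAST output.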

Dear GenoMax, can you explain further, please? I am a bit lost. How do I specify the input and output files? Could you give me a simpler description, such as a line of code like this?

"blast+ inputfile.fasta > outputfile.fasta"

Why don't you try on your own first? Figuring it out on your own is faster than waiting for someone to spoon feed you here.

I would love to do that, but I am completely lost and I don't understand your description above.

You will need to use blast+ from NCBI. You can find instructions here. This is the same package that you may have used on the web at NCBI, but you will need to run it on the Unix command line.

Using blat can be simpler in some ways, but you will need to be reasonably proficient with the Unix command line. You can find a user guide for blat here.
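For the blast+ route, the question of how to name the input and output files can be sketched as follows. This is only an illustration, assuming protein sequences (as the thread's tags suggest) and hypothetical file names (`file1.fasta`, `file2.fasta`, `hits.tsv`, and the database name `subject_db`). The helper `blastp_commands` is invented here; it only builds the two command lines, one for `makeblastdb` and one for `blastp`, and prints them rather than running anything.

```python
import shlex


def blastp_commands(query_fasta, subject_fasta, out_tsv, db_name="subject_db"):
    """Return two blast+ command lines (as argument lists) for a protein search."""
    # Step 1: build a protein BLAST database from the second FASTA file.
    make_db = [
        "makeblastdb",
        "-in", subject_fasta,
        "-dbtype", "prot",
        "-out", db_name,
    ]
    # Step 2: search the first FASTA file against that database and
    # write tab-delimited results (-outfmt 6) to out_tsv.
    search = [
        "blastp",
        "-query", query_fasta,
        "-db", db_name,
        "-outfmt", "6",
        "-out", out_tsv,
    ]
    return make_db, search


if __name__ == "__main__":
    # Hypothetical file names; replace with your own.
    for cmd in blastp_commands("file1.fasta", "file2.fasta", "hits.tsv"):
        print(shlex.join(cmd))
    # To actually run a command (requires blast+ to be installed and on PATH):
    #   import subprocess
    #   subprocess.run(cmd, check=True)
```

The printed lines can be pasted into a terminal as they are. For nucleotide sequences, the equivalent would be `blastn` with `-dbtype nucl`.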

Thank you very much.

MAFFT has an option (Output order) for this, where the same sequences are aligned one after another.

Thank you.
