Non-Redundant Data Sets Of Protein
1
0
Entering edit mode
10.7 years ago
eka0700 ▴ 30

Hello everyone,

I'm working with plant proteome. As there are huge number of proteins in a proteome, I want to filter out the protein number within a limited length of amino acids. Is there any tool available for this? And anyone for calculating the average length of amino acid sequences in plant proteomes?

Thanks in advance.

• 2.5k views
ADD COMMENT
0
Entering edit mode

What file type you're working with? Is it fasta?

ADD REPLY
0
Entering edit mode

Yes, it is fasta.

ADD REPLY
0
Entering edit mode

What have you done already? Searching "filter fasta by length" on google got me several answers: How to Filter Multi fasta by length?? ; filter sequence by length ; How to Filter the Sequence by Their Length.

ADD REPLY
0
Entering edit mode

Thanks. I've tried those. But it didn't work :(

ADD REPLY
1
Entering edit mode
10.7 years ago

Jalview can do this fairly easily, though if you use an entire genome's worth of proteins it may run very slowly. Just sort by length through calculate, and then you can highlight the range of sequences you want to remove, and just hit delete. Then save the "alignment" as a new fasta file.

ADD COMMENT
0
Entering edit mode

Is multiple sequence alignment required before using Jalview?

ADD REPLY

Login before adding your answer.

Traffic: 1820 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6