I need to extract just the set of "peptide" feature annotations as a FASTA dump, from human Swiss-Prot in the first instance
These are in the "molecule processing" sections in the form Peptide 34 – 42 9 Angiotensin 1-9 PRO_0000420659
The "fragments" are also have the same PRO_ id type so I need to keep these out, as well as signal peptides
Suggestions welcome, even better if someone could just drop them out (less than 500 I guess ?)
give us one or two example of accession number please.
P01019 would be an example.