Hi, all.
I would like to edit headers from fasta.
I have fasta with random header as following(headers are separated by space);
>3R5.1a wormpep=CE24758 gene=WBGene00007065 locus=pot-3 status=Confirmed uniprot=G5EFG7 insdc=CAA21777.2 product="POT1PC domain-containing protein"
>2RSSE.1a wormpep=CE32785 gene=WBGene00007064 locus=rga-9 status=Confirmed uniprot=A4F337 insdc=CCD61138.1 product="Rho-GAP domain-containing protein"
>2L52.1a wormpep=CE32090 gene=WBGene00007063 status=Confirmed uniprot=A4F336 insdc=CCD61130.1
I would like to edit it like a following;
>3R5.1a gene=WBGene00007065 locus=pot-3
>2RSSE.1a gene=WBGene00007064 locus=rga-9
>2L52.1a gene=WBGene00007064 -
In fact, this file is so long that command operations are required as you know. But I don't know how to edit random header. Could you please give me a help?
I really appreciate your help in advance! Thank you.
One lazy way:
If
locus
field is not present in every header then whatever is next will be picked up e.g.status
in example above in line 3.