Hello everyone, I have a DNA Sequences data set of 5000 sequences. The sequences header contains the sequences' names and dates. I want to prepare a data file that comprises taxon names from sequence alignment and dates separated by spaces. My header formate is as given below
>A/Kilifi/100/2015|A /H3N2|1604249|12/14/2015
Required formate is (space between name and date)
>A/Kilifi/100/2015|A /H3N2|1604249| 12/14/2015
I cant do it manually because it will take a lot of time. if anyone knows a quick method to create a space. let me know. it would be really appreciated. i am looking forward to a positive response. Thankyou
have a look at
sed
https://linux.die.net/man/1/sedThankyou . I will check the link.
Can you use the command line? Have you tried any of the typical tools that would come to mind, e.g.
awk
orsed
?I used the sed previously for removing the spaces in the header. but i could not succeed in creating the spaces between name and date.