I'd like to change the header line of each entry in my FASTA files
from:
gi|556503834|ref|NC_000913.3|Escherichia coli str. K-12 substr. MG1655, complete genome
to:
Escherichia_coli_str._K-12_substr._MG1655
In other words, I want to remove the accession number and keep only the species name, with all spaces replaced by underscores. Either R or Unix is OK.
Thank you very much.
Always mention what you've tried. Your question suggests that you just want an answer and are not interested in learning how to get there, which is not how anyone should approach this.
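For reference, here is a minimal sketch of the header rewrite described in the question, written in Python rather than R or Unix tools. It rests on a few assumptions that the question does not confirm: header lines in the real files start with ">", the species name is whatever follows the last "|", and the name ends at the first comma. The function names `simplify_header` and `rewrite_fasta` are made up for illustration.

```python
import re


def simplify_header(header: str) -> str:
    """Reduce an NCBI-style header to an underscore-joined organism name."""
    # Drop a leading ">" (if present) and surrounding whitespace/newline.
    text = header.lstrip(">").strip()
    # Assumption: the description is everything after the last "|".
    description = text.rsplit("|", 1)[-1].strip()
    # Assumption: the name ends at the first comma (", complete genome").
    name = description.split(",", 1)[0].strip()
    # Replace each run of whitespace with a single underscore.
    return re.sub(r"\s+", "_", name)


def rewrite_fasta(lines):
    """Yield FASTA lines with header lines simplified; sequence lines pass through."""
    for line in lines:
        if line.startswith(">"):
            yield ">" + simplify_header(line) + "\n"
        else:
            yield line
```

To apply it to a file, open the input and output files and pass the input handle through the generator, for example `out.writelines(rewrite_fasta(inp))`. Headers that contain no "|" or no comma are left with only their spaces replaced, so it is worth checking a few records by eye before overwriting anything.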