I have a CSV file that looks like this:
201200175,A/name1/175/2012
201200287,A/name2/287/2012
201200845,A/name3/845/2012
Currently my fasta headers look like:
>201200175_AA
>201200175_AB
>201200175_BB
and I want to change it to:
>A/name1/175/2012_AA
>A/name1/175/2012_AB
>A/name1/175/2012_BB
I want to preserve the suffix (_AA etc..). I have multiple fasta files, and they are all multifastas.
I was wondering if there is a quicker way in bash rather than writing out some Perl...
Do the fasta files have linebreaks in seqs? If not, then perhaps e.g.
can you explain the commands, it would be so helpful to me
man join
,man sed
,man sort
,man paste
You can try them out yourself. e.g.
sed 's/,/\t/' file.csv
you will see that it replaces the first commas with tabs,paste - - <file.fa
etc. you can see yourself..Thanks, no they don't.