Hello,
I am struggling to convert a file of nucleotide sequences into a proper FASTA file (i.e. a ">seq_name" header line followed by the sequence on its own line). I am following the strategy below, but any other suggestion is warmly welcome. I am trying to convert the following type of file:
>kmer_1 AAAAAAAAAAAAAAAAAAAAAAAACCCACCCA
>kmer_2 AAAAAAAAAAAAAAAAAAAAAAACAGAGATGT
>kmer_3 AAAAAAAAAAAAAAAAAAAAAAACCCACCCAC
>kmer_4 AAAAAAAAAAAAAAAAAAAAAACAGAGATGTA
>kmer_5 AAAAAAAAAAAAAAAAAAAAAACCCACCCACA
>kmer_6 AAAAAAAAAAAAAAAAAAAAAAGAAGAGAAAA
>kmer_7 AAAAAAAAAAAAAAAAAAAAACCCACCCACAT
>kmer_8 AAAAAAAAAAAAAAAAAAAAAGAAGAGAAAAA
>kmer_9 AAAAAAAAAAAAAAAAAAAAAGAGAACGACAC
>kmer_10 AAAAAAAAAAAAAAAAAAAACCCACCCACATG
Into something like this:
>kmer_1
AAAAAAAAAAAAAAAAAAAAAAAACCCACCCA
>kmer_2
AAAAAAAAAAAAAAAAAAAAAAACAGAGATGT
>kmer_3
AAAAAAAAAAAAAAAAAAAAAAACCCACCCAC
>kmer_4
AAAAAAAAAAAAAAAAAAAAAACAGAGATGTA
>kmer_5
AAAAAAAAAAAAAAAAAAAAAACCCACCCACA
>kmer_6
AAAAAAAAAAAAAAAAAAAAAAGAAGAGAAAA
>kmer_7
AAAAAAAAAAAAAAAAAAAAACCCACCCACAT
>kmer_8
AAAAAAAAAAAAAAAAAAAAAGAAGAGAAAAA
>kmer_9
AAAAAAAAAAAAAAAAAAAAAGAGAACGACAC
>kmer_10
AAAAAAAAAAAAAAAAAAAACCCACCCACATG
Any idea how to do this using paste, awk, or a similar tool? Thanks in advance.
What strategy? You didn't show anything you tried.
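One possible approach with awk, as a minimal sketch: it assumes every input line holds a ">name" header and a sequence separated by whitespace, as in the example above. The function name to_fasta and the file names kmers.txt and kmers.fasta are placeholders, not anything from the original post. It also tolerates a stray space after ">", in case some headers have one.

```shell
# Hypothetical helper: reads ">name SEQUENCE" lines (from files or stdin)
# and writes two-line FASTA records (header line, then sequence line).
to_fasta() {
    awk '{
        sub(/^>[ \t]*/, ">")               # drop stray whitespace after ">"
        if (NF >= 2) print $1 "\n" $2      # header, newline, sequence
    }' "$@"
}

# Usage (placeholder file names):
# to_fasta kmers.txt > kmers.fasta
```

Lines with fewer than two fields (blank lines, for example) are skipped rather than printed, so it is worth checking that the number of records in the output matches what you expect.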