Question

tab to fasta file conversion

1

Entering edit mode

6.4 years ago

blooming.daisy333 ▴ 110

can someone kindly help me out how to convert tab delimited protein file (ID in one column and sequence in second column) into fasta file ?? any simple solution plz

thanks

rna-seq • 11k views

ADD COMMENT • link updated 6.4 years ago by nora ▴ 40 • written 6.4 years ago by blooming.daisy333 ▴ 110

0

Entering edit mode

Try something-there are multiple solutions. If you get stuck, post your efforts and errors.

ADD REPLY • link 6.4 years ago by st.ph.n ★ 2.7k

0

Entering edit mode

some example data?

ADD REPLY • link 6.4 years ago by cpad0112 21k

0

Entering edit mode

Something like

protein1      AGCHCGCGAC
protein2      GAGCSFATHCK

ADD REPLY • link 6.4 years ago by GenoMax 147k

0

Entering edit mode

Biopython version: A: Biopython: SeqIO.write () function to write dictionary object to fasta file

ADD REPLY • link 6.4 years ago by Eric Lim ★ 2.2k

0

Entering edit mode

it is easy to make, using for example the interface galaxy, this function is present here

ADD REPLY • link 6.4 years ago by nora ▴ 40

score 5 · Answer 1 · 2018-07-02

5

Entering edit mode

6.4 years ago

drkennetz ▴ 560

I think you could do this with awk, give this a try:

awk '{print ">"$1"\n"$2}' tab.tsv > seqs.fa

Let me know if that works for you!

Dennis

edit: $1 is your name column and $2 is your sequence column, so switch those if the order is sequence, name.

ADD COMMENT • link 6.4 years ago by drkennetz ▴ 560

score 3 · Answer 2 · 2018-07-02

3

Entering edit mode

6.4 years ago

cpad0112 21k

if first column doesn't have >:

awk -v OFS="\n" '{print ">"$1,$2}' test.txt
sed -e 's/^/>/;s/\t/\n/g' test.txt 
parallel  --colsep '\t'  echo -e '\>{1}\\n{2}'  :::: test.txt

ADD COMMENT • link 6.4 years ago by cpad0112 21k