Question

Is it possible to convert several protein ID in to Fasta format?

0

Entering edit mode

4.5 years ago

tpm ▴ 30

Like this

1>seqid1 
MKRISTTITTTITITTGNGAG 

2>seqid2 
MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA
   YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV

sequence fasta • 972 views

ADD COMMENT • link 4.5 years ago by tpm ▴ 30

0

Entering edit mode

These are already in fasta format assuming you added 1,2 before the records.

ADD REPLY • link 4.5 years ago by GenoMax 147k

0

Entering edit mode

I did not know how I can edit properly on the question I asked, my bad :), but suppose I have 10 gene IDs, how can I fetch the fasta format?

ADD REPLY • link 4.5 years ago by tpm ▴ 30

0

Entering edit mode

It will depend on what kind of ID's those are. Can you provide examples?

ADD REPLY • link 4.5 years ago by GenoMax 147k

0

Entering edit mode

For example: TATE OPPC GLSA1 RNPA RATB FTSI GALP FENR MNTR YGIS DCOR PTHC YQCA FLAW

ADD REPLY • link 4.5 years ago by tpm ▴ 30

0

Entering edit mode

Please use ADD REPLY/ADD COMMENT when responding to existing posts to keep threads logically organized. SUBMIT ANSWER is only for new answers to original question.

What organism do you want to sequences from? e.g. TATE if that is the tatE protein has thousands of sequences in GenBank.

Do you need the sequences for a specific organism? You can find the genome page of the organism you want at NCBI. e.g. Kelbsiella pneumoniae. All proteins can be downloaded in fasta format by using Protein link in the top box.
If you just need at tatE genes then searching the protein data base would be the appropriate solution. Then use Send to drop-down at top of page to send the data to a multi-fasta file.

ADD REPLY • link 4.5 years ago by GenoMax 147k

0

Entering edit mode

I am working with an E. coli BW25113 strain. Suppose I have 200 gene IDs, is it possible to extract their fast files simultaneously? Not 1 by 1?

ADD REPLY • link 4.5 years ago by tpm ▴ 30

0

Entering edit mode

Here is the directory containing the proteins from this strain at Ensembl Bacteria.
Download the fasta file with the sequences.

wget ftp://ftp.ensemblgenomes.org/pub/bacteria/release-47/fasta/bacteria_87_collection/escherichia_coli_bw25113/pep/Escherichia_coli_bw25113.ASM75055v1.pep.all.fa.gz
Unzip the file

gunzip Escherichia_coli_bw25113.ASM75055v1.pep.all.fa.gz

Then use one of the solutions here to pull out genes you need: Extract fasta sequences from a file using a list in another file.

ADD REPLY • link 4.5 years ago by GenoMax 147k