Is it possible to convert several protein ID in to Fasta format?
0
0
Entering edit mode
4.5 years ago
tpm ▴ 30

Like this

1>seqid1 
MKRISTTITTTITITTGNGAG 

2>seqid2 
MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA
   YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV
sequence fasta • 977 views
ADD COMMENT
0
Entering edit mode

These are already in fasta format assuming you added 1,2 before the records.

ADD REPLY
0
Entering edit mode

I did not know how I can edit properly on the question I asked, my bad :), but suppose I have 10 gene IDs, how can I fetch the fasta format?

ADD REPLY
0
Entering edit mode

It will depend on what kind of ID's those are. Can you provide examples?

ADD REPLY
0
Entering edit mode

For example: TATE OPPC GLSA1 RNPA RATB FTSI GALP FENR MNTR YGIS DCOR PTHC YQCA FLAW

ADD REPLY
0
Entering edit mode

Please use ADD REPLY/ADD COMMENT when responding to existing posts to keep threads logically organized. SUBMIT ANSWER is only for new answers to original question.

What organism do you want to sequences from? e.g. TATE if that is the tatE protein has thousands of sequences in GenBank.

  1. Do you need the sequences for a specific organism? You can find the genome page of the organism you want at NCBI. e.g. Kelbsiella pneumoniae. All proteins can be downloaded in fasta format by using Protein link in the top box.
  2. If you just need at tatE genes then searching the protein data base would be the appropriate solution. Then use Send to drop-down at top of page to send the data to a multi-fasta file.
ADD REPLY
0
Entering edit mode

I am working with an E. coli BW25113 strain. Suppose I have 200 gene IDs, is it possible to extract their fast files simultaneously? Not 1 by 1?

ADD REPLY
0
Entering edit mode
  1. Here is the directory containing the proteins from this strain at Ensembl Bacteria.
  2. Download the fasta file with the sequences.

    wget ftp://ftp.ensemblgenomes.org/pub/bacteria/release-47/fasta/bacteria_87_collection/escherichia_coli_bw25113/pep/Escherichia_coli_bw25113.ASM75055v1.pep.all.fa.gz

  3. Unzip the file

    gunzip Escherichia_coli_bw25113.ASM75055v1.pep.all.fa.gz

Then use one of the solutions here to pull out genes you need: Extract fasta sequences from a file using a list in another file.

ADD REPLY

Login before adding your answer.

Traffic: 1392 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6