How To Get The Coding Sequences (Cds) Based On The Protein Aminoacid Sequence
4
0
Entering edit mode
11.5 years ago
harrymao20 • 0

Now I have protein (NCBI) NP_391988.1 ,and its mRNA file NM_033668.2,how can i get the CDS correspond to the Protein sequence

cds • 9.5k views
ADD COMMENT
0
Entering edit mode

if you have the aminoacid sequence, you have the CDS.

ADD REPLY
2
Entering edit mode
11.5 years ago
Gabriel R. ★ 2.9k

go to ncbi:

http://www.ncbi.nlm.nih.gov/nuccore/NM_033668?report=GenBank

  1. next to Send: click on the arrow.
  2. click on coding sequence
  3. make sure fasta nucleotide is checked
  4. download
ADD COMMENT
1
Entering edit mode
7.8 years ago
seancho ▴ 90

The most straight forward method is what Gabriel R. has suggested. The alternative, which allows documenting it at the same time, is to do so from the command line via Edirect.

esearch -db nucleotide -query "NM_033668.2" | efetch -format fasta_cds_na > cds_nm033668-2.fasta
ADD COMMENT
0
Entering edit mode
9.2 years ago

You can use a software we have recently published I-PV (http://i-pv.org/). You will need perl and circos installed to be able to use it. It needs an mRNA sequence and a fasta file of your protein sequence ( the one that starts with NP_...). It will automatically generate a protein graph for your CDS. You can than compare each aminoacid and its codon...etc

ADD COMMENT
0
Entering edit mode
7.8 years ago

Click View - DNA in the UCSC Genome Browser, search for your identifiers, click on the transcript and click "Get Protein sequence".

ADD COMMENT

Login before adding your answer.

Traffic: 1987 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6