Now I have protein (NCBI) NP_391988.1 ,and its mRNA file NM_033668.2,how can i get the CDS correspond to the Protein sequence
Now I have protein (NCBI) NP_391988.1 ,and its mRNA file NM_033668.2,how can i get the CDS correspond to the Protein sequence
go to ncbi:
http://www.ncbi.nlm.nih.gov/nuccore/NM_033668?report=GenBank
The most straight forward method is what Gabriel R. has suggested. The alternative, which allows documenting it at the same time, is to do so from the command line via Edirect.
esearch -db nucleotide -query "NM_033668.2" | efetch -format fasta_cds_na > cds_nm033668-2.fasta
You can use a software we have recently published I-PV (http://i-pv.org/). You will need perl and circos installed to be able to use it. It needs an mRNA sequence and a fasta file of your protein sequence ( the one that starts with NP_...). It will automatically generate a protein graph for your CDS. You can than compare each aminoacid and its codon...etc
Click View - DNA in the UCSC Genome Browser, search for your identifiers, click on the transcript and click "Get Protein sequence".
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
if you have the aminoacid sequence, you have the CDS.