Question

Generating Codon Usage

3

Entering edit mode

14.8 years ago

Arvalon ▴ 30

Are there any online services or biopython scripts that will calculate the codon usage average for all sequences in a file not individually for each sequence. I only know python, unix and a little bit of perl.

codon • 12k views

ADD COMMENT • link updated 6.7 years ago by anamaria ▴ 30 • written 14.8 years ago by Arvalon ▴ 30

0

Entering edit mode

Not sure what is meant by "codon usage average". Perhaps you mean "frequency" ?

ADD REPLY • link 14.8 years ago by Neilfws 49k

Ram · Answer 1 · 2010-10-05

The EMBOSS program cusp takes one or more nucleotide sequences as input and outputs codon usage data, looking like this (first few lines):

#CdsCount: 1

#Coding GC 67.79%
#1st letter GC 67.88%
#2nd letter GC 46.89%
#3rd letter GC 88.60%

#Codon AA Fraction Frequency Number
GCA    A     0.077     7.772      3
GCC    A     0.462    46.632     18
GCG    A     0.462    46.632     18
GCT    A     0.000     0.000      0
....

There are a number of EMBOSS servers if you want to run the analysis online.

Ram · Answer 2 · 2010-10-04

3

Entering edit mode

14.8 years ago

Alastair Kerr 5.3k

Not online but CodonW does this.

ADD COMMENT • link 14.8 years ago by Alastair Kerr 5.3k

1

Entering edit mode

Web interface is here.

ADD REPLY • link updated 5.9 years ago by Ram 45k • written 14.8 years ago by Darked89 4.7k

0

Entering edit mode

CodonW is good for PCA.

ADD REPLY • link 11.9 years ago by Naren ★ 1.0k

Ram · Answer 3 · 2010-10-04

3

Entering edit mode

14.8 years ago

Pierre Lindenbaum 166k

For the web services, you can find some services in the BioCatalogue.

You can then run those services with Taverna.

ADD COMMENT • link updated 5.9 years ago by Ram 45k • written 14.8 years ago by Pierre Lindenbaum 166k

score 1 · Answer 4 · 2010-10-04

1

Entering edit mode

14.8 years ago

Larry_Parnell 16k

You could try http://www.bioinformatics.org/sms2/codon_usage.html and their codon usage tool. There is a mechanism to use the tool off-line. See link on the above page. I have no experience with this tool, but you may need to concatenate your individual sequences into one.

ADD COMMENT • link 14.8 years ago by Larry_Parnell 16k

Ram · Answer 5 · 2010-10-04

0

Entering edit mode

14.8 years ago

Fred Fleche 4.3k

http://www.bioinformatics.fr/bioinformatics.php?subsection=Codon%20usage

ADD COMMENT • link updated 5.9 years ago by Ram 45k • written 14.8 years ago by Fred Fleche 4.3k

score 0 · Answer 6 · 2013-09-25

0

Entering edit mode

11.9 years ago

Naren ★ 1.0k

CodonW can concatenate genes to one sequence and then calculates the overall codon usage offline on Windows.

ADD COMMENT • link 11.9 years ago by Naren ★ 1.0k

score 0 · Answer 7 · 2018-11-08

Leaving this here for the future reference... Following up on the concatenation approach, you could make use of the Bioconductor packages in R to do this: concatenate the sequences using Biostrings, and then analyse codon usage with coRdon.

https://bioconductor.org/packages/release/bioc/html/Biostrings.html

https://bioconductor.org/packages/release/bioc/html/coRdon.html