I need a server or a software to calculate nucleotide composition of many sequences Simultaneously .can you help me?(except CAIcal server)
Please ask me at once.
Thank you
I need a server or a software to calculate nucleotide composition of many sequences Simultaneously .can you help me?(except CAIcal server)
Please ask me at once.
Thank you
if the number of sequences is not too big, you can save and open the following HTML page (it uses the HTML5 file-API to load the data):
EDIT:: i've updated my code. It handles multiple FASTA files.
original post: http://plindenbaum.blogspot.fr/2011/04/playing-with-html5-file-api-translating.html ( dna -> protein )
Easy command line tool (faCount) from UCSC (download page for linux- http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/) will give you nucleotide composition of each sequence or can give you a summary of total nt content for the input fasta.
(faSize is another handy one to summarize the number of reads and total number of nt)
Simply type command:
grep -o "A" * | wc -l
You can also use any k-mer counter and set k=1. My own python k-mer counter is here (needs khmer and screed python libraries).
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
when you say nucleotide composition, what do you mean exactly. Plus it is important to understand the scope of the problem. How many sequences do you need to process and how long is the longest?
Solutions could be as simple as writing something like:
If your sequences are longer than say a ten million bases or you have tens of millions of sequences to process then it needs to be a little more sophisticated.