Computing Number Of Nucleotide And Genes On Each Chromosome
1
0
Entering edit mode
11.6 years ago
Dataminer ★ 2.8k

Hi!

I am interested in knowing number of nucleotides and genes on each chromosome for hg18 and hg19.

Any suggestions or links to get these numbers?

Thank you

chip-seq ucsc mysql • 3.8k views
ADD COMMENT
6
Entering edit mode
11.6 years ago

Let's see if for once I can beat Pierre in posting a Mysql answer!! :-)

Remember that the human genome is not completely sequenced, as many repetitive and low complexity regions can not be sequenced with the current technologies. Moreover, hg18 and hg19 are the reference genomes, but it is likely that there is variability in the number of nucleotides between individuals.

Count the number of genes in hg18 and hg19, grouped by chromosome:

mysql -h genome-mysql.cse.ucsc.edu -u genome -D hg18 -N -A -e 'select chrom, count(*) from knownGene GROUP BY chrom'
mysql -h genome-mysql.cse.ucsc.edu -u genome -D hg19 -N -A -e 'select chrom, count(*) from knownGene GROUP BY chrom'

I recommend you to get the counts grouped by chromosomes, so you will be able to see which chromosomes you want to include.

For the number of nucleotides, maybe you can use the tool fetchChromSizes from the UCSC utils. http://hgdownload.cse.ucsc.edu/admin/exe/ , or use the chromInfo table:

mysql -h genome-mysql.cse.ucsc.edu -u genome -D hg18 -N -A -e 'select chrom,size from chromInfo'

However, this won't take into account centromers and regions with no coverage.

ADD COMMENT
3
Entering edit mode

That's not fair ! I was attending a meeting ! :-)

ADD REPLY
7
Entering edit mode

everything is fair in biostar and in bioinformatics :-)

ADD REPLY

Login before adding your answer.

Traffic: 2804 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6