Where To Find The Genome Sizes Of Sequenced Plant Species?
7
7
Entering edit mode
14.7 years ago
Zhaorong ★ 1.4k

I thought this is a simple question, but after some searching over the Internet, I just found several out-of-date databases like http://data.kew.org/cvalues/

There are several useful resources though:

But I did not find any direct information on genome size.

A workaround would be to download all the fasta files of chromosome sequences of these species and just look at the file size..

I'm just wondering is there any resources on plant genome size, or if not, an efficient way to get this information?

Thank you!

plant-genome • 9.0k views
ADD COMMENT
0
Entering edit mode

It's important to stress that most eukaryotic genome size estimates are based on total DNA weight per cell. So, there plenty more estimates besides those in NCBI and related.

ADD REPLY
4
Entering edit mode
14.7 years ago

From NCBI genome projects: http://www.ncbi.nlm.nih.gov/sites/entrez?Db=genomeprj

Clink on "Plant"

Each record contains the genome size (if available)

I would also suggest to have a the field gp:EstimatedGenomeSize/ in ftp://ftp.ncbi.nih.gov/genomes/genomeprj/gp.xml but it seems that not all the organisms are present in that file...

ADD COMMENT
2
Entering edit mode
14.7 years ago

Hi Zhaorong,

Genome size or C-value as commonly used by biologists is a hard to find information. For instance, most biomolecular databases do not have this information readily available. NCBI has only a few hundred estimates on plant genomes. But, as of 2005 there are at least tens of thousands high quality estimates.

A good place is Plant C-value database in the Royal Botanic Gardens. But, be advised, I do not know database layout. So, it might not be possible to programmatically query the entries.

Anyway, for more information you can use T. Gregory 2004 book (The Evolution of the Genome) and check Michael Lynch's masterpiece (The Origins of Genome Architecture). Both bring a wealth of information on genome size, motif (TE, TFBS, etc.) contents, mutation rates, effective population size and other very useful estimates.

ADD COMMENT
0
Entering edit mode

I'll definitely read the books. Thank you :)

ADD REPLY
2
Entering edit mode
14.7 years ago

It's not comprehensive, but there are a few listed in the BioNumbers database

ADD COMMENT
0
Entering edit mode

Thank you for pointing it out! The bionumbers database is a cool idea.

ADD REPLY
1
Entering edit mode
14.7 years ago
Darked89 4.7k

You may try searching "Genome Projects" at NCBI with "txid33090[Organism:exp]" (green plants).

It is a crappy solution, since some entries seem to be duplicated, plastid/ chromosomal/ESTs projects are listed among WGS projects, but at least you should be able to extract info what is being sequenced and where. Individual genome centers/sequencing projects web pages may be the best one can get.

ADD COMMENT
0
Entering edit mode

Thank you! This is what I end up with.

ADD REPLY
0
Entering edit mode
14.7 years ago

Did you try to fill out the query form available at http://data.kew.org/cvalues/CvalServlet?querytype=1

It should allow you to get the statistics you are looking for

ADD COMMENT
0
Entering edit mode
9.1 years ago

You may try:

http://www.ncbi.nlm.nih.gov/genome/

Then type your organism name, and it will direct you to the available genome information about your organism.

ADD COMMENT
0
Entering edit mode
9.1 years ago

NCBI Mapviewer will provide you with many plant genomes with the inclusion of statistic, number of chromosomes, fasta files, genbank files, cross references to other useful databases and tools of interest, graphic maps and in some of them even more utilities such as specific Blast searches and so on

ADD COMMENT

Login before adding your answer.

Traffic: 1908 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6