most sequenced genomes (microbial)
2
I would like to know if there is any way that i can get which microorganisms with the largest number of genomes sequenced and deposited in databases. (NCBI).
i tried to get this information in the ncbi website but I did not succeed
genomes
microbial
• 1.3k views
Get prokaryote genomes summary file from NCBI here .
awk -F '\t' '{print $1}' prokaryotes.txt | sort | uniq -c | sort -k1,1nr > bact
Gets you (truncated for brevity) following. Note: This is only checking on the names as available in the summary file.
8123 Escherichia coli
8055 Streptococcus pneumoniae
4598 Staphylococcus aureus
3722 Mycobacterium tuberculosis
3161 Klebsiella pneumoniae
2492 Pseudomonas aeruginosa
2372 Listeria monocytogenes
2062 Salmonella enterica subsp. enterica serovar Typhi
1923 Acinetobacter baumannii
1351 Salmonella enterica
1210 Neisseria meningitidis
1115 Streptococcus suis
1079 Clostridioides difficile
1039 Shigella sonnei
926 Campylobacter jejuni
863 Bacillus cereus
852 Mycobacteroides abscessus subsp. abscessus
755 Enterococcus faecium
727 Streptococcus agalactiae
722 Campylobacter coli
633 Bordetella pertussis
600 Vibrio parahaemolyticus
556 Salmonella enterica subsp. enterica serovar Typhimurium
553 Enterobacter cloacae
549 Helicobacter pylori
To look for genomes marked as "Complete" use the following variation of the answer.
grep -w "Complete" prokaryotes.txt | awk -F '\t' '{print $1}' - | sort | uniq -c | sort -k1,1nr > compl_genomes
This is the result at the time of writing.
444 Escherichia coli
343 Bordetella pertussis
177 Klebsiella pneumoniae
159 Staphylococcus aureus
127 Mycobacterium tuberculosis
105 Pseudomonas aeruginosa
94 Listeria monocytogenes
88 Campylobacter jejuni
83 Streptococcus agalactiae
82 Acinetobacter baumannii
63 Neisseria meningitidis
57 Corynebacterium pseudotuberculosis
54 Helicobacter pylori
52 Legionella pneumophila
50 Bacillus velezensis
49 Brucella melitensis
47 Burkholderia pseudomallei
Login before adding your answer.
Traffic: 1772 users visited in the last hour
thank you.
I want the microorganisms with more complete genomes deposited in NCBI. Can you help me?
What do you mean by
more complete genomes
?Note: Please use
ADD COMMENT/ADD REPLY
when responding to existing posts to keep threads logically organized.sorry, i want to tell 'the microorganisms with the largest number of complete genomes deposited'
See my second answer below (A: most sequenced genomes ).