Sequencing date of RefSeq genomes
6.7 years ago
alyosamah • 0

I have several genomes in GenBank format from the RefSeq database: ftp://ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/

How can I find the sequencing date of a genome, or the date on which it was added to the database, using Python or any other programmatic tool?

ref-seq python biopython sequencing genome • 1.4k views

I think Biopython could help you with this. See link1.

I believe you can find a date in the output of print(record).
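
A minimal sketch of where that date comes from, assuming standard GenBank flat files: the last token of the LOCUS line is a date in DD-MON-YYYY form. As far as I know this is the date the record was last modified, not the date of sequencing. The helper name locus_date and the example line are made up for illustration; with Biopython, the same value should be available as record.annotations["date"] on records parsed in "genbank" format.

```python
import datetime

# Month abbreviations as used in GenBank LOCUS lines (avoids locale issues).
_MONTHS = {name: number for number, name in enumerate(
    "JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC".split(), start=1)}


def locus_date(genbank_lines):
    """Return the date on the first LOCUS line, or None if there is none.

    Hypothetical helper: assumes the date is the last whitespace-separated
    token of the LOCUS line, formatted as DD-MON-YYYY.
    """
    for line in genbank_lines:
        if line.startswith("LOCUS"):
            day, month, year = line.split()[-1].split("-")
            return datetime.date(int(year), _MONTHS[month.upper()], int(day))
    return None


# Made-up LOCUS line, for illustration only (not a real record).
example = [
    "LOCUS       XX_000000            1000 bp    DNA     linear   CON 01-JAN-2020\n"
]
print(locus_date(example))
```

For a real file, pass the open file handle instead of the list, e.g. locus_date(open(path)). Only the first record in a multi-record file is inspected.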

6.7 years ago
Carambakaracho ★ 3.3k

AFAIK, the closest you can get is the release date, which is easiest to retrieve via the NCBI E-utilities API. Biopython gives you a simple interface for querying it, Bio.Entrez, which includes a convenient XML parser. In the XML metadata returned by NCBI, the release date is in <AsmReleaseDate_GenBank> or <AsmReleaseDate_RefSeq>.
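
A rough sketch of that lookup, with one substitution stated plainly: it uses only the Python standard library (urllib and xml.etree.ElementTree) instead of Bio.Entrez, so it runs without Biopython. It assumes the assembly database's esummary output contains the two tags named above; the function names are hypothetical, and the exact XML nesting is not relied on.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/"
DATE_TAGS = ("AsmReleaseDate_GenBank", "AsmReleaseDate_RefSeq")


def _get_xml(endpoint, **params):
    """Fetch one E-utilities endpoint and return the response body as text."""
    url = EUTILS + endpoint + "?" + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


def release_dates(summary_xml):
    """Pull the release-date tags out of an esummary XML document.

    Searches the whole tree for each tag, so it does not depend on how
    deeply the tags are nested. Missing tags map to None.
    """
    root = ET.fromstring(summary_xml)
    dates = {}
    for tag in DATE_TAGS:
        node = next(root.iter(tag), None)
        dates[tag] = node.text if node is not None else None
    return dates


def fetch_release_dates(accession):
    """Look up an assembly accession and return its release dates.

    Performs two network requests: esearch to resolve the accession to a
    UID, then esummary to get the metadata. Returns None if nothing matches.
    """
    search = ET.fromstring(
        _get_xml("esearch.fcgi", db="assembly", term=accession))
    uid = search.findtext("IdList/Id")
    if uid is None:
        return None
    return release_dates(_get_xml("esummary.fcgi", db="assembly", id=uid))
```

Calling fetch_release_dates("GCF_000001215.4") needs network access. For more than a handful of queries, NCBI asks users to throttle requests and identify themselves, which Bio.Entrez helps with via its email setting.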

6.7 years ago
5heikki 11k
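# Download the RefSeq assembly summary table (tab-separated)
wget ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/assembly_summary_refseq.txt
# Print column 15 (the release date) for an exact match on the accession in column 1
awk -F'\t' -v QUERY="GCF_000001215.4" '$1 == QUERY {print $15}' assembly_summary_refseq.txt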
2014/08/01
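
Since the question asks for Python, here is a hedged equivalent of the awk lookup. It assumes assembly_summary_refseq.txt has already been downloaded, that comment and header lines start with "#", and that the date sits in the 15th tab-separated column, as in the command above. The function name seq_rel_date is made up for this sketch.

```python
RELEASE_DATE_COLUMN = 14  # 0-based index of awk's $15


def seq_rel_date(summary_lines, accession):
    """Return column 15 for the row whose first column equals `accession`.

    `summary_lines` is any iterable of tab-separated lines, such as an
    open file handle. Returns None if the accession is not found.
    """
    for line in summary_lines:
        if line.startswith("#"):
            continue  # skip comment and header lines
        fields = line.rstrip("\n").split("\t")
        if fields[0] == accession and len(fields) > RELEASE_DATE_COLUMN:
            return fields[RELEASE_DATE_COLUMN]
    return None
```

Typical use would be something like: with open("assembly_summary_refseq.txt") as handle: print(seq_rel_date(handle, "GCF_000001215.4")). The comparison is exact, so the accession must include its version suffix.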