How To Download .gbk Files From NCBI Website
13.0 years ago
Arun • 0

How do I download .gbk files from the NCBI website?

ncbi genbank • 33k views

FYI (for you and anyone else): the NCBI website has extensive help documents, linked right from its front page. For example, the handbook (http://www.ncbi.nlm.nih.gov/books/NBK3831/) contains help on the Entrez databases (http://www.ncbi.nlm.nih.gov/books/NBK3836/), where the topic "Displaying and Saving a Set of Records" answers your very question.


This is a much older question, but I am having the same issue and have followed the posted answers. NCBI seems to only be providing me with GenBank flat format files not regular gb/gbk, which I need for annotations when importing into MauveAligner.

Please advise.


"NCBI seems to only be providing me with GenBank flat format files not regular gb/gbk"

What does that mean? A GenBank format file will contain annotation but will still be plain text.

For example, using Entrez Direct (output truncated for space):

$ efetch -db nuccore -id NM_001123229 -format gb
LOCUS       NM_001123229            4047 bp    mRNA    linear   VRT 19-SEP-2021
DEFINITION  Danio rerio insulin receptor b (insrb), mRNA.
ACCESSION   NM_001123229 XM_685977
VERSION     NM_001123229.1
KEYWORDS    RefSeq.
SOURCE      Danio rerio (zebrafish)
  ORGANISM  Danio rerio
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Actinopterygii; Neopterygii; Teleostei; Ostariophysi;
            Cypriniformes; Danionidae; Danioninae; Danio.
REFERENCE   1  (bases 1 to 4047)
  AUTHORS   Helal M, Yan C and Gong Z.
  TITLE     Stimulation of hepatocarcinogenesis by activated cholangiocytes via
            Il17a/f1 pathway in kras transgenic zebrafish model
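
Since a GenBank file is plain text whatever its extension, a quick sanity check can confirm that a download really is in GenBank flat file format before handing it to another tool. The sketch below is only illustrative: the function name looks_like_genbank and the file names are made up here, and it assumes a single-record file whose first line starts with LOCUS and which contains the `//` record terminator.

```shell
# Return success if the file looks like a GenBank flat file:
# the first line starts with LOCUS and some line is exactly "//".
# (looks_like_genbank is a hypothetical helper, not part of any NCBI tool.)
looks_like_genbank() {
  head -n 1 "$1" | grep -q '^LOCUS' && grep -q '^//$' "$1"
}

# Example usage (file names are placeholders, left commented out):
# looks_like_genbank data.gb && cp data.gb data.gbk
```

If the check passes, copying the file to a .gbk name changes nothing in its content; whether a given importer insists on that extension is an assumption worth verifying against that tool's documentation.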

The best way to get GenBank files in 2021 is with bio:

pip install bio --upgrade

then

bio fetch NM_001123229 > data.gb

Nothing needs to be configured; bio will figure it all out. If you want the GFF file, run:

bio fetch NM_001123229 --format gff > data.gff

Then use the rest of bio to manipulate the file to your heart's content.


When the GenBank record contains more data (for example, a large genome record), the format has to be gbwithparts; otherwise only a skeleton file with top-level annotation is returned.
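
As a rough sketch of requesting that format over plain HTTP, the function below builds an E-utilities efetch URL with rettype=gbwithparts as the default. The function name efetch_url and the output file name are made up for illustration; the accession is simply the one used earlier in this thread. With Entrez Direct, the equivalent would presumably be passing -format gbwithparts to the efetch command shown above.

```shell
# Build an E-utilities efetch URL for a nuccore record.
# Usage: efetch_url ACCESSION [RETTYPE]
# RETTYPE defaults to gbwithparts (full record); pass gb for the short form.
# (efetch_url is a hypothetical helper name, not an NCBI command.)
efetch_url() {
  printf 'https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nuccore&id=%s&rettype=%s&retmode=text\n' \
    "$1" "${2:-gbwithparts}"
}

# Example download (needs network access, so it is left commented out):
# curl -s "$(efetch_url NM_001123229)" > NM_001123229.gbk
```

Keeping the URL construction in its own function makes it easy to switch between gb and gbwithparts and compare what comes back for the same accession.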


Hi, why did you delete this post, especially after it was replied to?


I thought I had deleted it before anyone replied. I had figured it out, although it was not intuitive.


You could just add what you figured out, especially if it's not intuitive. Other people (including your future self) could benefit from the time you spent addressing this problem.

13.0 years ago

Search for the sequence that you want. The "Display Settings" link in the upper left-hand corner lets you display the entry in various formats; choose GenBank. The "Send to" button in the upper right-hand corner then lets you send the entry to a file and download it in GenBank format.


No other answer required :)
