How to batch download multiple mitochondrial FASTA & GFF files?
1
0
Entering edit mode
2.2 years ago
YOUSEUFS ▴ 30

Downloading 1 by 1 is taking an eternity.

I have a refseq_mito.txt file with 115 RefSeq accession numbers for mitochondrial genomes (examples below)

refseq_mito.txt contents example:

NC_019663.1
NC_023340.1
NC_019642.1
NC_005291.1
NC_023371.1

I've tried to input this refseq_mito.txt into Batch Entrez (Database: Genome) which sends me to this page:

Entrez Batch Output

which sends me to a page showing 10 tick genome NOT mitochondria:

Entrez Batch link output

What am I doing wrong? How can I get FASTA & GFF of the mitochondria I'm interested in?

ncbi genomics mitochondria genomes entrez • 1.2k views
ADD COMMENT
0
Entering edit mode

Have you tried clicking the link that says "Retrieve records for 110 UIDs"? That is what you probably want.

GFF files for mitochondrial genomes are not directly available. You have to select a genome e.g. https://www.ncbi.nlm.nih.gov/nuccore/NC_011943.1 and then click on Send to -> File -> Format -> GFF3

ADD REPLY
0
Entering edit mode

The "Retrieve records for 110 UIDs" just sends me to the 10 tick genomes page (Shown above, second picture) with no mention of mitochondria.

I can individually click on the mitochondrial genomes to manually download both FASTA and GFF but this will be time consuming for over 100 mitochondrial genomes.

ADD REPLY
0
Entering edit mode

Can you post a few example accession numbers?

ADD REPLY
0
Entering edit mode

Updated Post

ADD REPLY
1
Entering edit mode
2.2 years ago
GenoMax 147k

When I tried your example ID's it got me the correct results. Once that works downloading the data should follow Send to -> File -> Format -> GFF3 and Send to -> File -> Format -> FASTA. You will get a single file with sequence/GFF data for all organisms. You will need to split them into individual files.

Batch

ADD COMMENT
0
Entering edit mode

what database did you select?

ADD REPLY
1
Entering edit mode

Nucleotide.

ADD REPLY
0
Entering edit mode

This is where I was going wrong, I've been trying "Genome" and "Assembly". Many thanks for helping with this!

ADD REPLY

Login before adding your answer.

Traffic: 2551 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6