[out of memory issue] NCBI Entrez protein seq download
1
0
Entering edit mode
2.5 years ago
Alewa ▴ 170

extracting protein sequences from ncbi using entrez in terminal but keep getting error

/home/$USER/edirect/ecommon.sh: xrealloc: cannot allocate 18446744071589123328 bytes (42668032 bytes allocated)

command used

$esearch -db gene -query 'Mycobacterium tuberculosis H37Rv[Organism] AND NC_000962.3[ACCN]' | elink -db gene -target protein | efilter -organism Mycobacterium tuberculosis H37Rv -source refseq | efetch -format protein_fasta > test_elink_tb

thanks

NCBI entrez bash protein • 638 views
ADD COMMENT
2
Entering edit mode
2.5 years ago
GenoMax 147k

Why not use following since you already have the accession of the reference sequence.

$ esearch -db nuccore -query NC_000962.3 | elink -target protein | efetch -format fasta
ADD COMMENT
0
Entering edit mode

Thanks @GenoMax, that works! do you know the memory issue was happening?

ADD REPLY

Login before adding your answer.

Traffic: 2888 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6