headers missing in efetch results for SRA database
1
0
Entering edit mode
2.5 years ago
gernophil ▴ 120

Hey everyone, I am trying to retrieve a SRA table via esearch and efetch, as I did many times before. However, when I do this now, the headers in my table are missing. And I can't find any flag or similar that would make them reappear. Did SRA change anything here?

My command is:

esearch -db bioproject -query [PRJN or SRP] | efetch -format runinfo > file.csv

SRA Entrez.Direct efetch • 1.5k views
ADD COMMENT
0
Entering edit mode

While I have not done this specific search, are you saying that you used to get something like the following with that query?

$ esearch -db sra -query PRJNA253315 | efetch -format runInfo | head -3
SRR1448774,2014-11-12 16:42:10,2014-06-23 16:59:00,59803030,3049954530,0,51,1902,,https://sra-downloadb.be-md.ncbi.nlm.nih.gov/sos5/sra-pub-zq-11/SRR001/448/SRR1448774.sralite.1,SRX620734,,ChIP-Seq,ChIP,GENOMIC,SINGLE,0,0,ILLUMINA,Illumina HiSeq 2000,SRP043510,PRJNA253315,2,253315,SRS645140,SAMN02870079,simple,9606,Homo sapiens,GSM1418957,,,,,,,no,,,,,GEO,SRA172049,,public,8015BDF5C26D74EA2C32420993B7B044,FCC5BDCBF2064FBA9F03892CDB1AE898
SRR1448775,2014-11-12 16:42:10,2014-06-23 17:02:07,63157308,3221022708,0,51,2014,,https://sra-downloadb.be-md.ncbi.nlm.nih.gov/sos5/sra-pub-zq-11/SRR144/8775/SRR1448775.sralite.1,SRX620735,,ChIP-Seq,ChIP,GENOMIC,SINGLE,0,0,ILLUMINA,Illumina HiSeq 2000,SRP043510,PRJNA253315,2,253315,SRS645141,SAMN02870084,simple,9606,Homo sapiens,GSM1418958,,,,,,,no,,,,,GEO,SRA172049,,public,DB43B8469F5A907F2E48F463655F5145,0A5F480C86FF6D3577D1AFD53CECE112
ADD REPLY
0
Entering edit mode

Yes, but with headers like

Run,ReleaseDate,LoadDate,spots,bases,spots_with_mates,avgLength,size_MB,AssemblyName,download_path,Experiment,LibraryName,...

They were included in the exact same command, but since about two weaks the headers are missing somehow. That makes it really hard to identify the columns :).

ADD REPLY
0
Entering edit mode

Please report this to NCBI help desk. Be aware that it may take them up to 48h to respond at times. Looks like the -format runInfo header is not working (even in case of search I posted above).

ADD REPLY
1
Entering edit mode
2.5 years ago
PeterC_NCBI ▴ 520

Thanks for pointing out this problem with the headers. We are working on a fix. Hope to have it corrected soon.

ADD COMMENT
0
Entering edit mode

Looks like they are back. However, the headers are tab separated now, but the rest of the output is comma separated :).

ADD REPLY
0
Entering edit mode

Maybe it's helpful for you to name my NCBI case number here so you can link the cases: #CAS-921910-D4T1M8

ADD REPLY
1
Entering edit mode

Thank you! We see the problem too. I've passed this along to be fixed.

ADD REPLY

Login before adding your answer.

Traffic: 1849 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6