Is there a command line operation to get trace data from a SRA? For instance, with the link below I can click on "analysis" and get the following taxonomic breakdown of the reads. Any way to get actual trace data via command line and outputted as ASCI text?
https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR014739
Unidentified reads: 25.14%
Identified reads: 74.86%
cellular organisms: 74.82%
Eukaryota: 74.77%
Simiiformes: 71.4%
Catarrhini: 64.79%
Hominoidea: 54.22%
Hominidae: 45.46%
Homininae: 38.09%
Homo sapiens: 14.98%
Bacteria: 0.04%
Viruses: 0.04%
Edit: I'm looking for actual trace data, not the human submitted biospecies project data.
What do you mean by actual trace data? This is a NGS dataset there are no traces like Sanger reads.
The taxonomic breakdown you posted above is generated by NCBI using the program mentioned by @vkkodali below. That is not from submitters.
vkkodali can correct me but I doubt NCBI stores the read classification to allow for selective retrieval.
I don't entirely follow. Do you want to know the actual reads instead of just percentage values for each taxonomic group? NCBI does not store that data. May be if you run STAT on your own, there is an option to classify and extract reads in that manner but I am not sure how to do that.