I would like to extract all information about each SAMPLE
after running the following query (run the query and add a | grep SAMPLE
for clarification on what I mean by SAMPLE
):
esearch -db sra -query PRJNA514750 | efetch -format xml
I tried the following:
esearch -db sra -query PRJNA514750 | efetch -format xml | xtract -pattern EXPERIMENT -element SAMPLE
but this returns nothing (PS: SAMPLE
s are within an EXPERIMENT
tag). I read in the e-utilities guide that -pattern
will divide the data into rows, and -element
into columns, so I'm presuming that this didn't work because SAMPLE
has multiple tags within it. So I then tried:
esearch -db sra -query PRJNA514750 | efetch -format xml | xtract -pattern SAMPLE -element random_SAMPLE_tag
where random_SAMPLE_tag
is any tag within SAMPLE
.
Here's a concrete example:
esearch -db sra -query PRJNA514750 | efetch -format xml | xtract -pattern SAMPLE -element TITLE
This works, but I want to get all the information about each SAMPLE
, and I do not know beforehand what the tags within it are (I manually got TITLE
in this case), and since I want to get this info for a quite a few studies, I can't manually check this.