Entering edit mode
7.2 years ago
arash.iranzadeh1980
▴
30
Hi,
I am interested to know how biological databases have been distributed worldwide? I think most of them are located in US and Europe. Am I right? Any other important country involved?
@Pierre, I am using your xsltstream to parse a xml i have downloaded using ncbi eutils. I have modified the above xls file(biostar270498.xsl) to get my desired output (Title and Abstract text). It works fine but after certain entires(~500) it throws a error. Can you please see if i have done anything wrong in the xsl or while using your tool.
Usage:
cat ~/Downloads/test_e_renal_kidney.xml | java -jar dist/xsltstream.jar -t ~/Downloads/test_e_kid_renal.xsl -n PubmedArticle
xsl file:
The error i am getting after ~500 so output:
PS i am still working to get abstract using the above xls
Thank you for your time and thank you for your tool.
what is the output of
xmllint --stream --noout ~/Downloads/test_e_renal_kidney.xml
?
@Pierre Thank you for your response,
Output is:
Now i see what is the problem, its in the xml file, it has version line multiple times.
grep -c '?xml version=' ~/Downloads/test_e_renal_kidney.xml 1426
Any way i can parse this xml file to get title and abstract becase it very large file(7.9gb)
Thank you for your time