This is not strictly bioinformatics but I am not sure where this would fit. Anyway, close this question if it is misplaced.
I have a long list of PMIDs, I would like to generate a BiBTeX file from this. How could I go about doing this?
This is not strictly bioinformatics but I am not sure where this would fit. Anyway, close this question if it is misplaced.
I have a long list of PMIDs, I would like to generate a BiBTeX file from this. How could I go about doing this?
Use ncbi efetch to download your papers as XML and use a XSLT stylesheet like pubmed2bibtex.xsl ( http://svn.gna.org/viewcvs/kbibtex/trunk/xslt/ ) to transform it to bibtex.
I've used this method to compile a recent TeX paper: see http://code.google.com/p/knime4bio/source/browse/trunk/paper20110804/Makefile
This is very easy using Bio::PubMed and Bio::MEDLINE from BioRuby.
Chris Miller has posted example code which loops through a list of PMIDs and returns citations in BibTeX.
FWIW, I tried this method today and it no longer works. I'm not sure whether the package or the API changed, but I ended up using the following solution instead:
Yes it seems a few things have changed.
Bio::PubMed::query()
no longer works (returns nil)Bio::PubMed::efetch()
returns an array; elements can be converted to MEDLINE objectsThis works for me:
require 'bio'
id = "18265351"
Bio::NCBI.default_email = "me@me.com"
e = Bio::PubMed::efetch(id)
m = Bio::MEDLINE.new(e[0])
bib = m.reference.format("bibtex")
# => "@article{PMID:18265351,\n author = {Brown, T. and Mackey, K. and Du, T.},\n title = {Analysis of RNA by northern and slot blot hybridization.},\n journal = {Curr Protoc Mol Biol},\n year = {2004},\n volume = {Chapter 4},\n pages = {Unit 4.9},\n url = {http://www.ncbi.nlm.nih.gov/pubmed/18265351},\n}\n"
jabref uses bibtex as standard storage format of references. You can directly download pubmed references from within jabref.
Mekentosj Papers also supports bibtex exports.
I did something like this a while back and blogged on it: http://pathogenomics.bham.ac.uk/blog/2011/03/creating-my-perfect-citation-system-using-latex/
The Python code is at: https://github.com/nickloman/latex-pubmed
Feel free to fork it and improve it (and send me a pull request)
I just wrote a Python 3 solution here:
import requests | |
import xml.etree.ElementTree as ET | |
import sys | |
import calendar | |
# Parse PubMed IDs from the command line. | |
pmids = sys.argv[1:] | |
## Fetch XML data from Entrez. | |
efetch = 'https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi' | |
r = requests.get( | |
'{}?db=pubmed&id={}&rettype=abstract'.format(efetch, ','.join(pmids))) | |
##print(r.text) | |
## Loop over the PubMed IDs and parse the XML. | |
root = ET.fromstring(r.text) | |
for PubmedArticle in root.iter('PubmedArticle'): | |
PMID = PubmedArticle.find('./MedlineCitation/PMID') | |
ISSN = PubmedArticle.find('./MedlineCitation/Article/Journal/ISSN') | |
Volume = PubmedArticle.find('./MedlineCitation/Article/Journal/JournalIssue/Volume') | |
Issue = PubmedArticle.find('./MedlineCitation/Article/Journal/JournalIssue/Issue') | |
Year = PubmedArticle.find('./MedlineCitation/Article/Journal/JournalIssue/PubDate/Year') | |
Month = PubmedArticle.find('./MedlineCitation/Article/Journal/JournalIssue/PubDate/Month') | |
## Year = PubmedArticle.find('./MedlineCitation/Article/ArticleDate/Year') | |
## Month = PubmedArticle.find('./MedlineCitation/Article/ArticleDate/Month') | |
Title = PubmedArticle.find('./MedlineCitation/Article/Journal/Title') | |
ArticleTitle = PubmedArticle.find('./MedlineCitation/Article/ArticleTitle') | |
MedlinePgn = PubmedArticle.find('./MedlineCitation/Article/Pagination/MedlinePgn') | |
Abstract = PubmedArticle.find('./MedlineCitation/Article/Abstract/AbstractText') | |
authors = [] | |
for Author in PubmedArticle.iter('Author'): | |
try: | |
LastName = Author.find('LastName').text | |
ForeName = Author.find('ForeName').text | |
except AttributeError: # e.g. CollectiveName | |
continue | |
authors.append('{}, {}'.format(LastName, ForeName)) | |
## Use InvestigatorList instead of AuthorList | |
if len(authors) == 0: | |
## './MedlineCitation/Article/Journal/InvestigatorList' | |
for Investigator in PubmedArticle.iter('Investigator'): | |
try: | |
LastName = Investigator.find('LastName').text | |
ForeName = Investigator.find('ForeName').text | |
except AttributeError: # e.g. CollectiveName | |
continue | |
authors.append('{}, {}'.format(LastName, ForeName)) | |
if Year is None: | |
_ = PubmedArticle.find('./MedlineCitation/Article/Journal/JournalIssue/PubDate/MedlineDate') | |
Year = _.text[:4] | |
Month = '{:02d}'.format(list(calendar.month_abbr).index(_.text[5:8])) | |
else: | |
Year = Year.text | |
if Month is not None: | |
Month = Month.text | |
try: | |
for _ in (PMID.text, Volume.text, Title.text, ArticleTitle.text, MedlinePgn.text, Abstract.text, ''.join(authors)): | |
## assert '"' not in _, _ | |
if _ is None: | |
continue | |
assert '{' not in _, _ | |
assert '}' not in _, _ | |
except AttributeError: | |
pass | |
## Print the bibtex formatted output. | |
try: | |
print('@Article{{{}{}pmid{},'.format( | |
authors[0].split(',')[0], Year, PMID.text)) | |
except IndexError: | |
print('IndexError', pmids, file=sys.stderr, flush=True) | |
except AttributeError: | |
print('AttributeError', pmids, file=sys.stderr, flush=True) | |
print(' Author="{}",'.format(' AND '.join(authors))) | |
print(' Title={{{}}},'.format(ArticleTitle.text)) | |
print(' Journal={{{}}},'.format(Title.text)) | |
print(' Year={{{}}},'.format(Year)) | |
if Volume is not None: | |
print(' Volume={{{}}},'.format(Volume.text)) | |
if Issue is not None: | |
print(' Number={{{}}},'.format(Issue.text)) | |
if MedlinePgn is not None: | |
print(' Pages={{{}}},'.format(MedlinePgn.text)) | |
if Month is not None: | |
print(' Month={{{}}},'.format(Month)) | |
## print(' Abstract={{{}}},'.format(Abstract.text)) | |
print(' ISSN={{{}}},'.format(ISSN.text)) | |
print('}') |
I use BibDesk, for Macs that let me do it. I looked at my bookmark and found this page with many tools to convert to bib files. You enter all your PMID in pubmed, export to xml and then convert xml to bib. Mendley Should let you do it as well somehow. I did it once to play around and it looked smooth. Most bibliogaphic tools should let you export to RIS and then from there convert to bibtex.
I normally use a combination of ris2xml and xml2bib (bibutils) similar to Stefano Berri description. If there is no easy ris export I use TexMed from http://www.bioinformatics.org, a website where you can enter a search-query and export the results as bibtex, the search-query can also be a list of pmids. Unfortunately the export needs to be taken with some caution because the tool might include unnecessary curly braces which you then have to remove.
Hi there,
You can also use:
online to achieve this. Just enter a pubmed PMID and it does the work for you.
HTH,
C
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Thanks! Excellent ways to do this. I wish I could mark more than one as correct.
Not a direct solution if you only have PMID's, but if you save your papers in Zotero it is easy to export a BiBTeX format list of citations.