Question

Publication Text Mining

2

12.9 years ago

Jelena Aleksic ▴ 920

Hi guys,

I'm a total newbie to text mining, and I'm interested in the following. Say I'm using something like FlyMine, where I get enrichment results for publications, and I end up with a large number of them (over 1000). I have the publication info (title, authors, journal) and the PubMed IDs for all of them. Is there a text-mining tool I can import them into in a straightforward way (e.g. by uploading the PubMed IDs) that will give me an idea of their contents?

Any suggestions would be appreciated :)

Jelena

text biology • 3.2k views

ADD COMMENT • link updated 12.9 years ago by Andrew Su 4.9k • written 12.9 years ago by Jelena Aleksic ▴ 920
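
For anyone starting from a list of PubMed IDs like this, here is a minimal sketch of pulling the abstracts as plain text through the NCBI E-utilities `efetch` endpoint. It is only one possible approach, not something proposed in the thread. The batch size of 200 and the helper names (`chunked`, `build_efetch_params`, `fetch_abstracts`) are assumptions made for illustration, and the pause between requests is there to stay under NCBI's limit of three requests per second without an API key.

```python
import time
from urllib.parse import urlencode
from urllib.request import urlopen

EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def chunked(items, size):
    """Yield successive lists of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_efetch_params(pmids):
    """Return the efetch query parameters for one batch of PubMed IDs."""
    return {
        "db": "pubmed",
        "id": ",".join(str(pmid) for pmid in pmids),
        "rettype": "abstract",
        "retmode": "text",
    }


def fetch_abstracts(pmids, batch_size=200, pause=0.4):
    """Fetch plain-text abstracts, one HTTP request per batch of IDs.

    batch_size and pause are illustrative values, not NCBI requirements.
    """
    texts = []
    for batch in chunked(list(pmids), batch_size):
        # Sending the parameters as POST data keeps long ID lists out of the URL.
        data = urlencode(build_efetch_params(batch)).encode("ascii")
        with urlopen(EFETCH_URL, data=data) as response:
            texts.append(response.read().decode("utf-8"))
        time.sleep(pause)  # stay below three requests per second
    return "\n".join(texts)
```

Calling `fetch_abstracts(my_pmids)` with your own ID list returns one long string of citation-plus-abstract records, which you would still need to split per article before any further analysis.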

What do you mean by "an idea of their contents"? Do you mean retrieving the publication abstracts?

Having said that, you could use the abstracts to carry out an association analysis of the ontology terms mentioned in them. That gives you a network which you can import into Cytoscape, Gephi, etc., and then apply a smart layout algorithm to it. You will then immediately see clusters forming that give you an impression of which terms tend to be mentioned together.

ADD REPLY • link 12.9 years ago by Joachim ★ 2.9k

Something like your latter suggestion sounds like what I'm looking for. How would you do the association analysis? Also, I'm not sure how you would apply a smart layout algorithm?

ADD REPLY • link 12.9 years ago by Jelena Aleksic ▴ 920
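
One rough way to do the association analysis discussed above is to count how often two terms are mentioned in the same abstract and write the counts out as a weighted edge list. The sketch below assumes you already have the abstracts as a list of strings and a hand-picked vocabulary of terms (for example ontology term names); the function names and the `Source`/`Target`/`Weight` column headers are choices made for this example, not a fixed format.

```python
import csv
import re
from collections import Counter
from itertools import combinations


def find_terms(abstract, vocabulary):
    """Return the set of vocabulary terms mentioned in one abstract."""
    text = abstract.lower()
    found = set()
    for term in vocabulary:
        # Whole-word match, so that "eye" does not match "eyebrow".
        if re.search(r"\b" + re.escape(term.lower()) + r"\b", text):
            found.add(term)
    return found


def cooccurrence_edges(abstracts, vocabulary, min_count=1):
    """Count the abstracts in which each pair of terms appears together."""
    counts = Counter()
    for abstract in abstracts:
        terms = sorted(find_terms(abstract, vocabulary))
        counts.update(combinations(terms, 2))
    # Drop rare pairs to keep the network readable.
    return {pair: n for pair, n in counts.items() if n >= min_count}


def write_edge_list(edges, path):
    """Write the edges as a CSV table with one row per term pair."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["Source", "Target", "Weight"])
        for (source, target), weight in sorted(edges.items()):
            writer.writerow([source, target, weight])
```

The resulting CSV can be loaded as an edge table in a network tool. As for the layout, a force-directed layout is one common choice: it tends to pull terms that co-occur often closer together, which is what makes the clusters visible.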

score 3 · Answer 1 · 2012-06-12

12.9 years ago

Andrew Su 4.9k

A while back, I created Pubmed2Wordle (app and repo) as a crude way to go from a PubMed search to a tag cloud (and also an exercise to learn Google App Engine). Just to emphasize, it's incredibly crude (and rather buggy as well)...

ADD COMMENT • link 12.9 years ago by Andrew Su 4.9k
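
The basic idea behind a tag cloud can be sketched as a word-frequency count over the abstracts. To be clear, this is not how Pubmed2Wordle is implemented (I have not looked at its code); it is only a minimal illustration, and the stopword list, the `min_length` cutoff and the function name are assumptions for the example.

```python
import re
from collections import Counter

# A deliberately tiny stopword list; a real one would be much longer.
STOPWORDS = {
    "the", "and", "for", "with", "that", "this", "are", "was", "were",
    "from", "which", "have", "has", "been", "these", "not", "but",
}


def word_frequencies(texts, top_n=50, min_length=3):
    """Return the top_n most frequent words across all texts."""
    counts = Counter()
    for text in texts:
        # Words start with a letter and may contain digits or hyphens (e.g. "wnt-1").
        for word in re.findall(r"[a-z][a-z0-9-]*", text.lower()):
            if len(word) >= min_length and word not in STOPWORDS:
                counts[word] += 1
    return counts.most_common(top_n)
```

The `(word, count)` pairs it returns can be pasted into any word-cloud generator, or just read as a ranked list.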

Ooh, that's an awesome app :) I ran it with a gene name, and it's a nice way of getting a first impression of its function.

ADD REPLY • link 12.9 years ago by Jelena Aleksic ▴ 920

score 2 · Answer 2 · 2012-06-12

12.9 years ago

ff.cc.cc ★ 1.3k

Have a look at ProteinQuest. I'm using it with a trial key and it works well.

Another suggestion is the iHOP database (just for data mining, not for data retrieval).

ADD COMMENT • link 12.9 years ago by ff.cc.cc ★ 1.3k

ProteinQuest looks awesome, thanks for the link. I had no idea it existed.

I've used iHOP before for looking at individual genes, but I'm not sure how I would extend it to custom lists of publications (there might be a way I don't know about).

ADD REPLY • link 12.9 years ago by Jelena Aleksic ▴ 920

Hi, I just discovered a useful service from UK PubMed Central that helps to focus on articles with a specific content or biological relationship: http://labs.ukpmc.ac.uk/evf. Unfortunately, it only works on the open-access subset of PubMed (2 million+ articles), but it does seem to search deep into the full-text database.

ADD REPLY • link 12.9 years ago by ff.cc.cc ★ 1.3k