Reproduce research paper's findings as a data scientist to implement code on top of it

0

Entering edit mode

2.6 years ago

d • 0

I would like to know the best approach to reproduce a research paper's findings (such as a cancer research paper) as a data scientist, in order to implement a machine learning prediction on the finding's data. Considering I will find different data from the research paper and reproduce the findings.

Thank you

data-science python machine-learning • 817 views

ADD COMMENT • link updated 2.6 years ago by Ram 45k • written 2.6 years ago by d • 0

2

Entering edit mode

I am not sure what you are asking. If you can obtain the data and then follow the analysis published in associated paper you should get some what of a "close" result assuming there were no major errors in analysis. Unless you had exact code for the workflow this may be the best you can do in terms of "reproducing" the analysis since some slight differences may always creep in.

ADD REPLY • link 2.6 years ago by GenoMax 152k

0

Entering edit mode

That having said, in absence of a published containerized workflow (almost never the case) you have to read the method sections, start either from raw data at GEO (if it is NGS) or processed data they may provide, and then simply try to stick as close to what they did in the methods as you can.

ADD REPLY • link 2.6 years ago by ATpoint 88k

Login before adding your answer.