When I send results out to my colleagues, I'm a bit worried that any subsequent work on the project may change those results - particularly when those results are being followed up with benchwork. "Hi, could you send me the top 10 hits from experiment XXX?" Yeah, no problem. But can I send the same ones again once I've refactored this bit of my script, or added some seemingly independent feature to my program?
At present I don't do any consistency checks on my results files, and I was wondering what approaches are used _out there_.
Do you, for example, md5sum every results file and raise a note when those values change (roughly like the sketch below)?
Do you keep a continually updated results silo on Dropbox or similar, and let your colleagues pull results from there?
Do you diff & log before updating any results file?
I'm happy for results to change, and feel that this is an inevitable part of an evolving research project. What I'd like to be aware of is when things change unexpectedly (after refactoring my code or updating my packages / environment), and when results I've previously sent to colleagues have changed due to altered requirements, bug fixes, etc.
Right, but all of that (software versions and such) should be fixed for a given project.
I'm not sure I agree. Certainly the analysis code / environments / dependencies will be static once the project is mothballed, but during its active development these things will necessarily change.
In my experience, software versions rarely if ever change during the course of a project. Otherwise it's a pain to keep track of which version produced which result.
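That said, if a version bump really is unavoidable mid-project, one lightweight way to keep track of which environment produced which result is to drop a small provenance file next to each results file. A rough sketch, where the `.provenance.json` naming convention and the example path are purely illustrative:

```python
# Sketch: record the Python and package versions that produced a results file.
import json
import sys
from datetime import datetime, timezone
from importlib import metadata
from pathlib import Path

def write_provenance(results_file: str) -> None:
    """Write <results_file>.provenance.json capturing the current environment."""
    record = {
        "results_file": results_file,
        "written": datetime.now(timezone.utc).isoformat(),
        "python": sys.version,
        # every installed distribution and its version at the time of writing
        "packages": {d.metadata["Name"]: d.version for d in metadata.distributions()},
    }
    Path(results_file + ".provenance.json").write_text(json.dumps(record, indent=2))

if __name__ == "__main__":
    # example path only; point this at whatever file you are about to send out
    write_provenance("results/top10_hits.tsv")
```

Pinning the environment (conda env export, requirements.txt, or similar) gets you most of the way; the per-file record just makes it obvious later which snapshot a given set of hits came from.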