The bioinformatics paper reappears, but the results are significantly different from the original results
1
2
Entering edit mode
8 months ago
yuanj ▴ 40

The bioinformatics paper reappears, but the results are significantly different from the original results. I'm a newbie and just started doing bioinformatics analysis, so there's a lot I don't know. The following is my entire process, but the result is very different from the original article. I would like to ask here, is there something I did not do well or is there something wrong? I hope you seniors can give me some opinions and suggestions!

Here, the analysis of RNA-seq and DAP-seq data from this article is replicated. The data files and code are available in GitHub.

reproducibility • 1.1k views
ADD COMMENT
3
Entering edit mode

Different pipelines will inherently yield different results but what you mean with "very different results" need be to clarified. There is no information on the alignment process used in the article so this is a first discrepancy that could yield some variation with your results. But above all, you applied DESeq2 while they used DEGseq, two packages that rely on different assumptions and statistical methods to identify DEGs, so it is certain it will yield different DEGs. As the process given in the article is impossible to 100% reproduce, are your results still in accordance with the results obtained with your method ? Maybe the number of DEGs is highly different but the main conclusions (top DEGs, main pathways modified) may be the same ?

ADD REPLY
1
Entering edit mode

The joys of trying to replicate reported results. I've never managed to replicate a result 100%. Usually, it's an acceptable level of variation, or down to vague methods sections that makes it impossible to fully understand what parameters they used.

But it sounds like you've tried and gotten something very different. It's impossible to understand since you haven't shown what they've got and what you've got. Talk to your PI, if it's an important paper and others can't replicate their results at all, then they can make the call to contact the journal.

ADD REPLY
1
Entering edit mode

The original article is here and my process is here.I have contacted the corresponding author and hope that the problem can be solved.

ADD REPLY
0
Entering edit mode

I agree with Basti, and reiterate that you need to show how the results differ as we cannot make comment on the size of the difference without assessing it ourselves.

ADD REPLY
0
Entering edit mode

https://stackoverflow.com/questions/78273233/

you did not link between the two so no one knows that you're asking two sets of online volunteers to spend their time on your problem without telling them that you're also asking the other group.

ADD REPLY
1
Entering edit mode
8 months ago
yuanj ▴ 40

Mainly the number of differential genes in RNA-seq analysis and the consistency of sample peaks in DAP-seq experiments are different from the original results.

After communicating with the author of the original article, it was found that this difference may be caused by differences in analysis methods and filtering parameter settings (including differences in the analysis software used). The original article may have had more stringent screening criteria applied. It is a common phenomenon that different analysis methods and screening criteria lead to differences in the number of genes. Nonetheless, the reproduced results still support the discussion in the original article.

I'm a newbie and may not know the best way to ask questions. I'm sorry to take up your time. I will correct some of my practices in work and questioning.

ADD COMMENT

Login before adding your answer.

Traffic: 3021 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6