Question

Adding a control sample to bulk RNA-seq

0

Entering edit mode

14 months ago

Chris ▴ 340

Hello Biostars,

I have a control with another technical replicate then I try to down load a biological replicate to make the statistics more robust. I looked at the raw count and there is big different between the biological replicate and the two technical replicates. Could I do keep that 3 controls for down stream analysis or I need normalization with the biological replicate before adding to the two technical replicate. The PCA bellow has 3 diseased samples with 2 technical replicates each. The control I add even closer with the diseased than the first control. I appreciate your help! Unfortunately, I don't know anyone else near me who can help. They are just busy with their work. enter image description here

RNA-seq • 1.4k views

ADD COMMENT • link 14 months ago by Chris ▴ 340

3

Entering edit mode

I think you're going in circles. Given that you seem to have little to no local supervision and quite different datasets for an inexperienced bioinformatician why don't you make a post where you describe what data you have, what the overall project goal is (you can describe of course superficially without ever mentioning the exact context for confidential reasons), and where you get stuck. Maybe we can then brainstorm a bit what the steps are to take and which online resources you can read. Right now you shotgun questions but there is a clear roadmap lacking. Lets try the brainstorming things, maybe one can help with that.

ADD REPLY • link 14 months ago by ATpoint 85k

0

Entering edit mode

Hi @atpoint. Thank you so much for the kind suggestion! Is there any better way that I can learn analyze NGS data to screening to find target transcription factors for genetics diseases? I am happy to pay for you as tuition. I have several projects and data for each is different. The current project has only bulk RNA-seq (1 control with technical replicate vs 3 diseased with 2 technical replicates for each sample) to find target for a diseased with known mutation. If I can find anything potential candidate, then the next step maybe doing bulk ATAC-seq or single cell RNA-seq.

ADD REPLY • link 14 months ago by Chris ▴ 340

1

Entering edit mode

There is supposed to be a bigger difference between biological replicates and technical replicates.

What is the additional replicate you "downloaded" and why would it make sense to combine them with your data? I think a batch effect would be a concern. From my view, the PCA sort of make sense since you have separation between your "control" and "disease", but if the sample was processed differently , then maybe its best to leave it out?

ADD REPLY • link 14 months ago by rfran010 ★ 1.3k

0

Entering edit mode

I used 2 control technical replicates, so it is just from one sample, the one I downloaded from another sample so I consider combining them so my control has data from 2 biological samples. The control at the bottom is even closer to disease than the control on the left which makes me concerned. Normally, all RNA-seq data from control and diseased I got in the same folder which I think are sequenced together so adding another new data can have batch effect, so I should not add another control in this case, is that correct?

ADD REPLY • link 14 months ago by Chris ▴ 340

score 5 · Accepted Answer · 2023-09-20

5

Entering edit mode

14 months ago

swbarnes2 14k

A control from a totally different lab is a bad control. There is a limit to what can be done computationally after the fact to compensate for poor planning of the bench work.

The technical replicates are a waste of effort. You would probably get more power by combining the counts together.

ADD COMMENT • link 14 months ago by swbarnes2 14k

0

Entering edit mode

Thank you so much! It is from the same lab but from another project. Would you please explain about combining the counts together a little bit?

ADD REPLY • link 14 months ago by Chris ▴ 340

1

Entering edit mode

If a sample wasn't prepared along side your treated samples, it is a different batch. If you had treated and controls in each of two different batches, you can work with that. But adding one control from a totally different batch is not going to help.

The people who arranged this experiment did not know what they were doing, and they generated a lot of not very meaningful data. The technical replicates clearly add nothing. And you have only one decent control. The experiment needs to be redone from scratch. At least three each of treated and controls. All the extractions done on the same day. All the library preps done on the same day.

As it stands now, no interesting result you might find will be believable, because your controls are useless.

ADD REPLY • link 14 months ago by swbarnes2 14k

0

Entering edit mode

Thank you for the explanation! So just want to confirm one control with replicate is totally useless even can't use as a reference?

ADD REPLY • link 14 months ago by Chris ▴ 340