Hi everyone,
I have one CSV file that contains the delta Ct values for several samples (columns), with genes as row names. The samples come from three groups: Grp A, Grp B and Grp C. Samples in Groups A and B start with a baseline disease condition, while Group C samples are healthy. My data columns look something like this:
Gene GrpA1_baseline GrpA1_flare GrpA2_baseline GrpA2_flare GrpB1_baseline GrpB2_baseline GrpC1.1_healthy GrpC1.2_healthy GrpC2.1_healthy GrpC2.2_healthy
I have different conditions within the Group A biological replicates (meaning samples), only the baseline condition for the Group B biological replicates, and 2 technical replicates for each sample in Group C. I was wondering how I could create a new qPCRset object using the HTqPCR package in R. I understand that I could use the readCtData function, but I am unsure how to specify and handle biological and technical replicates, or how to read all of this in from one file.
The ultimate goal is to find differentially expressed genes between groups (for example, GrpA_flare vs GrpC_healthy). As I am completely new to analyzing Ct values, any advice at all about how I could go about doing this would be greatly appreciated. Thank you.
Hi Kevin, Thank you for replying, that was useful! I cannot even seem to input my data to create a qPCRset object. This is what my data frame of delta Ct values looks like:
I have 347 genes under the column name ID_REF and 337 samples that include Groups A, B and C. I tried the following to create a qPCRset object:
but I see the following error:
Would you happen to know what I might be doing wrong?
Yes, that is because readCtData expects a list of qPCR files as input, not a data matrix of values.
You may have to create it manually, like we sometimes had to do with microarray data, such as:
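A minimal sketch of the first step, assuming the CSV has a gene column named ID_REF followed by one delta Ct column per sample. The toy values and the gene names GeneX, GeneY and GeneZ are made up for illustration; only base R is used here.

```r
# Toy stand-in for the real CSV (hypothetical values; ID_REF is the gene column).
csv_text <- "ID_REF,GrpA1_baseline,GrpA1_flare,GrpC1.1_healthy
GeneX,5.2,4.1,6.3
GeneY,7.8,8.0,7.5
GeneZ,3.3,2.9,3.1"

# For the real file, pass the path to the CSV instead of text = csv_text.
# check.names = FALSE keeps the sample names exactly as written in the header.
df <- read.csv(text = csv_text, check.names = FALSE, stringsAsFactors = FALSE)

# Separate the gene identifiers from the numeric delta Ct values.
genes <- df$ID_REF
mat <- as.matrix(df[, setdiff(colnames(df), "ID_REF")])
storage.mode(mat) <- "numeric"

# A matrix, unlike a data frame, tolerates duplicated row names.
rownames(mat) <- genes
```

The resulting matrix (genes in rows, samples in columns) is the kind of object that can then be handed to a constructor instead of a list of files.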
Hi Kevin,
Thank you for your reply. Please don't mind, but I would like to follow up with a few more questions.
Firstly, how can I carry out
when I have duplicate gene names? I end up seeing this error:
Also, when I try this command:
it shows the following error:
Thanks again for your time.
Hello again,
Do you know why there are duplicate genes? To overcome this as a quick fix, you can try (this simply adds a number beside each gene, which will make the rownames unique):
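One way to do this in base R is make.unique(), shown here as a sketch on made-up gene names. Note that it is a slight variant of the idea above: it leaves the first occurrence untouched and only adds a suffix to the repeats.

```r
# Hypothetical gene names with repeats (e.g. several probes per gene).
genes <- c("GeneX", "GeneY", "GeneX", "GeneX")

# Append _1, _2, ... to every repeated name so that all names are unique.
unique_genes <- make.unique(genes, sep = "_")
print(unique_genes)
# [1] "GeneX"   "GeneY"   "GeneX_1" "GeneX_2"
```

The unique names can then be assigned as row names of the data frame without triggering the duplicate row names error.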
For creating the qPCRset, you can instead try the new() function:
Take a look at pages 47 and 48 of the manual: https://www-test.ebi.ac.uk/bertone/software/HTqPCR.pdf
Hi Kevin,
The actual data contained gene names like this:
I am guessing that different probes map back to the same gene? In such a case, I thought it might be best to alter the gene names to something like this:
This is how I ended up with duplicate gene names. In the manual you shared, qPCRraw also seems to have genes repeated and so I thought this was something that could be done. Please correct me if I am wrong.
If I use the original gene names with the probes and follow the steps you suggested:
No errors show up, but when I try to view raw, this error pops up:
Apologies for the additional errors!
Hmm... maybe double-check that the column and row names are what you expect them to be?
Thanks Kevin, I shall do that.
If that does not work, then just take a look at the featureCategory parameter that can additionally be passed to the new() function.
In the example on page 47 of the manual, they use featureCategory=as.data.frame(array("OK", c(n, m))), where n and m refer to the dimensions (rows and columns) of the dataset that you are reading.
As a final guide, I notice that the author of the package lurks on the Bioconductor forums, so your final option may be to post there, or just contact her directly (email listed here: https://bioconductor.riken.jp/packages/3.0/bioc/html/HTqPCR.html)
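To make the featureCategory suggestion concrete, here is a hedged sketch modelled on the pattern in the HTqPCR documentation. The matrix values and the gene and sample names are invented for illustration, and the qPCRset step only runs if HTqPCR is installed; whether this resolves the error seen when printing raw is not something the sketch can confirm.

```r
# Toy delta Ct matrix: 3 genes (rows) x 2 samples (columns), hypothetical values.
mat <- matrix(c(5.2, 7.8, 3.3,
                4.1, 8.0, 2.9), nrow = 3, ncol = 2)
gene_names   <- c("GeneX", "GeneY", "GeneZ")        # made-up feature names
sample_names <- c("GrpA1_baseline", "GrpA1_flare")  # sample names from the question

# One "OK" flag per value, with the same dimensions as the data matrix.
feat_cat <- as.data.frame(array("OK", dim = dim(mat)))

# Build the object only when HTqPCR is available.
if (requireNamespace("HTqPCR", quietly = TRUE)) {
  library(HTqPCR)
  raw <- new("qPCRset", exprs = mat, featureCategory = feat_cat)
  sampleNames(raw)  <- sample_names
  featureNames(raw) <- gene_names
}
```

Using dim(mat) rather than typing the two numbers by hand keeps featureCategory in step with the data if genes or samples are added or removed.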