Hello All,
I am trying to analyze some methylation data using the minfi package. I have raw idat files for all my patient samples, right now I am stuck at the very first step. As the tutorial suggest, I have created a sample targets file that includes the samples information and also location of the idat files. I have pasted few lines of my sample target file below and also the output that I get from printing read.metharray.sheet. For some reason in the input I see character(0) as Basename. But I see correct output from list.files. Would really appreciate your input on this one:
This is my code:
library(minfi)
baseDir <-"/home/idats"
list.files(baseDir)
targets <- read.metharray.sheet(baseDir)
print (head(targets))
Input targets file:
Sample_Name,Sample_Well,Sample_Plate,Sample_Group,Pool_ID,person,age,sex,status,Array,Slide,Basename
908224,NA,NA,Cancer,NA,908224,62.9,F,Cancer,420015,R01C01,/home/idats/420015_R01C01
908224,NA,NA,Cancer,NA,908224,62.9,F,Cancer,420015,R01C01,/home/idats/420015_R01C01
836160,NA,NA,Normal,NA,836160,50.1,M,Normal,7420015,R05C01,/home/idats/7420015_R05C01
836160,NA,NA,Normal,NA,836160,50.1,M,Normal,7420015,R05C01,/home/idats/7420015_R05C01
Output:
read.metharray.sheet] Found the following CSV files:
"/home/idats/test.csv"
Sample_Name Sample_Well Sample_Plate Sample_Group Pool_ID person age sex
1 908224 <NA> <NA> Cancer <NA> 908224 62.9 F
2 908224 <NA> <NA> Cancer <NA> 908224 62.9 F
3 836160 <NA> <NA> Normal <NA> 836160 50.1 M
4 836160 <NA> <NA> Normal <NA> 836160 50.1 M
status Array Slide Basename
1 Cancer 420015 R01C01 character(0)
2 Cancer 420015 R01C01 character(0)
3 Normal 7420015 R05C01 character(0)
4 Normal 7420015 R05C01 character(0)
https://github.com/stephaniehicks/bioconductorNotes/blob/master/minfi.Rmd https://www.bioconductor.org/help/course-materials/2015/BioC2015/methylation450k.html#introduction
Thanks
I also met such problem before. Something it is caused by the format of samplesheet. Something I will debug it by download the code of these function, such as read.metharray.exp(). More convenient way should be provided so that other's can help you. I prefer github to keep you data and script, so that other can easily check the pipeline with git clone.
I saw your post over here https://support.bioconductor.org/p/71585/, can you please specify what samplesheet format is applicable? Posted above is the only code that I have written for this analysis. Thanks