17 months ago
adR
Hi all,
I have 100 whole-genome bisulfite sequencing samples and generated 100 bismark.cov.gz files with the Bismark pipeline. The samples fall into four groups, and I want to run a differential methylation analysis across them.
When I open a bismark.cov.gz file, I see six columns, and I don't know which columns to use to combine all the files into a single data frame. I am doing the analysis in R and would appreciate feedback, or pointers to a pipeline or packages designed for this type of analysis.
Best, ADR
You could have a look at the dmrseq tutorial: https://bioconductor.org/packages/release/bioc/vignettes/dmrseq/inst/doc/dmrseq.html
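On the six columns: a Bismark coverage file has no header, and its columns are chromosome, start, end, methylation percentage, count of methylated reads, and count of unmethylated reads. The two count columns, keyed by chromosome and start, are what you would normally combine across samples. Below is a minimal base-R sketch of that idea. The column labels, the helper names `read_cov` and `write_cov`, and the two tiny made-up files are all my own assumptions for illustration; they are not from the original post.

```r
# Labels for the six header-less columns of a Bismark coverage file.
# The names are our own; only the column order comes from the format.
cov_cols <- c("chr", "start", "end", "pct_meth", "n_meth", "n_unmeth")

# Read one coverage file; keep the locus and the two count columns,
# suffixing the counts with the sample name.
read_cov <- function(path, sample) {
  x <- read.table(gzfile(path), header = FALSE, sep = "\t",
                  col.names = cov_cols,
                  colClasses = c("character", "integer", "integer",
                                 "numeric", "integer", "integer"))
  x <- x[, c("chr", "start", "n_meth", "n_unmeth")]
  names(x)[3:4] <- paste0(c("M_", "U_"), sample)
  x
}

# Hypothetical helper: write a tiny gzipped, tab-separated test file.
write_cov <- function(df, path) {
  con <- gzfile(path, "w")
  write.table(df, con, sep = "\t", quote = FALSE,
              row.names = FALSE, col.names = FALSE)
  close(con)
}

# Two made-up files stand in for the real bismark.cov.gz files.
tmp <- tempfile(c("s1_", "s2_"), fileext = ".cov.gz")
write_cov(data.frame(chr = "chr1", start = c(100L, 200L), end = c(100L, 200L),
                     pct = c(50, 100), m = c(2L, 4L), u = c(2L, 0L)), tmp[1])
write_cov(data.frame(chr = "chr1", start = c(200L, 300L), end = c(200L, 300L),
                     pct = c(25, 0), m = c(1L, 0L), u = c(3L, 5L)), tmp[2])

# Full outer join on locus, so loci covered in only some samples are kept.
samples <- c("s1", "s2")
tabs <- Map(read_cov, tmp, samples)
merged <- Reduce(function(a, b) merge(a, b, by = c("chr", "start"), all = TRUE),
                 tabs)

# A locus absent from a sample's file had no reads there: set counts to zero.
merged[is.na(merged)] <- 0L
merged <- merged[order(merged$chr, merged$start), ]
print(merged)
```

This is only meant to show which columns matter. With 100 whole-genome files a plain `merge` like this would likely be too slow and memory-hungry, which is why a dedicated reader such as the one used in the tutorial is the more practical route.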
Thanks for your suggestion, but reading all 100 files fails, as you can see below.
Here is the error:
[read.bismark] Parsing files and constructing valid loci ... Error in h(simpleError(msg, call)) : error in evaluating the argument 'x' in selecting a method for function 'unique': long vectors not supported yet: /Volumes/Builds/R4/R-4.2.3/src/main/memory.c:3885
Do you have any idea what the cause might be?
Thank you!
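In general, "long vectors not supported yet" means some R function was handed a vector with more than 2^31 - 1 elements and cannot handle that. I can't say for certain where in read.bismark this happens, but a rough back-of-the-envelope check suggests that pooling loci from 100 whole-genome files could exceed the limit. The figure of about 28 million CpG rows per file is my assumption (roughly the number of CpG sites in a human genome), not something stated in the post.

```r
# Largest length of an ordinary (non-long) R vector: 2^31 - 1.
max_len <- .Machine$integer.max

n_files <- 100          # number of coverage files, from the post
rows_per_file <- 28e6   # ASSUMPTION: about 28 million CpG rows per file

# Rows if the loci from every file are concatenated before de-duplication.
total_rows <- n_files * rows_per_file

total_rows > max_len    # TRUE would mean the pooled vector is a "long vector"
```

If that is indeed the cause, reading fewer files at a time, or working one chromosome at a time, might keep things under the limit, though I have not tested this on your data.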