Entering edit mode
3.7 years ago
prgrmmr70
•
0
How we can find the maximum number of peaks (coverage) in a bigWig data file to analyze a chip-seq data?
I do not know if the following command from HOMER can do it for me. Cuz I am new in ChIp-seq analysis and Tools for analysis.
-wig <wiggle file="" 1=""> [wiggle file 2] ... (read coverage counts from wiggle files)
Can you expand more on what you mean by maximum number of peaks?
I mean the number of binding sites in the background (control/mock samples against treatment ) in which we can find enriched peaks (which are specific to the type of target protein) using peak calling tools. The enriched peaks are a subset of maximum number of peaks in background data
Do you have the SAM/BAM file that was used to generate the bigWig?
I download bigWig file from the following link and it seems that there are bam file as well:
https://genome.ucsc.edu/cgi-bin/hgFileUi?db=hg19&g=wgEncodeBroadHistone
The broadPeak files are the ones that will have the actual peaks. If you count the number of rows that corresponds to the number of called peaks.
I downloaded broadPeak file and opened it in a text editor. the rows are as follows:
chr1 237626 238382 . 538 . 11.582263 15.7 -1
You mean the number of rows like the above example?
Yep, since each row is a peak it's as simple as counting the number of rows.
Is it the output file after peak calling? for example using MACS?
You'll need to check the documentation for the exact peak caller they used, but that is from a peak caller.