I have been trying to figure out how to use F-seq for almost a year now. I can do the basics, but there more complicated aspects are lost on me. It's a bit aggravating as other than ZINBA, F-seq is the only other peak caller specifically designed for FAIRE-seq data. It seems that many people on Biostars know some aspects and I'd like to get it all in one place.
First, the F-seq site is located here: http://fureylab.web.unc.edu/software/fseq/
Let me start with what the basic command looks like:
fseq -of bed -f 0 -l 600 -t 6.0 chr1.bed chr2.bed chr3.bed
Each chromosome is listed separately.
-of gives the format -f fragment extension length -l feature length -t threshold, this controls the stringency of the peak calling
One needs a script to split the reads into chromosomes, which is not included.
One needs a way to covert the input into a fixed wiggle file. Then one needs a script to split the wiggle file into chromosomes, not included.
Use iffBuilder, available on the F-seq webpage, like this
iffBuilder chr1.wig
. This produces a file called chr1.iff
It seems that if you put all the iff files in a directory like "Input" and use the new command:
fseq -of bed -f 0 -l 600 -d Input -t 6.0 chr1.bed chr2.bed chr3.bed
Then the program accesses the input files. It doesn't give much indication that it's using input and I don't know how to check the files are being accessed. I tried making some malformed input files to see if the program would crash and it did. So, I assume the files are being accessed. If someone has a Mac OS X or Unix command line technique for checking which files are being accessed, let me know.
The next portion involves using mappability. I have seen the post Can you create your own bff files for F-seq? on Biostars.org saying there is a second program called bffBuilder which isn't publicly available.
It's also not clear what format the mappability files should be. It seems that what we need is a fixed wiggle file with mappability scores split by chromosomes. This is pure speculation, but it sounds like the program bffBuilder would work like this: bffBuilder chr1.wig From analogy with iffBuilder, I assume it makes a file called chr1.bff
If anybody has any more information or any association with Furey lab and can make bffBuilder available, please help flesh out everything necessary to use F-seq as a tool in FAIRE-seq.
I can help out regarding the open files under UNIX:
First obtain the PID of the process, e.g.
This will return the process id {PID}
Then check in the /proc directory for the file descriptors used by this PID:
Which will show you a list of links to all open file handles for the process {PID}
HTH!
Thanks. That's very useful.