Entering edit mode
10.3 years ago
bdeonovic
▴
210
I have a fastq file (CHIP-seq) and I would like to use cisGenome for my analysis. It requires an ALN file:
The file should be tab-delimited and should have three columns as follows:
chr1[tab]359077[tab]F
chr1[tab]376890[tab]R
....
This is called ALN format. A file in this format usually has a name ending with *.aln.
What is the recommended way of getting fastq to aln? aln
For my own reference:
Do you have a recommendation for software that does histone methylation analysis for CHIP-seq?
For which step(s)? I'm guessing that you're new to this, so your life will be easier if you can either find a local collaborator to help you through the first analysis or take a course (I don't have any links handy, unfortunately, but you should be able to find one with a bit of googling).
Yes, new. My PI gave me data and told me to use cisGenome. He also talked about MACS, but suggested that MACS was better for dealing with transcription factor analysis.
Perhaps cisGenome has an alignment feature, I've never used it. If nothing else, just align with one of the aforementioned aligners and then convert the resulting SAM file to that format with awk (go ahead an post again once you have your SAM or BAM files and need help doing the conversion). I would strongly encourage you to read some recent review articles on peak calling. The unfortunate reality is that most PIs are flying pretty blind with this sort of thing, so there may be some better options for your dataset.
Thanks, I'll take a look. I posted some helpful review articles for anyone in the future who is interested.