What Tools Are Available For Analyzing Bisulfite Data?
14.1 years ago

A colleague produced an alignment (SAM format) of bisulfite-treated sequences using Novoalign. About 85% of the sequences seem correctly aligned (which may need to be resolved first). Is there a tool or set of tools available for finding and quantifying the C->T conversions? The read length for each aligned sequence is roughly 40bp.

I'm most familiar with Perl, but I'm open to tools in any language that would be helpful.

@Pierre Lindenbaum. Where did your answer go?

14.1 years ago
brentp 24k

I had that same problem with existing pipelines for BS-treated reads--it's hard to get per-base methylation scores after the alignment.

So (shameless plug), I wrote MethylCoder, which takes FASTQ or FASTA read files, runs the alignment, and outputs (in addition to a SAM file) a text file like:

#seqid  mt  bp  c   t
chr1   3   1354    0   1
chr1   1   1358    6   1
chr1   2   1393    0   1
chr1   6   1394    4   9

This indicates the methylation status at every base: the methylation is c / (c + t).

Here mt is the methylation type or context (1/4 are CG on the +/- strand, 2/5 are CHG on the +/- strand, and 3/6 are CHH on the +/- strand).
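For anyone post-processing a file in this layout, here is a minimal Python sketch of the c / (c + t) calculation. It assumes whitespace-separated columns exactly as in the sample above, and the names `Site`, `parse_sites` and `CONTEXTS` are hypothetical helpers for illustration, not part of MethylCoder.

```python
from typing import Iterable, Iterator, NamedTuple

# Context codes as described in the answer: 1/4 = CG, 2/5 = CHG, 3/6 = CHH,
# with 1-3 on the + strand and 4-6 on the - strand.
CONTEXTS = {1: ("CG", "+"), 2: ("CHG", "+"), 3: ("CHH", "+"),
            4: ("CG", "-"), 5: ("CHG", "-"), 6: ("CHH", "-")}


class Site(NamedTuple):
    """One row of the per-base table (hypothetical helper type)."""
    seqid: str
    context: str
    strand: str
    pos: int
    c: int
    t: int

    @property
    def methylation(self) -> float:
        # Methylation level is c / (c + t); NaN when there is no coverage.
        total = self.c + self.t
        return self.c / total if total else float("nan")


def parse_sites(lines: Iterable[str]) -> Iterator[Site]:
    """Yield a Site for each data row, skipping blank and '#' header lines."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        seqid, mt, bp, c, t = line.split()
        context, strand = CONTEXTS[int(mt)]
        yield Site(seqid, context, strand, int(bp), int(c), int(t))


# The sample rows quoted in the answer.
example = """\
#seqid  mt  bp  c   t
chr1   3   1354    0   1
chr1   1   1358    6   1
chr1   2   1393    0   1
chr1   6   1394    4   9
"""

sites = list(parse_sites(example.splitlines()))
for s in sites:
    print(f"{s.seqid}\t{s.pos}\t{s.context}{s.strand}\t{s.methylation:.3f}")
```

For a real file, passing an open file handle to `parse_sites` instead of `example.splitlines()` should work the same way, since it only iterates over lines.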

The pipeline is documented at the link above and also includes scripts for visualization and analysis, for example finding regions that are differentially methylated between two BS-Seq runs. Also, the author is very responsive to questions and feature requests ;)

12.6 years ago
Eric Fournier ★ 1.4k

I understand that this question was answered ages ago, but since it is the most relevant question tagged "Bisulfite" and thus a natural first step for those new to the technology and seeking general guidance on how to interpret it (as I did a month ago), I think the following update is relevant:

Felix Krueger wrote the following review in early 2012, detailing the steps for analysis of bisulfite-treated libraries, common sources of errors, and a list of tools. Among these tools are Bismark (ref) (written by Krueger), MethylCoder (ref) (written by brentp), BRAT (ref), BS-Seeker (ref) and BSMAP (ref).

I'd also recommend going to the SEQanswers software list and searching for "bisulfite"/"bisulphite".

14.1 years ago

I've been very pleased with the GNUMAP software and the responsiveness of its developers. They have a flag for doing bisulfite analysis. However, I'm not sure whether you can run the bisulfite analysis on an existing alignment; you may have to re-map the reads if you use this tool. For future projects, though, I would definitely recommend it.

14.1 years ago

http://quma.cdb.riken.jp/

14.1 years ago

New tools were advertised at ISMB this year. I have not tried them myself, but search the proceedings, especially the poster sessions.

GNUMAP (mentioned by Daniel Standage) was also presented in a poster.

14.1 years ago
Neilfws 49k

You may want to look at methVisual, a Bioconductor package, and BSMAP, a tool to map bisulfite reads to a genome sequence. Disclaimer: I have not used either, so I can't be sure of their suitability for your problem.

13.3 years ago
brentp 24k

As another answer, GSNAP (part of GMAP) maps BS-Seq reads. In my experience, it is able to map more reads, at the expense of running time.

There is a comparison of what I consider the full-featured BS-Seq mapping programs here: https://github.com/brentp/methylcode/tree/master/bench

8.0 years ago

Hope ViewBS can help you. https://github.com/readbio/ViewBS
