Question

Reading Ab1 Files With Python

4

14.3 years ago

bow ▴ 790

I'm looking for a way to convert a batch of AB1 trace files into FASTA using Python. I've looked on the Biopython website but haven't found any reference on how to do it so far. For Perl folks, I know there might be a way to do it with BioPerl, but as I'm much more familiar with Python, I'd prefer to do it that way.

So, any pointers? Help is much appreciated :)!

EDIT: Just as additional info for people who might stumble on this page: I had some extra time and decided to write a Python module for reading AB1 files. You can find it here.

python conversion • 20k views

ADD COMMENT • link updated 11.7 years ago by pstragier ▴ 10 • written 14.3 years ago by bow ▴ 790

0

Was that enough of an answer? If so, please mark a correct answer to help people who come across this question later :)

ADD REPLY • link 14.3 years ago by Will 4.6k

0

Bow donated his code to Biopython, and it is included with Biopython 1.58 onwards, see http://news.open-bio.org/news/2011/08/biopython-1-58-released/

Thanks!

ADD REPLY • link updated 5.3 years ago by Ram 44k • written 13.1 years ago by Peter 6.0k

0

from Bio import SeqIO

# AB1 files are binary, so open them in "rb" mode.
with open("test.ab1", "rb") as handle:
    for record in SeqIO.parse(handle, "abi"):
        print(record)
# From here on, you can extract the data and go to any file format you want.

ADD REPLY • link updated 5.3 years ago by Ram 44k • written 11.7 years ago by pstragier ▴ 10

Ram · Answer 1 · 2010-09-13

6

14.3 years ago

Will 4.6k

I had this same issue not too long ago: Converting Ab1 Trace Files Into Scf Trace Files

I couldn't find a reasonable Python method, so I ended up using Staden; the docs are here.

You could certainly drive it from Python's subprocess module, but convert_trace has a "directory" option to process everything in a directory.

Hope that helps,
Will

PS. I also tried abiparser.py (it turns up in a Google search), but I wasted about 3 hours trying to get it to work.

ADD COMMENT • link updated 5.3 years ago by Ram 44k • written 14.3 years ago by Will 4.6k

0

Ah yes, I tried abiparser.py and it's a pain to understand :/. Staden isn't a walk in the park either. I haven't been able to install it on Lucid. Do you have any pointers on how to do it?

ADD REPLY • link 14.3 years ago by bow ▴ 790

0

Actually, it's available from the package repository. It's under bio-linux-staden, staden-io-lib-utils. That's how I got it installed on my Lucid system.

ADD REPLY • link 14.3 years ago by Will 4.6k

0

Is it from the official repo? I could install staden-io-lib-utils but not bio-linux-staden.

ADD REPLY • link 14.3 years ago by bow ▴ 790

0

bio-linux-staden is from the Bio-Linux repository, which has lots of useful packages. Add deb http://nebc.nox.ac.uk/bio-linux/ unstable bio-linux to your /etc/apt/sources.list and do an apt-get update.

ADD REPLY • link updated 5.3 years ago by Ram 44k • written 14.3 years ago by Brad Chapman 9.7k

0

Thanks Brad, I always forget that I added the Bio-Linux repository.

ADD REPLY • link 14.3 years ago by Will 4.6k

0

Thanks guys :)! I think I got it working now. I ended up making a bash script to iterate through the ab1 files.

ADD REPLY • link 14.3 years ago by bow ▴ 790

score 1 · Answer 2 · 2011-02-22

I've always converted to FASTA (and qual) with Phred. I don't know how it compares to Staden, but at least with old equipment, you don't want to extract just the sequence data embedded in the trace file, as Phred does (did?) a better job of base calling.

There are other options, such as TraceTuner and PeakTrace, but I don't know how well they fare.

Ram · Answer 3 · 2013-04-19

1

11.7 years ago

pstragier ▴ 10

Using Biopython:

from Bio import SeqIO

# AB1 files are binary, so open them in "rb" mode.
with open("test.ab1", "rb") as handle:
    for record in SeqIO.parse(handle, "abi"):
        print(record)

From here on, you can extract the data and go to any file format you want.
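For the batch AB1-to-FASTA case in the original question, something along these lines might work. It is only a sketch: it assumes Biopython 1.58 or later (the first release with the "abi" parser), and the helper names (find_ab1_files, fasta_path_for, convert_one, convert_directory) and the one-FASTA-per-trace layout are made up for illustration, not part of Biopython. Note that it writes out the base calls already embedded in each trace file and does no re-calling of its own.

```python
from pathlib import Path


def find_ab1_files(in_dir):
    """Return the .ab1 files directly inside in_dir, sorted by path."""
    return sorted(
        p for p in Path(in_dir).iterdir()
        if p.is_file() and p.suffix.lower() == ".ab1"
    )


def fasta_path_for(ab1_path, out_dir):
    """Map e.g. traces/sample1.ab1 to <out_dir>/sample1.fasta."""
    return Path(out_dir) / (Path(ab1_path).stem + ".fasta")


def convert_one(ab1_path, fasta_path):
    """Convert a single trace file; returns the number of records written."""
    # Imported here so the path helpers above work without Biopython installed.
    from Bio import SeqIO

    # SeqIO.convert opens the files itself when given file names.
    return SeqIO.convert(str(ab1_path), "abi", str(fasta_path), "fasta")


def convert_directory(in_dir, out_dir):
    """Convert every .ab1 file in in_dir; returns the FASTA paths written."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for ab1_path in find_ab1_files(in_dir):
        fasta_path = fasta_path_for(ab1_path, out_dir)
        convert_one(ab1_path, fasta_path)
        written.append(fasta_path)
    return written
```

Calling convert_directory("traces", "fasta") (both directory names are placeholders) would then write one .fasta file per trace into the fasta directory.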

ADD COMMENT • link updated 5.3 years ago by Ram 44k • written 11.7 years ago by pstragier ▴ 10

0

The code Biopython uses to read ABI files was contributed by Bow himself (after this question was written). See the comments on the question itself.

ADD REPLY • link 11.7 years ago by Peter 6.0k