Hey,
I am looking for a python based tool/script to download/extract the sequence of a genomic region. Up to now I use fasta
from https://github.com/dancooke/bioio, but I'd like to integrate it into my Python script without the need to run/compile cpp code. Is there any tool available or does anyone know, how to translate this (https://github.com/dancooke/bioio/blob/master/fasta.cpp) to Python only?
If you have a fasta file and a bed file, you can use
bedtools getfasta
. You have pybedtools a python library for bedtools.NOTE: As you are using Python, downloading the sequence would be very easy.
Cheers,
Nitin N.
I don't have a bed file unfortunately, but this should be easy to generate :). I'll look into it.