Fasta file from a gff file
1
1
Entering edit mode
6.1 years ago
ahmedferoz20 ▴ 10

Hi, I have a GFF file and a genome sequence file in fasta format. I need to get a fasta file containing the sequence of 1 kb uspstream of each gene on the x chromosome?

I prefer linux scripts however, python code is fine too.

Really appreciate if anyone can help me.

gene genome • 2.2k views
ADD COMMENT
1
Entering edit mode

Look at bedtools or bedops.

ADD REPLY
0
Entering edit mode

Thanks h.mon. I am wondering is it working for gff file too?

ADD REPLY
0
Entering edit mode

BioPython would make this very easy, though their support for GFF is limited. Brad Chapman built BCBio though which handles gff and is able to interface with BioPython nicely

ADD REPLY
1
Entering edit mode
6.1 years ago

Hey bro,

gffread can do this for you, and it is strand-specific. Take a look at the example here: A: Cufflinks gffread utility

To add 1000bp up and downstream of each gene, then just write a script that modifies the GFF input file. AWK should easily work, in this regard.

Kevin

ADD COMMENT

Login before adding your answer.

Traffic: 1792 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6