Extract transcript fasta using gff
3.5 years ago
boczniak767 ▴ 870

Hi,

Is there a simple way to extract the FASTA sequences of transcripts using a genome FASTA and a GFF file?

I came across gffread, which proposes the command gffread -w transcripts.fa -g /path/to/genome.fa transcripts.gtf. I just don't know what this GFF file has to contain: only exons, all features, ...?

The second possibility is mapToTranscripts from the GenomicFeatures R package. The manual is quite complicated, though, and I don't know which steps are needed to retrieve the transcripts.

Does anybody have experience with this kind of procedure?

fasta gff • 2.0k views
3.5 years ago
Juke34 8.9k

gffread is quite straightforward. The GFF/GTF file has to contain exon features.
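To illustrate why exon features are what matters, here is a minimal Python sketch of the general idea: group exons by transcript, join their genomic sequences in coordinate order, and reverse-complement transcripts on the minus strand. This is not gffread's actual implementation. The function names are hypothetical, the genome is assumed to be already loaded as a dict of sequence name to string, and each exon is assumed to carry a single transcript_id (GTF) or a single Parent (GFF3).

```python
from collections import defaultdict

# Translation table used to complement a DNA string.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def parse_attributes(field):
    """Parse column 9 in either GFF3 (key=value) or GTF (key "value") style."""
    attrs = {}
    for part in field.strip().split(";"):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
        else:
            key, _, value = part.partition(" ")
        attrs[key.strip()] = value.strip().strip('"')
    return attrs


def transcripts_from_exons(genome, gff_lines):
    """Build spliced transcript sequences from exon lines only.

    genome: dict mapping sequence name -> sequence string (assumed preloaded).
    gff_lines: iterable of tab-separated GFF/GTF lines.
    """
    exons = defaultdict(list)
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        # Every feature type other than "exon" is ignored.
        if len(cols) != 9 or cols[2] != "exon":
            continue
        attrs = parse_attributes(cols[8])
        # Assumption: one transcript per exon line (no comma-separated Parent).
        tid = attrs.get("transcript_id") or attrs.get("Parent")
        if tid is None:
            continue
        exons[tid].append((cols[0], int(cols[3]), int(cols[4]), cols[6]))

    transcripts = {}
    for tid, parts in exons.items():
        parts.sort(key=lambda exon: exon[1])  # order by genomic start
        chrom, strand = parts[0][0], parts[0][3]
        # GFF coordinates are 1-based and inclusive.
        seq = "".join(genome[chrom][start - 1:end] for _, start, end, _ in parts)
        transcripts[tid] = reverse_complement(seq) if strand == "-" else seq
    return transcripts
```

Gene, mRNA and CDS lines are skipped entirely in this sketch, which mirrors the point above: for transcript sequences, the exon records are the features that need to be present.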

You might also consider agat_sp_extract_sequences.pl from AGAT.

See here for examples: Extracting genomic feature sequences from GTF/GFF files with AGAT


Thanks, gffread does indeed seem to work perfectly. I'm just checking the results. Thanks also for the link to AGAT.
