Entering edit mode
4.6 years ago
marcelolaia
▴
10
Hi, after a hard search on the net I found this awesome script (https://git.io/JfIjg). It works nice. However, I need to extract all exon sequence from a genome based on GFF3 and FASTA. Please, found here https://is.gd/gGMtUo a GFF3 sample file. From that file I need to extract sequences like these:
>Eucgr.A00001.1.v2.0.exon.1
ACTGTGACA......
>Eucgr.A00001.1.v2.0.exon.2
ACTGTGACA......
>Eucgr.A00001.1.v2.0.exon.3
ACTGTGACA......
(...)
>Eucgr.A00001.1.v2.0.exon.12
ACTGTGACA......
(...)
Could you help me? Thank you so much!
Thank you so much! It worked out off the box! I have a little trouble to install it on my Debian box. So, I leave here the step by step I do to install AGAT on my Debian testing. This steps worked for me on
Linux 5.5.0-2-amd64 #1 SMP Debian 5.5.17-1 (2020-04-15) x86_64 GNU/Linux
:Install all dependencies by APT
Clone AGAT
Run the
agat_sp_extract_sequences.pl
. For extractexon
, for example, you could run:Thank you!