GFF and Genome Fasta file
9.1 years ago
aj123 ▴ 120

Hello,

Please let me know whether the GFF and genome FASTA files have to come from the same source when running a cuffdiff analysis. For example, can I use these:

For the genome version: ftp://ftp.ensemblgenomes.org/pub/release-28/plants/fasta/arabidopsis_thaliana/dna/Arabidopsis_thaliana.TAIR10.28.dna.toplevel.fa.gz

For the GFF file: ftp://ftp.arabidopsis.org/Maps/gbrowse_data/TAIR10/TAIR10_GFF3_genes_transposons.gff

Please note that chromosomes in the GFF are labeled 'ChrX', while those in the FASTA are labeled 'X'.

RNA-Seq gff fasta cuffdiff genome • 2.6k views
9.1 years ago

Yes, the chromosome names must be exact matches (unless things have changed recently), so having "ChrX" vs "X" in different files will be an issue.
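To see how far the two files disagree before changing anything, a quick check along these lines may help. This is only a sketch: the function names are made up for illustration, and the toy lines stand in for the real files. It collects the first word of each FASTA header and the first column of each GFF feature line, then reports the names found in only one of the two.

```python
def fasta_names(lines):
    """Collect sequence names: the first word after '>' on each header line."""
    names = set()
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                names.add(fields[0])
    return names


def gff_seqids(lines):
    """Collect seqids (column 1) from GFF feature lines."""
    names = set()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # an embedded sequence section follows; no more features
        if line.startswith("#") or not line.strip():
            continue  # skip comments, directives and blank lines
        names.add(line.split("\t", 1)[0])
    return names


def compare_names(fasta_lines, gff_lines):
    """Return (names only in the FASTA, names only in the GFF), both sorted."""
    fa = fasta_names(fasta_lines)
    gff = gff_seqids(gff_lines)
    return sorted(fa - gff), sorted(gff - fa)


if __name__ == "__main__":
    # Toy data standing in for the real files (hypothetical content).
    fasta = [">1 dna:chromosome", "ACGT", ">2 dna:chromosome", "GGCC"]
    gff = [
        "##gff-version 3",
        "Chr1\tTAIR10\tgene\t100\t200\t.\t+\t.\tID=gene1",
        "Chr2\tTAIR10\tgene\t300\t400\t.\t-\t.\tID=gene2",
    ]
    only_fasta, only_gff = compare_names(fasta, gff)
    print("only in FASTA:", only_fasta)
    print("only in GFF:", only_gff)
```

If both lists come back empty, the names already match. For real files, pass open file handles instead of the lists (for a gzipped FASTA, `gzip.open(path, "rt")`).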

The *nix command-line utility sed would be the best option, and it would be simpler (fewer changes) to change the fasta file chromosome names to match the gff names.
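For anyone who would rather not use sed, the same renaming can be sketched in Python. This is not the sed command itself, just an equivalent idea: add a prefix to the name on every FASTA header line and leave the sequence lines alone. The function name, the `prefix` default and the `special` mapping are assumptions for illustration. In particular, organelle sequences may be named differently in the two files (for example `Mt`/`Pt` in one and `ChrM`/`ChrC` in the other), so check the actual names in your own files before relying on any mapping.

```python
def rename_fasta_headers(lines, prefix="Chr", special=None):
    """Yield FASTA lines with the sequence name on each header rewritten.

    prefix  -- string prepended to each name (assumed 'Chr' here)
    special -- optional dict of exact old-name -> new-name overrides,
               used instead of the prefix when a name is listed
    """
    special = special or {}
    for line in lines:
        if not line.startswith(">"):
            yield line  # sequence line: pass through unchanged
            continue
        body = line.rstrip("\n")
        ending = line[len(body):]  # keep the original line ending, if any
        name, _, rest = body[1:].partition(" ")
        new_name = special.get(name, prefix + name)
        yield ">" + new_name + (" " + rest if rest else "") + ending


if __name__ == "__main__":
    # Toy input; the override mapping is an assumption to verify locally.
    toy = [">1 dna:chromosome\n", "ACGT\n", ">Mt dna:chromosome\n", "TTTT\n"]
    for out_line in rename_fasta_headers(toy, special={"Mt": "ChrM"}):
        print(out_line, end="")
```

To apply it to a real genome, read from `gzip.open(path, "rt")` and write the yielded lines to a new file, keeping the original download untouched.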


Yes, thanks. sed and awk seem to work best in such cases!
