AGAT
, seqtk comp
and featureCounts
return different length for same fragment.
Here is the command line I used:
agat_sp_extract_sequences.pl -f ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.fna -gff ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.gff -t exon --split -o ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.exons.fna
seqtk comp ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.exons.fna > ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.exons.GC.fna
featureCounts -p -F GFF -a ./NCBI/GCF_000612305.1_Egrandis1_0_genomic.gff -t exon -g ID -o ./featureCounts/A1.counts.txt -f --extraAttributes gene ./raw_data/A1.sam.bam
Here is the relevant part of files I have been used.
NCBI/GCF_000612305.1_Egrandis1_0_genomic.fna
Whole file.
GCF_000612305.1_Egrandis1_0_genomic.gff
NC_014570.1 RefSeq exon 12778 13245 . - . ID=id-EucgrC_p006-2;Parent=gene-EucgrC_p006;Dbxref=GeneID:9829628;exon_number=2;gbkey=exon;gene=atpF;locus_tag=EucgrC_p006;number=2
NC_014570.1 RefSeq exon 47867 47974 . - . ID=id-EucgrC_p021-1;Parent=gene-EucgrC_p021;Dbxref=GeneID:9829651;exon_number=1;gbkey=exon;gene=ycf3;locus_tag=EucgrC_p021;number=1
NC_014570.1 RefSeq exon 46882 47109 . - . ID=id-EucgrC_p021-2;Parent=gene-EucgrC_p021;Dbxref=GeneID:9829651;exon_number=2;gbkey=exon;gene=ycf3;locus_tag=EucgrC_p021;number=2
NC_014570.1 RefSeq exon 45999 46148 . - . ID=id-EucgrC_p021-3;Parent=gene-EucgrC_p021;Dbxref=GeneID:9829651;exon_number=3;gbkey=exon;gene=ycf3;locus_tag=EucgrC_p021;number=3
NC_014570.1 RefSeq exon 74612 74836 . - . ID=id-EucgrC_p046-3;Parent=gene-EucgrC_p046;Dbxref=GeneID:9829683;exon_number=3;gbkey=exon;gene=clpP;locus_tag=EucgrC_p046;number=3
NC_014570.1 RefSeq exon 99515 100267 . - . ID=id-EucgrC_p066-2;Parent=gene-EucgrC_p066;Dbxref=GeneID:9829705;exon_number=2;gbkey=exon;gene=ndhB;locus_tag=EucgrC_p066;number=2
NC_014570.1 RefSeq exon 148743 149495 . + . ID=id-EucgrC_p081-2;Parent=gene-EucgrC_p081;Dbxref=GeneID:9829732;exon_number=2;gbkey=exon;gene=ndhB;locus_tag=EucgrC_p081;number=2
GCF_000612305.1_Egrandis1_0_genomic.exons.fna
>id-EucgrC_p006-2 transcript=gene-EucgrC_p006 gene=nbis-gene-8 name=atpF seq_id=NC_014570.1 type=exon
GTTTGGTTCGGGAAGGGATTATGGAAGTTTTGAAATGAATGGAAAGATAATCTACTTTCA
TTAAGCGATTTATTAGATAATCGAAAACAGAGGATCTTGAATACTATTCGAAATTCAGAA
GAATTACGCGGCGGGGCCATTGAACAGCTGGAAAAAGCCCGGGCCCGTTTACGGAAAGTG
GAAATGGAAGCGGAGCAGTTTCGAGTGAATGGATATTCTGAGATAGAACAAGAAAAGTTG
AATCTGATTAATTCAACTTATAAGACCTTGGAACAATTAGAAAATTACAAAAACGAAACT
ATTCATTTTGAACAGCAAAGAGCGATTAATCAAGTCCGACAACGGGTTTTCCAACAAGCC
TTACAAGGAGCTCTAGGAACTCTGAATAGTTGTTTGAACAACGAGTTACATTTACGTACT
ATCAGTGCTAATATTGGCATGTTTGGGGCGATGAAAGAAATAACTGATTAG
>id-EucgrC_p021-1 transcript=gene-EucgrC_p021 gene=nbis-gene-21 name=ycf3 seq_id=NC_014570.1 type=exon
ATGCCTAGATCGAGGATAAATGGAAATTTTATTGATAAGACCTTTTCAATTGTAGCCAAT
ATCTTATTACGAATAATTCCGACAACTTCAGGAGAAAAAGAGGCATTTACCTATTACAGA
GATGGT
>id-EucgrC_p021-2 transcript=gene-EucgrC_p021 gene=nbis-gene-21 name=ycf3 seq_id=NC_014570.1 type=exon
GGATGTCGGCTCAATCCGAAGGAAATTATGCGGAAGCTTTACAGAATTATTATGAAGCTA
TGCGACTAGAAATTGATCCCTATGATCGAAGCTATATACTCTATAACATAGGCCTTATCC
ATACAAGTAACGGAGAACATACGAAAGCTTTGGAATATTATTTTCGGGCACTAGAACGAA
ACCCGTTTTTACCACAAGCTTTTAATAATATGGCTGTGATCTGTCATTAC
>id-EucgrC_p021-3 transcript=gene-EucgrC_p021 gene=nbis-gene-21 name=ycf3 seq_id=NC_014570.1 type=exon
CGGGGAGAGCAGGCCATTCGACAGGGGGATTCTGAAATTGCGGAGGCTTGGTTCGATCAA
GCCGCTGAGTATTGGAAACAAGCTATAGCGCTTACTCCTGGTAATTATATTGAAGCGCAG
AATTGGTTGAAGATCACGAGACGTTTCGAATAA
>id-EucgrC_p046-3 transcript=gene-EucgrC_p046 gene=nbis-gene-43 name=clpP seq_id=NC_014570.1 type=exon
AGGGTAATGATACATCAACCTGCGAGTTCTTTTTATGAGGCACAAACGGGAGAATTTATC
CTGGAAGCAGAAGAACTGCTGAAACTGCGCGAAACCATCACAAGAGTTTATGTACAAAGA
ACGGGCAAACCCCTATGGGTTGTATCCGAAGATATGGAAAGGGATGTTTTTATGTCAGCA
ACAGAAGCCCAAGCTCATGGAATTGTTGATCTTGTAGCGATTGAATAA
>id-EucgrC_p066-2 transcript=gene-EucgrC_p066 gene=nbis-gene-61 name=ndhB seq_id=NC_014570.1 type=exon
TCTCCCACTCCAGTCGTTGCTTTTCTTTCTGTTACTTCGAAAGTAGCTGCTTCAGCTTCA
GCCACTCGAATTTTCGATATTCCTTTTTATTTCTCATCAAACGAATGGCATCTTCTTCTG
GAAATCCTAGCTATTCTTAGCATGATATTGGGGAATCTCATTGCTATTACTCAAACAAGC
ATGAAACGTATGCTTGCATATTCGTCCATAGGTCAAATCGGATATGTAATTATTGGAATA
ATTGTTGGAGACTCAAATGGTGGATATGCGAGCATGATAACTTATATGCTGTTCTATATC
TCCATGAATCTAGGAACTTTTGCTTGCATTGTATTATTTGGTCTACGTACCGGAACTGAT
AACATTCGAGATTATGCAGGATTATACACGAAAGATCCTTTTTTGGCTCTCTCTTTAGCC
CTATGTCTCTTATCCCTAGGAGGTCTTCCTCCACTAGCAGGTTTTTTCGGAAAACTCCAT
TTATTCTGGTGTGGATGGCAGGCAGGCCTATACTTCTTGGTTTCAATAGGACTCCTTACG
AGCGTTATTTCTATCTACTATTATCTAAAAATAATCAAGTTATTAATGACTGGACGAAAC
CAAGAAATAACACCTCACGTGCGAAATTATAGAAGATCCCCTTTAAGATCAAACAATTCC
ATCGAATTGAGTATGATTGTATGTGTGATAGCATCTACTATACCAGGAATATCAATGAAC
CCGATTATTGCAATTGCTCAGGATACCCTTTTTTAG
>id-EucgrC_p081-2 transcript=gene-EucgrC_p081 gene=nbis-gene-74 name=ndhB seq_id=NC_014570.1 type=exon
TCTCCCACTCCAGTCGTTGCTTTTCTTTCTGTTACTTCGAAAGTAGCTGCTTCAGCTTCA
GCCACTCGAATTTTCGATATTCCTTTTTATTTCTCATCAAACGAATGGCATCTTCTTCTG
GAAATCCTAGCTATTCTTAGCATGATATTGGGGAATCTCATTGCTATTACTCAAACAAGC
ATGAAACGTATGCTTGCATATTCGTCCATAGGTCAAATCGGATATGTAATTATTGGAATA
ATTGTTGGAGACTCAAATGGTGGATATGCGAGCATGATAACTTATATGCTGTTCTATATC
TCCATGAATCTAGGAACTTTTGCTTGCATTGTATTATTTGGTCTACGTACCGGAACTGAT
AACATTCGAGATTATGCAGGATTATACACGAAAGATCCTTTTTTGGCTCTCTCTTTAGCC
CTATGTCTCTTATCCCTAGGAGGTCTTCCTCCACTAGCAGGTTTTTTCGGAAAACTCCAT
TTATTCTGGTGTGGATGGCAGGCAGGCCTATACTTCTTGGTTTCAATAGGACTCCTTACG
AGCGTTATTTCTATCTACTATTATCTAAAAATAATCAAGTTATTAATGACTGGACGAAAC
CAAGAAATAACACCTCACGTGCGAAATTATAGAAGATCCCCTTTAAGATCAAACAATTCC
ATCGAATTGAGTATGATTGTATGTGTGATAGCATCTACTATACCAGGAATATCAATGAAC
CCGATTATTGCAATTGCTCAGGATACCCTTTTTTAG
GCF_000612305.1_Egrandis1_0_genomic.exons.GC.fna
chr length #A #C #G #T #4 #CpG
id-EucgrC_p006-2 471 169 68 110 124 0 38
id-EucgrC_p021-1 126 46 19 23 38 0 6
id-EucgrC_p021-2 230 76 43 44 67 0 20
id-EucgrC_p021-3 153 43 27 46 37 0 20
id-EucgrC_p046-3 228 76 40 56 56 0 14
id-EucgrC_p066-2 756 208 154 132 262 0 42
id-EucgrC_p081-2 756 208 154 132 262 0 42
A1.sam.bam
Geneid Chr Start End Strand Length gene ./raw_data/A1.sam.bam
id-EucgrC_p006-2 NC_014570.1 12778 13245 - 468 atpF 15
id-EucgrC_p021-1 NC_014570.1 47867 47974 - 108 ycf3 1
id-EucgrC_p021-2 NC_014570.1 46882 47109 - 228 ycf3 1
id-EucgrC_p021-3 NC_014570.1 45999 46148 - 150 ycf3 1
id-EucgrC_p046-3 NC_014570.1 74612 74836 - 225 clpP 4
id-EucgrC_p066-2 NC_014570.1 99515 100267 - 753 ndhB 0
id-EucgrC_p081-2 NC_014570.1 148743 149495 + 753 ndhB 0
All other 390684 features have their length equals.
code
option) to present your post better. You can use backticks for inline code (`text` becomestext
), or select a chunk of text and use the highlighted button to format it as a code block. I've done it for you this time.Thank you so much! I will do the changes in my next post. Sorry for that!