error ,,, file.vcf --> file snpEff.vcf
1
0
Entering edit mode
10.6 years ago

hi everyone. help me!

I run

VarScan > rice-snp.vcf

and I want run with snpEff tools but error!

./snpEff$ java -jar snpEff.jar rice7 rice-snp.vcf > s.eff.vcf

ERRORS: Some errors were detected
Error type      Number of errors
ERROR_CHROMOSOME_NOT_FOUND      330650

Can anybody help me...thanks all.

software-error SNP • 3.8k views
ADD COMMENT
0
Entering edit mode
# format file input.vcf

#CHROM    POS    ID    REF    ALT    QUAL    FILTER    INFO    FORMAT    Sample1
LOC_Os01g01070    1254    .    A    G    .    PASS    ADP=13;WT=0;HET=0;HOM=1;NC=0   
LOC_Os01g01070    3850    .    A    G    .    PASS    ADP=11;WT=0;HET=0;HOM=1;NC=0    
LOC_Os01g01070    4240    .    C    T    .    PASS    ADP=12;WT=0;HET=0;HOM=1;NC=0   
LOC_Os01g01080    2809    .    T    C    .    PASS    ADP=11;WT=0;HET=0;HOM=1;NC=0   
LOC_Os01g01090    435    .    G    A    .    PASS    ADP=15;WT=0;HET=1;HOM=0;NC=0   
..............
ADD REPLY
0
Entering edit mode

Is the rice7 database one you made yourself or one of the premade ones?

ADD REPLY
0
Entering edit mode

Yes. I made for my group and it is part of the project.

ADD REPLY
1
Entering edit mode

Does it have the same chromosome names? My guess would be not.

ADD REPLY
2
Entering edit mode
10.6 years ago
Pablo ★ 1.9k

Three suggestions:

  1. Make sure the input file is really VCF
  2. Run SnpEff using verbose mode (-v) and it will show all chromosome names and lengths

    $ java -Xmx4g -jar snpEff.jar -v rice7 test.vcf
    
    # Chromosomes names [sizes]  :
    #               '1' [43270923]
    #               '3' [36413819]
    #               '2' [35937250]
    #               '4' [35502694]
    #               '6' [31248787]
    #               '5' [29958434]
    #               '7' [29697621]
    #               '11' [29021106]
    #               '8' [28443022]
    #               '12' [27531856]
    #               '10' [23207287]
    #               '9' [23012720]
    #               'Un' [633585]
    #               'Sy' [592136]
    
  3. If that doesn't work, send me some sample data (http://snpeff.sourceforge.net/about.html#help ) and I'll try to look into it

Cheers

Pablo

ADD COMMENT
0
Entering edit mode

orror!

name chromosome of file input.vcf is "LOC_Os01g01070..."

and name chromosome of data rice7 is:

# Chromosomes names [sizes]  :
#               '1' [43270923]
#               '3' [36413819]
#               '2' [35937250]
#               '4' [35502694]
#               '6' [31248787]
#               '5' [29958434]
#               '7' [29697621]
#               '11' [29021106]
#               '8' [28443022]
#               '12' [27531856]
#               '10' [23207287]
#               '9' [23012720]
#               'Un' [633585]
#               'Sy' [592136]
ADD REPLY
0
Entering edit mode

Well there's your problem.

ADD REPLY
0
Entering edit mode

I'm sorry but My major is biology research and this is my problem.

ADD REPLY
0
Entering edit mode

Many of us are "reformed biologists" :) It looks like you aligned to the transcriptome rather than the genome, since LOC_Os01g01070 is a transcript (apparently of a transcription factor).

ADD REPLY

Login before adding your answer.

Traffic: 2312 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6