Issues while running BaseRecalibrator
1
0
Entering edit mode
14 months ago
Foad ▴ 10

I am facing this error while using the BaseRecalibrator

Thank you for your time and help.

Exact command used:

java -jar ~/WGS/tool/gatk-4.4.0.0/gatk-package-4.4.0.0-local.jar BaseRecalibrator -I dup/oryza.dup -O base/oryza.table -R ref/Oryza-sativa-Japonica-chromosome1.fasta --known-sites oryza_sativa.vcf

Entire error log:

A USER ERROR has occurred: Input files reference and features have incompatible contigs: No overlapping contigs found. reference contigs = [AP014957.1] features contigs = [1, 10, 11, 12, 2, 3, 4, 5, 6, 7, 8, 9]

BaseRecalibrator WGS • 464 views
ADD COMMENT
0
Entering edit mode
14 months ago
Michael 55k

You have different chromosome names in your reference fasta file than in your .vcf file. The reference fasta contains genbank ids (AP014957.1) while the VCF contains chromosome numbers "1", "2", ...

Edit: You could rename the sequences in your reference file (AP014957.1 = chromosome 1):

    >AP014957.1

to look like:

    >1
    ....
    >2

You can either do this manually, it doesn't really pay off to automate this for 12 chromosomes or try to download a different reference genome. In particular, check the VCF file on which reference it is based. The info should be in the VCF, too.

Likely, you must do the initial alignment from scratch if you used a different reference. Before you go ahead, make sure to use the identical reference in all steps. If you want to use base quality recalibration, you need to use the same reference as in the know-sites.vcf in all other steps.

I would personally not bother with renaming VCF or BAM entries if avoidable; chances are that this messes things up. On the other hand, if you intend to run variant annotation after that, you also need to check that the reference of the annotation file used in any subsequent process needs to be compatible and based on the identical assembly, too.

ADD COMMENT

Login before adding your answer.

Traffic: 1803 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6