Hi everyone,
Pretty new you lifting over data here, many thanks for your help!
I have a dataset I'd like to liftover to HG19, rather than it's current HG38 alignment. I'm trying to use picard liftover, but no matter what I do all the variants get rejected.
This is the current script I'm running:
INPUT="/share/hennlab/projects/sa_ponderosa/03-PHASE/khomani_liftover/hg38/khomani100_hg38"
CHAIN="/share/hennlab/reference/liftover_references/hg38ToHg19.over.chain"
OUTPUT="/share/hennlab/projects/sa_ponderosa/03-PHASE/khomani_liftover/hg19"
java -jar /share/hennlab/projects/sa_ponderosa/my_scripts/picard.jar LiftoverVcf \
I=${INPUT}.vcf \
O=${OUTPUT}/khomani100_hg19lifted.vcf.gz \
CHAIN=${CHAIN} \
REJECT=${OUTPUT}/rejected_khomani100_hg19lifted_variants.vcf \
MAX_RECORDS_IN_RAM=100000 \
WARN_ON_MISSING_CONTIG=true \
RECOVER_SWAPPED_REF_ALT=true \
REFERENCE_SEQUENCE=/share/hennlab/projects/sa_ponderosa/03-PHASE/khomani_liftover/hg19.nochr.fa \
> ${OUTPUT}/khomani100_hg19lifted.stdout 2> ${OUTPUT}/khomani100_hg19lifted.stderr
The hg19.nochr.fa
file does have a corresponding .dict file in that directory as well, generated with picard CreateSequenceDictionary
. It looks likes this:
>1
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
I removed the "chr" from throughout the file to see if that was causing the problem and it still resulted in the same outcome, all rejected variants.
Am I missing something obvious? Any help is appreciated!
Could you post your stdout and stderr files too?