GATK3 error message
2
0
Entering edit mode
6.5 years ago
shuksi1984 ▴ 60

I ran following command:

java -Xmx4g -jar /path/to/gatk3.8.jar -T RealignerTargetCreator -R /path/to/Homo_sapiens_assembly38.fasta -I SP1_markduplicate.bam -o SP1_realignedtargets.list

I got following error message:

ERROR MESSAGE: Bad input: We encountered a non-standard non-IUPAC base in the provided reference: '92'

What does this mean? Did Homo_sapiens_assembly38.fasta get corrupted?

GATK next-gen • 2.4k views
ADD COMMENT
0
Entering edit mode
[7 October 2024 at 2:14:19 pm IST] org.broadinstitute.hellbender.tools.walkers.haplotypecaller.HaplotypeCaller done. Elapsed time: 187.74 minutes.
Runtime.totalMemory()=5167382528
***********************************************************************

A USER ERROR has occurred: Bad input: We encountered a non-standard non-IUPAC base in the provided input sequence: '45'

i am getting this error couldn't know how to tackle. please help me through this.

ADD REPLY
0
Entering edit mode

Please don't post this error in multiple threads. Run the command shown below and post the output here.

grep -v '^>' your_fasta_file | tr -d '[ATGCNatgcn\n]' | grep -o '.' | sort | uniq -c
ADD REPLY
0
Entering edit mode

Read the first post here: https://news.ycombinator.com/item?id=18130605

It's not about your error, it's about your approach to solving it. Just googling "non-IUPAC base + GATK" would have led you down the right path. Learning to google search in a productive manner is critical to any computer science related field - any error you faced today has certainly been faced and solved by someone in the past.

ADD REPLY
0
Entering edit mode
6.5 years ago

what is the version of gatk ?

what is the output of:

grep -v '^>' /path/to/Homo_sapiens_assembly38.fasta | tr -d '[ATGCNatgcn\n]' | grep -o '.' | sort | uniq -c
ADD COMMENT
0
Entering edit mode

what is the version of gatk ?

GATK3.8

Output of

grep -v '^>' /path/to/Homo_sapiens_assembly38.fasta | tr -d '[ATGCNatgcn\n]' | grep -o '.' | sort | uniq -c

is:

 2 \
  2 B
  1 F
  1 I
 15 K
  8 M
  2 O
 26 R
  4 S
 13 W
 33 Y
ADD REPLY
0
Entering edit mode

what devon said:

your fasta file is likely corrupted, since it has a \ in it as a base.

futhermore there is a 'O'.

I also suggest you have a look at this fasta file:

grep -n -A 5 -B 5 '/' ref.fasta
ADD REPLY
0
Entering edit mode

Can you share the link to download hg38gatkbundle reference genome? I am trying with the given link:

ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/Homo_sapiens_assembly38.fasta.gz

ADD REPLY
0
Entering edit mode

there is no problem with that resource. your data is corrupted.

$ curl   "ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/Homo_sapiens_assembly38.fasta.gz"  | gunzip -c | grep -v '^>' | tr -d '[ATGCNatgcn\n]' | grep -o '.' | sort | uniq -c

      2 B
      8 K
      8 M
     26 R
      4 S
     13 W
     33 Y
ADD REPLY
0
Entering edit mode
your data is corrupted

bam file is corrupted?

ADD REPLY
0
Entering edit mode

your data is corrupted

try to understand what I've done in my previous comment please C: GATK3 error message

ADD REPLY
0
Entering edit mode

I ran the above command and got same output. I could not find any file in my working directory, except the output as shown above. I am unable to understand. Kindly, explain

ADD REPLY
0
Entering edit mode
6.5 years ago

Yes, your fasta file is likely corrupted, since it has a \ in it as a base.

ADD COMMENT

Login before adding your answer.

Traffic: 2133 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6