Question

Error using sequenza-utils with WES

0

Entering edit mode

17 months ago

Camilo Andres ▴ 40

Hi! I'm trying to calculate the homologous recombination deficiency score of a cell line (MDA-MB-231) using whole exome sequencing data. To do this, I pretend to use the scarHRD package in R, but I first need a "*.seqz.gz" archive, which is made using sequenza-utils, but I have the following error message when I run this tool:

[mpileup] 1 samples in 1 input files
[mpileup] 1 samples in 1 input files
Traceback (most recent call last):
  File "/Users/camilo/micromamba/envs/bioinfo/bin/sequenza-utils", line 33, in <module>
    sys.exit(load_entry_point('sequenza-utils==2.1.9999b1', 'console_scripts', 'sequenza-utils')())
  File "/Users/camilo/micromamba/envs/bioinfo/lib/python3.8/site-packages/sequenza/commands.py", line 39, in main
    modules[args.module](subparsers, args.module, extra, log)
  File "/Users/camilo/micromamba/envs/bioinfo/lib/python3.8/site-packages/sequenza/programs/seqz_binning.py", line 35, in seqz_binning
    out.write(next(bins))
  File "/Users/camilo/micromamba/envs/bioinfo/lib/python3.8/site-packages/sequenza/seqz.py", line 208, in binned_seqz
    yield format_seqz_window(window_i) + replace_gc(
  File "/Users/camilo/micromamba/envs/bioinfo/lib/python3.8/site-packages/sequenza/seqz.py", line 157, in format_seqz_window
    int(round(window[2] / window[6], 0)),
ZeroDivisionError: division by zero

The code is as follows: ` $ sequenza-utils bam2seqz -gc hg38/hg38.wig.gz --fasta hg38/hg38.fa -n MCF10A/SRR3090727_recall_reads.bam -t MDA-MB-231/SRR1021654_recall_reads.bam | sequenza-utils seqz_binning -w 50 -s - | gzip > tumor_small.seqz.gz

I am quite new to this kind of analyses, so I am sorry if it is a very simple issue to solve.

Thanks in advance

WES NGS Sequenza Sequenza-utils • 1.2k views

ADD COMMENT • link 17 months ago by Camilo Andres ▴ 40

0

Entering edit mode

A quick look at the sequenza-utils cookbook does not show examples of pipe you are using. Do you know if this piping works? You may want to try doing this in two separate steps. Instead of plain gzip you should probably use bgzip.

ADD REPLY • link 17 months ago by GenoMax 149k

0

Entering edit mode

This pipeline is the one recommended by scar HRD, which is the package I pretend to use aftewards. I only removed the chromosome specifying part to see whether it worked that way, but it didn't, just got the same error. I will try using your suggestion. Thanks.

PS: I just tried using bgzip, but didn't work.

ADD REPLY • link 17 months ago by Camilo Andres ▴ 40

0

Entering edit mode

I only removed the chromosome specifying part to see whether it worked that way,

What does this mean?

Can you try doing this in 2 steps:

sequenza-utils bam2seqz -gc hg38/hg38.wig.gz --fasta hg38/hg38.fa -n MCF10A/SRR3090727_recall_reads.bam -t MDA-MB-231/SRR1021654_recall_reads.bam  --output sample.seqz.gz

followed by

sequenza-utils seqz_binning -w 50 -s --seqz sample.seqz.gz -o sample_bin50.seqz.gz

ADD REPLY • link 17 months ago by GenoMax 149k

0

Entering edit mode

Thanks, this make me notice that is the first part of the pipeline that was giving me the error... "Segmentation fault: 11" So, I will look for solving this issue now. Thanks.

ADD REPLY • link 17 months ago by Camilo Andres ▴ 40