Entering edit mode
5.4 years ago
xiaoleiusc
▴
140
Dear Biostars forum users,
I tried to process my PAR-CLIP dataset with PARalyzer version 1.5 ( https://ohlerlab.mdc-berlin.de/software/PARalyzer_85/ ). While I could start running the program without any problem, after several hours I always got an error as below:
Running PARalyzer v1.5
Parsing SAM file(s)...Done
Creating Read Groups & Clusters...Exception in thread "main" java.lang.Exception: * is not found in the .2bit file
at MyTwoBitParser.loadChromosome(MyTwoBitParser.java:58)
at MyTwoBitParser.getSequence(MyTwoBitParser.java:118)
at PARalyze.main(PARalyze.java:204)
I generated by 2bit file by faToTwoBit tools (e.g. faToTwoBit hg19.fasta hg19.2bit).
I really appreciate any input to solve this issue.
Best,
Xiao
In SAM/BAM files, typically
*
is used as chromosome name to indicate unmapped reads. Can you share the command line?strong textHi, ATpoint,
Thanks a lot for your reply. I am sorry for my late reply to your question (I am new here and I did not get email notification somehow for messages). I share my command line which leads to error as below:
My 2bit file was generated by Fatotwobit tool of human hg19, I did not use filter file and my .ini file is as below:
Regards,
Xiao
Hi, ATpoint,
I really appreciate your stimulating input! I found that I have to use sam files with only mapped reads as input in the Paralyzer to make it work. I need to do samtools view -b -F 4 input.bam > output_mapped.bam to generate bam files with only mapped reads and then convert the bam to sam file by samtools view -h output_mapped.bam > output_mapped.sam to generate sam file that is working with Paralyzer. As you mentioned that in SAM/BAM files, typically * is used as chromosome name to indicate unmapped reads. This is likely giving me a problem in my Paralyzer run. My Paralyzer runs well with sam files with only mapped reads!
I had the same issue and this solved it. Even though I ran Bowtie with --no-unal, I guess there were still some unmapped reads in there.
Did you get the same error with the pre-built filter files provided on this tool's webiste?
Hi, Sej,
I found that I have to use sam files with only mapped reads as input in the Paralyzer to make it work. I need to do
to generate bam files with only mapped reads and then convert the bam to sam file by
to generate sam file that is working with Paralyzer.
Thanks for your input.
Xiao
Hi, Xiao
When I ran PARalyzer, it didn't work with "=COLLAPSED" at the end of SAM_FILE lines but only work without "=COLLAPSED". I collapsed the fastq files with both fastx_toolkit and CIMS/fastq2collapse.pl but both didn't work.
Did you have any experience about this problem ?
Best, Seokju
Hi, Seokju,
Sorry for the late reply but I just saw your message today. I did not include
=COLLAPSED
in my PARalyzer ini file. I use fastx_toolkit to collapse reads. Here is one of my ini file for PARalyzer below: