Long-read alignment with Minimap2
1
0
Entering edit mode
5.5 years ago
gokberk ▴ 90

Hi all,

I want to align the coding sequence of a gene (which I cropped out from hg38) to genomic read contigs of a non-human ape. All contigs are in a single fasta file (2.8 GB) and the length of the coding sequence that I want to align to it is ~400 bp. I've been using Minimap2, here is my command:

minimap2 -ax map-ont ape_genome_contigs.fasta human_gene_coding_seq.fa > aln.sam

And here is the output of this command:

goekberk@bonobo:/disk1/goekberk$ minimap2/minimap2 -ax map-ont ape_genome_contigs.fasta human_gene_coding_seq.fa.fa > aln.sam
[M::main::11.782*1.00] loaded/built the index for 2771 target sequence(s)
[M::mm_mapopt_update::13.971*1.00] mid_occ = 630
[M::mm_idx_stat] kmer size: 15; skip: 10; is_hpc: 0; #seq: 2771
[M::mm_idx_stat::15.343*1.00] distinct minimizers: 99452876 (39.18% are singletons); average occurrences: 5.363; average spacing: 5.304
[M::main] Version: 2.17-r943-dirty
[M::main] CMD: minimap2/minimap2 -ax map-ont ape_genome_contigs.fasta human_gene_coding_seq.fa.fa
[M::main] Real time: 16.021 sec; CPU: 16.022 sec; Peak RSS: 7.437 GB

However, for some reason, sam files that I generate have 0 aligments.

$ samtools view -c aln.sam
0

I tried map-pb and asm10 options as well as minimap2 -a [-x preset] target.mmi query.fa > output.sam command but could not manage to have a successful alignment. I was wondering if I'm missing something crucial about the long-read alignments to a set of genomic contigs. Any help is much appreciated.

Cheers, Gökberk

minimap2 long-read DNA-seq • 8.0k views
ADD COMMENT
2
Entering edit mode

Have you tried using blast+?

ADD REPLY
0
Entering edit mode

I tried that but since the genome is about 2.8 GB, I could not manage to upload it and received errors even though I tried multiple times.

ADD REPLY
1
Entering edit mode

You should try this with a local install of blast+. Even blat may be worth a try. No indexing needed.

ADD REPLY
2
Entering edit mode
5.5 years ago
gokberk ▴ 90

I guess my query sequence was not formatted as a proper fasta file, I fixed it and problem is solved now, sorry for bothering :).

ADD COMMENT

Login before adding your answer.

Traffic: 1927 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6