Question

How to convert FASTA into Iso-Seq BAM?

3.6 years ago

JZX • 0

We know that raw Iso-Seq subreads come in BAM format, which just stores the sequences and can be used to run ccs, lima, and cluster.

But data from the NCBI SRA database are in FASTA/FASTQ format, and I don't know how to process them. These FASTA/FASTQ reads still contain polyA and primer sequences.

I want to remove the primer sequences and orient the reads in the 5′-3′ direction, just as lima does.

Thanks a lot!
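For readers with the same problem, the primer removal and 5′-3′ orientation described above can be sketched in plain Python. This is only a minimal illustration, not a replacement for lima: it assumes exact primer matches at both read ends, whereas lima scores alignments and tolerates errors. The primer sequences `PRIMER_5P` and `PRIMER_3P` are hypothetical placeholders and must be replaced with the ones from the actual library prep; the function names are likewise made up for this example.

```python
import io

# Hypothetical placeholder primers (NOT real kit sequences); replace with your own.
PRIMER_5P = "ACGTAC"   # expected at the start of a correctly oriented read
PRIMER_3P = "GGTTCC"   # expected at the end of a correctly oriented read


def revcomp(seq):
    """Return the reverse complement of a DNA sequence."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


def read_fasta(handle):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:].strip(), []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def orient_and_trim(seq, p5=PRIMER_5P, p3=PRIMER_3P):
    """Return (insert, strand) with primers removed and the read in 5'-3'
    orientation, or None if the primers are not found at both ends.
    Exact matching only: an assumption made to keep the sketch short."""
    for strand, cand in (("+", seq), ("-", revcomp(seq))):
        long_enough = len(cand) >= len(p5) + len(p3)
        if long_enough and cand.startswith(p5) and cand.endswith(p3):
            return cand[len(p5):len(cand) - len(p3)], strand
    return None


if __name__ == "__main__":
    # Tiny in-memory demo; a real run would open the FASTA file instead.
    demo = io.StringIO(">read1\nACGTACTTTTAAAACCCCGGTTCC\n>read2\nAAAA\n")
    for name, seq in read_fasta(demo):
        print(name, orient_and_trim(seq))
```

Reads for which the function returns `None` would need a more tolerant search (for example, allowing mismatches), which is exactly the part lima handles and this sketch does not.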

Iso-seq subreads • 3.0k views

ADD COMMENT • link updated 3.1 years ago by ttian627 • 0 • written 3.6 years ago by JZX • 0

score 0 · Answer 1 · 2021-05-30

3.6 years ago

GenoMax 148k

Use the pipeline that PacBio provides.

ADD COMMENT • link 3.6 years ago by GenoMax 148k

However, the input file of PacificBiosciences/IsoSeq is in BAM format. It doesn't say anything about how to handle the FASTA format.

ADD REPLY • link 3.6 years ago by JZX • 0

Sometimes submitters will submit the original PacBio BAM/BAX files. You can take a look at the "Original data" tab of the accession you are looking at. If you post the accession number(s) here, I can take a look as well.

ADD REPLY • link 3.6 years ago by GenoMax 148k

Yes, I noticed that the original format is listed at the end of the SRA run browser.

In addition, I found that an early version of the SMRT Analysis pipeline (smrtanalysis-2.3.0) seems to process input files in FASTA format.

Thanks a lot!

ADD REPLY • link 3.6 years ago by JZX • 0

I'm also curious about converting FASTA to BAM. Is there a proper way? I see lots of folks trying to convert BAM to FASTA (in which case you could use samtools), but I built my de novo transcriptome assembly with Trinity, and it's in FASTA format. However, I'm interested in using BRAKER2 to train AUGUSTUS, and it requires the RNA-seq data as BAM. Any ideas?
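At the level of file format only, FASTA records can be written out as unaligned SAM records (flag 4, no reference, no CIGAR), which `samtools view -b` can then turn into BAM. The sketch below shows that container change and nothing more; the function names are made up for illustration. Two caveats apply. First, an unaligned BAM is not the same as what BRAKER2 wants, which is RNA-seq reads aligned to the genome, so an aligner would still be needed and is not shown here. Second, PacBio tools generally expect PacBio-specific header fields and per-read tags that this sketch does not produce.

```python
def read_fasta(handle):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # SAM read names cannot contain whitespace: keep the first token.
            fields = line[1:].split()
            name, chunks = (fields[0] if fields else "unnamed"), []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def fasta_to_unaligned_sam(fasta_lines):
    """Yield SAM lines describing each FASTA record as an unmapped read."""
    yield "@HD\tVN:1.6\tSO:unsorted"
    for name, seq in read_fasta(fasta_lines):
        # Columns: QNAME FLAG RNAME POS MAPQ CIGAR RNEXT PNEXT TLEN SEQ QUAL
        # FLAG 4 = segment unmapped; "*" = value not available (no qualities in FASTA).
        yield "\t".join([name, "4", "*", "0", "0", "*", "*", "0", "0", seq, "*"])


if __name__ == "__main__":
    demo = [">transcript1 len=6", "ACGT", "AC", ">transcript2", "GG"]
    for sam_line in fasta_to_unaligned_sam(demo):
        print(sam_line)
```

Writing these lines to a file and running `samtools view -b` on it would give an unaligned BAM. Whether a downstream tool accepts it depends on that tool's own requirements.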