Improving Transcriptome Sequence With Illumina Reads
1
0
Entering edit mode
11.8 years ago
xmix ▴ 40

Hello Biostars,

I wonder whether there are tools that take an existing genome / transcriptome assembly (fasta), and correct it with short-sequences data (or with SAM/BAM files from that kind of data). Even a tool that will correct mismatches and short indels will be of use by consensus calling. I downloaded iCorn, but read that it requires mate-pair data, which I do not have. Are there alternatives to a samtools / vcf2fq combination?

vcf consensus • 2.3k views
ADD COMMENT
0
Entering edit mode

It sounds like you're looking for SNPs - small differences between the reference genome and your read,s or I misunderstood your question.

ADD REPLY
0
Entering edit mode

Most of them are not true SNPs but errors that are the result of imperfect assembly. The most annoying errors are "frame-shifts" due to error in the assembly of homopolymers.

ADD REPLY
0
Entering edit mode

So tools for finding SNPs should be helpful here

ADD REPLY
0
Entering edit mode

How come you don't want to use samtools?

ADD REPLY
1
Entering edit mode
11.8 years ago
SES 8.6k

You may want to try SEQuel which does not require mate pair information (see the publication for more details) but it does require a fasta reference, not SAM/BAM. I came across this tool recently and haven't had a chance to try it, so I'd be interested in the results.

ADD COMMENT

Login before adding your answer.

Traffic: 2345 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6