A question about reference genome for creating the consensus sequence
1
0
Entering edit mode
7 months ago
me • 0

Hi, I wonder how the samtools consensus work without explicitly pointing out the reference genome. If I intend to add a reference genome to generate the consensus sequence, is it possible based on samtools? Thanks a lot.

Reference: https://www.htslib.org/doc/samtools-consensus.html

fasta fa cram genome • 658 views
ADD COMMENT
0
Entering edit mode
7 months ago
ATpoint 86k

It is the consensus for all reads that overlap a given base, it's not a genome consensus unless your reads cover the entire genome.

ADD COMMENT
0
Entering edit mode

Hi, yes, my cram files cover the whole genome, which is actually individual genome information. I wonder how to adjust the base genome. Thanks.

ADD REPLY
0
Entering edit mode

You aligned your data, that is your reference genome to compare against. If you want to change reference then realign.

ADD REPLY
0
Entering edit mode

Hi, thanks for your information. I am a bit confused now. Does this mean the cram file already contained reference genome information? I intend to generate a new fasta file based on the cram file, so that means I only need to consider the consensu sequence. Is it correct? Thanks.

ADD REPLY
0
Entering edit mode

I wonder how to adjust the base genome.

If you are looking to make changes to the reference you used for alignment then look into https://gatk.broadinstitute.org/hc/en-us/articles/360037594571-FastaAlternateReferenceMaker

ADD REPLY

Login before adding your answer.

Traffic: 1906 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6