Variant calling from FASTA file
2
1
Entering edit mode
4.1 years ago
User000 ▴ 710

Dear all,

I have a fasta file contatining marker ID's:

>marker1
AGCTGGGTGTCGTTGATCTTCAGGTCCTTCTGGATGTACAGCGACGCTCCActcatcacg
ccgatgtcgtagccgaggaggatggaggtcatggaggcgag
>marker2
CTCGTCGGTTTCGACCACCATGTTGTCATGGGCATAGCATAGTGAACTCCACTCACAGCT
AAAAAGGAACGGTGCCTGCCTTCTTTTCAGTATCTAATTAC
>markern
CATCTAATTTGTTGGACGGGGATATGCTGGGAAAACGAAGAGCAAATCTGTGCCCACTGT
GCTGGTCGCTGCAGAATCTATTCCCCATTATGTTGCCTGTT

I would like to align these markers against the reference genome and find the position of variant for every single marker if there any:

#CHROM  POS ID  REF ALT
chr1    1101439 marker1 C   A
chr1    1104710 marker2 A   T
chr1    1104748 markern C   T

I am trying to use both bwa/bowtie to align and bcftools/gatk to call variants, but I do not know how to include also the marker ID in the vcf file? OR MAY BE THIS IS POSSIBLE BY PARSING THE BLAST OUTPUT?

bwa bowtie fasta vcf • 2.5k views
ADD COMMENT
2
Entering edit mode
4.1 years ago
JC 13k

if you want to add the markerID, it's better to align with Blast and parse the result to generate the custom VCF

ADD COMMENT
0
Entering edit mode

any hints how to do this? thanks

ADD REPLY
0
Entering edit mode

I will this and see if it works: Find Mismatch And Gap Positions

ADD REPLY
1
Entering edit mode
4.1 years ago
User000 ▴ 710

This wonderful program solved my problem (Lindenbaum, Pierre (2015): JVarkit: java-based utilities for Bioinformatics. figshare. http://dx.doi.org/10.6084/m9.figshare.1425030):

http://lindenb.github.io/jvarkit/BlastNToSnp.html

ADD COMMENT

Login before adding your answer.

Traffic: 2550 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6