It seems to me that doing some NGS on this project would be like buying a bazooka to kill a fly. You will probably end up paying a large amount of money, and getting way too much coverage for your purpose. You might think that a huge coverage is not a problem, but it is for many assemblers. So you would probably then randomly pick 10% of your data to do your mapping... it's your call.
Having said that, I'll try to answer some of your questions:
- Sequence using primer walking/traditional Sanger sequencing (is the output format from AB 3730s FASTQ nowadays?)
The ABI Sanger 3730 outputs .ab1 files (chromatograms) which contain the trace values. You can then base-call and quality-call these, say with 'phred' for instance. You could then remove low quality bases at the beginning and end of sequences with 'ttuner' if needed.
I have no experience with primer walking, but as I read you have a reference genome, wouldn't it be better to design primers along your 22kb region, so that you can sequence everything (with redundancy and without worries on the length of your fragments) and then assemble these reads together?
-
Align each sample separately to genome (BWA-SW okay for these long reads?)
-
Call SNPs/Indels separately for each sample (SAMtools or GATK - is the GATK workflow overkill for long, accurate reads like these?)
-
Look for SVs (SVDetect or Breakdancer?)
-
Annotate known/novel/consequence etc (Annovar or something similar)
All these tools are NGS-aimed tools. Although 'bwa-sw' (and others) should be fine for these long reads, I would suggest, like #darked89, to use tools that are more Sanger-focused such as Staden/gap4 or MIRA/gap4.
I hope the 22kb is the genomic size, not the size of exons you want to screen.
22 kb is the full size of the gene - introns and exons. The sequence provider seems to think Sanger is the best option (the project is tied to one gene - it can't go beyone that for reasons of consent) but if there's a better option I'd be glad to hear it.
22 kb is the full size of the gene - introns and exons. The sequence provider seems to think Sanger is the best option (the project is tied to one gene - it can't go beyond that for reasons of consent) but if there's a better option I'd be glad to hear it.