Entering edit mode
6.1 years ago
kamel
▴
70
I looked at some links of GATK on pacbio data but I can not understand that they are the parameters that must be used on
-variant_index_parameter , -stand_emit_conf , -stand_call_conf
during the base calling by haplotypecaller.
Also for VariantFiltration of SNP and Indel
-filterExpression 'QD || FS || MQ || HaplotypeScore || MappingQualityRankSum || ReadPosRankSum
Could you give me an example of the base call on pacbio data with the values of these parameters.
I'm not really sure exactly what you're asking, but I'm happy to try to help out here. Are you looking recommendations for GATK HaplotypeCaller and VariantFiltration parameters for PacBio data? What type of PacBio data are you using, and how was it generated?
I want to know what values should be used for these parameters on pacbio sequel data (these are whole genome data from a haploid fungus). For example on the illumina data I use the values below:
For Base calling
For SNP filtration
For Indel Filtration
I have no idea about the values that should be used for these parameters on pacbio sequel data. Or do I use the same values?
Hello kamel ,
Please use the formatting bar (especially the
code
option) to present your post better. I've done it for you this time.Thank you!
Are these subreads, or CCS?
these are subreads generated by sequel
I have parameters for calling and filtering short variants with GATK HC on Q20+ CCS reads, but I haven't had much luck with raw subreads.