Entering edit mode
22 months ago
LayneSadler
▴
90
When working with whole exome sequencing data, how many basepairs should I expect as padding for a given gene: 0, 100, 1000? For example, would the upstream transcription factor binding site for each gene be included?
Hi Prash. I am the receiver of the BAM files from an old biobank, so I am not generating the data myself.
https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/DNA_Seq_Variant_Calling_Pipeline/#pre-alignment
Ok Kermit. Pl ask your service provider about the chemistry. From BAM and sorted, you could as well but the alignment with reference matters as well.